Journal of Political Economy 


Volume 82 Number 1 January/February 1974




JOURNAL OF 

POLITICAL ECONOMY 


Edited by 

HARRY G. JOHNSON and GEORGE J. STIGLER

ROBERT BARRO, Associate Editor
In cooperation with OTHER MEMBERS of the DEPARTMENT OF
ECONOMICS and the GRADUATE SCHOOL OF BUSINESS
of the UNIVERSITY OF CHICAGO

(SEE INSIDE BACK COVER) 

/Jale-VS. WELCH, Editorial Assistant 


The Journal of Political Economy is published bimonthly in February, April, June, August,
October, and December by The University of Chicago Press. Subscription rates, U.S.A.:
institutions, 1 year $20.00, 2 years $38.00, 3 years $55.00; individuals, 1 year $15.00,
2 years $28.00, 3 years $40.00. Student subscription rate, U.S.A.: 1 year $9.00 (letter from
professor must accompany subscription). Other countries add $1.00 for each year's
subscription to cover postage. Single copy rates: institutions $4.00, individuals $3.00.
Supplements: institutions $4.50, individuals $3.50. Back issues are available from 1962
(vol. 70). Make all remittances payable to Journal of Political Economy, The University of
Chicago Press, in United States currency or its equivalent.

Claims for missing numbers should be made within the month following the regular month
of publication. The publishers expect to supply missing numbers free only when losses have
been sustained in transit and when the reserve stock will permit.

Business correspondence should be addressed to The University of Chicago Press, 5801 Ellis
Avenue, Chicago, Illinois 60637.

Letters to the editors and manuscripts should be addressed to the Editor of the Journal
of Political Economy, 1126 East 59th Street, Chicago, Illinois 60637. Manuscripts should be
submitted in duplicate (3 copies if possible), accompanied by a $10.00 submission fee made
payable to the Journal. Accepted manuscripts must be typed according to the University of
Chicago Manual of Style. References should be typed double-spaced at the end of the article.
Footnotes should be numbered in sequence and double-spaced following the references. Tables
should follow the footnotes. Originals of the figures, drawn in india ink, should be
submitted if the manuscript is accepted. Abstracts not exceeding 100 words should be
submitted in duplicate along with the manuscript.

Applications for permission to quote from this journal should be addressed to The University
of Chicago Press and will be freely granted. Reproduction of articles requires written
permission from the publisher and the author.

Reprinted volumes 1-72 available from Walter J. Johnson, Inc., 111 Fifth Avenue, New York,
New York 10003. Volumes available in microfilm from University Microfilms, 300 North Zeeb
Road, Ann Arbor, Michigan 48106; in microfiche from J. S. Canner & Co., 49-65 Lansdowne
Street, Boston, Massachusetts 02215.

Notice to subscribers: If you change your address, please notify us and your local postmaster
immediately, giving both your old and your new address. Allow four weeks for the change.

Second-class postage paid at Chicago, Illinois, and at additional mailing office.
© 1974 by the University of Chicago. All rights reserved.
ISSN 0022-3808


PRINTED IN U.S.A.



Journal of Political Economy 


Volume 82 Number 1 


January/February 1974 


Articles 

1 Benefit-Cost Analysis and Trade Policies
Partha Dasgupta and Joseph E. Stiglitz

34 Hedonic Prices and Implicit Markets: Product Differentiation in
Pure Competition
Sherwin Rosen

56 A Diagrammatic Exposition of the Harberger Model with One
Immobile Factor
Charles E. McLure, Jr.

83 An Economic Theory of Suicide
Daniel S. Hamermesh and Neal M. Soss

99 John Maynard Keynes: Scientist or Politician?
Elizabeth Johnson

115 Group Segregation and Optimal Jurisdictions
Martin McGuire

133 Stability and the Quantity Theory
P. W. Howitt

Communications

152 The Interrelation between Labor and Capital Components of
Racial Income Differences
George M. von Furstenberg

163 An Optimal Program for Managing Common Property Resources
with Congestion Externalities
Gardner Brown, Jr.

174 Cognitive Range in the Theory of Revealed Preference
Anthony Y. C. Koo and Peter Schmidt

180 Consumers' Durable Goods, Human Capital, and the Pareto
Optimality of the Lange-Lerner State
Michael R. Darby

185 A Test of the Permanent-Income Hypothesis
Comment by James M. Holmes
Reply by Prem S. Laumas

199 "Preferences, Separability, and the Patinkin Model": A Comment
Eitan Berglas and Assaf Razin

202 A Paradoxon of the Market Mechanism
Gerhard O. Orosel

Miscellany

210 Drug-Law Enforcement: Should We Arrest Pushers or Users?
Billy J. Eatherly

215 Puzzles and Problems

217 Book Reviews

Robert A. Mundell, Monetary Theory: Inflation, Interest, and Growth
in the World Economy
Don Patinkin

Trout Rader, Theory of General Economic Equilibrium
James Quirk

Nathan Rosenberg, Technology and American Economic Growth
Lars G. Sandberg

Eugène Zaleski, Planning for Economic Growth in the Soviet Union,
1918-1932
R. W. Davies



Journal of Political Economy 


Volume 82 Number 2, Part I March/April 1974 


Articles 

225 Private Savings: Ultrarationality, Aggregation, and "Denison's
Law"
Paul A. David and John L. Scadding

251 The Cost of School Time, Foregone Earnings, and Human Capital
Formation
Donald O. Parsons

267 Advertising and Profitability: A Reappraisal
[...] Bloch

[...] Incentives and the Corporate Income Tax
[...]to

[...], Yield Curves, and Wage Settlements: An Empirical [...]
[...] Camay, Arie Melnik, and Abraham Subotnik

[...]retations of Departures from the Pareto Curve Firm-Size
[...]butions
[...] Ijiri and Herbert A. Simon

333 Soviet Growth in Absence of Centralized Planning: A Hypothetical
Alternative
Micha Gisser and Paul Jonas

353 Ghetto Employment and the Model Cities Program
Bennett Harrison

373 Optimum City Size: The External Diseconomy Question
J. V. Henderson

Communications

389 A Conditional Logit Model of Occupational Choice
Michael J. Boskin

399 Occupational Licensing and the Public Interest
Alex Maurizi

414 On Equalizing the Distribution of Political Income
Dennis C. Mueller, Robert D. Tollison, and Thomas D. Willett

423 Some Principles of Inflationary Finance
Donald A. Nichols

431 A Bias in Current Measures of Economic Growth
Milton Friedman

Miscellany

433 Human Migration: A Survey of Preclassical Literature
Robert Weinstein

437 Book Reviews

David M. Gordon, Theories of Poverty and Unemployment
Sherwin Rosen

Leonard J. Kirsch, Soviet Wages: Changes in Structure and
Administration since 1956
Murray Yanowitch

Michael J. Webber, Impact of Uncertainty on Location
Jacques Thisse



Journal of Political Economy 


Volume 82 Number 3 May/June 1974


Articles

443 International Capital Flows and Portfolio Equilibrium
Pentti J. K. Kouri and Michael G. Porter

469 Taxation and the Rates of Return from Some U.S. Asset Holdings
Abroad, 1960-1969
Herbert G. Grubel

489 Cattle as Capital Goods and Ranchers as Portfolio Managers: An
Application to the Argentine Cattle Sector
Lovell S. Jarvis

521 Rational Expectations and Bayesian Analysis
Richard M. Cyert and Morris H. DeGroot

537 Competitive Firm and the Theory of Input Demand under Price
Uncertainty
Raveendra N. Batra and Aman Ullah

549 Lifetime Earnings in a Professional Labor Market: Academic
Economists
George E. Johnson and Frank P. Stafford

Frank Knight in Retrospect

571 Frank Knight and the Tradition of Liberalism
Scott Gordon

578 Frank H. Knight as Economic Theorist
Arthur H. Leigh

Communications

587 Rational Response to the Money Supply
Richard Roll

598 The Honest Government's Guide to the Revenue from the Creation
of Money
Leonardo Auernheimer

607 Cardinal Welfare, Individualistic Ethics, and Interpersonal
Comparison of Utilities: A Note
Antonio Camacho and Jon Sonstelie

612 On Killing off the Market for Used Textbooks and the Relationship
between Markets for New and Secondhand Goods
H. Laurence Miller, Jr.

620 A Note on Income (Labor) Inequality: Income Tax Systems and
Human Capital Theory
Claude Montmarquette

626 A Note on Equilibrium in Monopolistic Competition
Yoram Peles

631 Optimal Replacement of Capital Goods in Early New England and
British Textile Firms: A Comment
David Denslow, Jr., and David Schulze

638 Optimal Replacement of Capital Goods in Early New England and
British Textile Firms: Reply
Jeffrey G. Williamson

641 A Note on the Stability of Metzler's Phase Diagram
Don Patinkin

645 Managerial Tenure under Private and Government Ownership in
the Electric Power Industry
Louis De Alessi

654 Utility Maximization Sufficient for Competitive Survival: Comment
Robert Piron

657 On the Economics of a University: Toward the Determination of
Marginal Cost of Teaching Services
Shlomo Brovender

Book Review

665 Philip Cagan, The Channels of Monetary Effects on Interest Rates
William Poole


Announcement from the Editors 

In order to expedite the flow of manuscripts and to improve the
quality of referee reports, the JPE is planning to initiate a policy of
paying referees (in monetary form). The financing of these payments
will require an increase in the submission fee, effective September 1,
1974, to $25.00. However, the fee will be refunded to
authors whose papers are accepted for publication in the Journal.



Journal of Political Economy 


Volume 82 Number 4 July/August 1974 


Articles 

669 An Econometric Evaluation of Sulfur Taxes
James M. Griffin

689 Searching for the Lowest Price When the Distribution of Prices Is
Unknown
Michael Rothschild

713 Karl Marx on the Economic Role of Science
Nathan Rosenberg

729 Advertising as Information
Phillip Nelson

755 Taxation and National Defense
Earl A. Thompson

783 Vertical Control with Variable Proportions
Frederick R. Warren-Boulton

Communications

803 The Risk Element in Occupational and Educational Choices:
Comment
John C. Hause

809 Purchasing-Power Parity Theory and Economic Reform in
Yugoslavia
Ryan C. Amacher and John S. Hodgson

817 Authoritarian versus Nonauthoritarian Approaches to Economic
Development
G. William Dick

829 Factor Proportions and Comparative Advantage in the Long Run:
Comment
Alan V. Deardorff

835 The Value of Changes in the Probability of Death or Injury
Michael Jones-Lee

851 A Reexamination of the Kinky Oligopoly Demand Curve
Walter J. Primeaux, Jr., and Mark R. Bomball

863 Voucher Plans, Voting Models, and the Efficiency of Local School
Finance
M. L. Eysenbach

873 Employer Costs and Discrimination: The Case of Baseball
James Gwartney and Charles Haworth

883 Friedman versus Tobin: A Comment
T. Norman Van Cott and Gary Santoni

Miscellany

887 The Economics of Brushing Teeth
Alan S. Blinder

892 Proxies and Dummies
Fritz Machlup

Review Article

893 The Cambridge-Cambridge Controversy in the Theory of Capital:
A View from New Haven: A Review Article
Joseph E. Stiglitz


Announcement from the Editors 

In order to expedite the flow of manuscripts and to improve the
quality of referee reports, JPE is planning to initiate a policy of
paying referees (in monetary form). The financing of these payments
will require an increase in the submission fee, effective September 1,
1974, to $25.00. However, the fee will be refunded to authors whose
papers are accepted for publication in the Journal.





Benefit-Cost Analysis and Trade Policies 


Partha Dasgupta 

London School of Economics and Trinity Hall, Cambridge

Joseph E. Stiglitz

Cowles Foundation, Yale University


This paper extends the theory of optimal taxation and government
production to open economies. Appropriate rules for project evaluation
and the determination of consumption, production, and trade taxes
under a variety of restrictions (e.g., less than 100 percent profit taxes,
government budget constraint, foreign exchange constraint) are derived.
Among the results are: (a) international prices should be used for
evaluating public projects, unless there is a government budgetary
constraint or there is a quota (this result does not require that tariff rates be
optimally chosen); (b) no tariff should be levied on intermediates and
only consumption taxes should be employed if there are 100 percent
profit taxes. If profits are not taxed at 100 percent, both consumption and
trade taxes should be employed.


1. Introduction 

1.1 Motivation

In this paper we address ourselves to two questions: (1) In an open
economy, in which the government is pursuing various trade and tax
policies, and where some of the policy instruments are optimally chosen
but the others are not, and in which, as a consequence, domestic price
ratios differ from international price ratios, what is the appropriate
relationship between international prices, domestic prices, and shadow
prices in benefit-cost analysis? (2) Under what conditions is it optimal
to impose taxes on imports and exports which are different from those
imposed on domestically produced commodities, and what is the optimal
tariff structure in these circumstances?

Research toward this paper was conducted while Dasgupta was at the Delhi School
of Economics, Delhi, and Stiglitz was at the Institute of Development Studies, Nairobi.
We have gained much from conversations with James Mirrlees and Amartya Sen.
Financial support from the Rockefeller Foundation, the National Science Foundation,
and the Ford Foundation for Stiglitz's research is gratefully acknowledged.

The motivation behind these questions is probably transparent. Much
of the recent discussion on benefit-cost analysis (in particular, Little and
Mirrlees [1969] and Dasgupta, Marglin, and Sen [1972]) has been aimed
at a detailed specification of the procedure that governments ought to
follow in choosing industrial projects in the public sector. But neither of
the two documents referred to above presents a fully articulated model
for arriving at the detailed recommendations. It is, therefore, of some
importance to check the kinds of circumstances under which these
recommendations are indeed appropriate.

The second set of questions that we are concerned with here, on the
other hand, is closely related to the recent literature on effective
protective rates (see, e.g., Corden 1966; Ruffin 1969). The interest in this
concept was presumably motivated by asking the question, What structure
of tariffs leaves undistorted the relative prices of different commodities
and factors of production? But the converse of the well-known proposition
that in general equilibrium it is only relative prices that are determined
is that, without changing relative prices, a (nonlump-sum) tax system
can raise no revenue. A more limited objective seems to have been the
search for a tariff system which will leave undistorted relative commodity
prices (but which will in general change relative factor prices and prices
of commodities relative to factors). But it is pertinent to ask why one should
be interested in a tariff structure that leaves relative commodity prices, or
even relative supplies of different commodities, unchanged. We know
that most taxation is distortionary. The relevant question is, If you must
have distortions, what is the best set of distortions to have?¹


1.2 The Open Economy

Some years ago Pigou (1947) attempted to extend to economies that trade
the basic principles of optimal taxation that Ramsey (1927) had earlier
described for economies that do not trade. Ramsey had, for instance,
argued that, in the case of independent demand and supply curves,² for
small government revenue, taxes ought to be proportional to the sum of
the inverse of the demand and supply elasticities.³ Since the supply
elasticity of foreign commodities is presumed in general to be greater
than that for domestically produced commodities, Ramsey's analysis
implied that, rather than impose a surtax, that is, impose a higher tax,
on imports than on domestically produced goods, one should tax imports
at a lower rate. But Pigou did not answer the question as to what ought
to be done for commodities which are both domestically produced and
imported. One would wish to know if the two activities ought to receive
differential treatment.

¹ These are questions in the theory of the "second best," a subject of much discussion
in recent years. Although the first extensive discussion of "second-best" welfare economics
was that of Meade (1955), the first real exercise in second-best welfare economics was
probably that of Ramsey (1927). This paper is an extension to an open economy of
the problems of optimal taxation discussed by Boiteux (1956), Diamond and Mirrlees
(1971), Stiglitz and Dasgupta (1971), Dasgupta and Stiglitz (1972), and Mirrlees (1972).

² That is, where the demand and the supply for a commodity depend only on the
price of that commodity (relative to, say, the price of labor).
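The inverse-elasticity rule attributed to Ramsey above can be illustrated with a small numerical sketch. This is only an illustration under stated assumptions, not a computation from the paper: the function name `ramsey_tax_rate`, the proportionality constant `k`, and the elasticity values are all hypothetical, and elasticities are entered as positive magnitudes.

```python
def ramsey_tax_rate(demand_elasticity, supply_elasticity, k=0.05):
    """Tax rate proportional to 1/e_d + 1/e_s (illustrative constant k)."""
    if demand_elasticity <= 0 or supply_elasticity <= 0:
        raise ValueError("elasticities must be positive magnitudes")
    return k * (1.0 / demand_elasticity + 1.0 / supply_elasticity)

# Hypothetical goods with the same demand elasticity but different supply
# elasticities: the imported good is assumed to be in more elastic supply.
domestic = ramsey_tax_rate(demand_elasticity=1.0, supply_elasticity=0.5)
imported = ramsey_tax_rate(demand_elasticity=1.0, supply_elasticity=5.0)

print(f"domestic good: {domestic:.3f}, imported good: {imported:.3f}")
```

With these made-up numbers the good in more elastic supply carries the lower rate, which is the direction of the argument in the text; the sketch says nothing about goods that are both imported and domestically produced.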

There are, moreover, some important differences between taxation
policy in closed economies and that in open economies, and it is not
immediately apparent, when proper account of this is taken, whether
Pigou's conclusions remain valid. Three differences come immediately to
mind.

1. For imported goods, the producer surplus which accrues to
producers of commodities which are not in perfectly elastic supply accrues
to nationals of other countries and, accordingly, does not affect domestic
welfare. For domestically produced goods it does.

2. Many countries face what is sometimes referred to as a "foreign
exchange constraint." This implies that the earnings from exports do not
measure their social value, and the cost of imports does not measure the
social cost. To what extent this should affect the rates at which different
imports should be taxed, and whether this should result in governments'
using, in benefit-cost analysis, prices that are different from those that
rule at the border are questions that are still often debated. To see what
is involved, consider an open economy in which there are no barriers to
trade or to the flow of capital. If $P_{it}$ is the international price of
commodity $i$ at time $t$, and $Z_{it}$ is the net import of commodity $i$ at $t$, then
trade balance requires only that

$$\sum_t \sum_i P_{it} Z_{it} = 0. \qquad (1.1)$$

A "foreign exchange" constraint imposes an additional condition in any
particular year, $t$; for instance, the deficit may not be larger than so much,
say, $e_t$. That is,⁴

$$\sum_i P_{it} Z_{it} \le e_t. \qquad (1.2)$$

This is equivalent to a constraint on borrowing. It is
binding if and only if the country would like to borrow more than it can,
which in turn implies that there is a constraint on the supply of investible
funds (in other words, a savings constraint). If, as the analysis seems to
suggest, the savings and foreign exchange constraints are in fact closely
related, it is pertinent to ask whether it is fair to argue that the foreign
exchange constraint does not affect relative prices used in benefit-cost
analysis while the savings constraint does.⁵

³ Some of the assumptions implicit in Ramsey's analysis, e.g., the nontaxation of
profits, have recently been made explicit (see Stiglitz and Dasgupta 1971).

⁴ Perhaps a more realistic way of writing the constraint is that the cumulative deficit
at time $t$ be less than some number, $e_t$:

$$\sum_{\tau \le t} \sum_i P_{i\tau} Z_{i\tau} \le e_t.$$

This complicates the mathematics but does not affect the qualitative results given in
section 7.

3. Although most governments do use excise and specific commodity
taxes on domestic consumption to a limited extent, they are usually
justified in terms of distributional objectives (e.g., taxing luxuries, like
perfumes), in terms of merit wants (e.g., cigarettes and tobacco) or
externalities, or as benefit taxes (e.g., gasoline). On the other hand, almost
all countries impose extensive import duties at varying rates, and both the
commodities taxed at the border and the rates at which they are levied
have probably less to do with welfare considerations than with the
strength of various pressure groups.
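The contrast drawn in the second point, between overall trade balance and a year-by-year ceiling on the deficit, can be sketched numerically. The following is a minimal illustration, assuming two years and two commodities with made-up prices and net imports; the function names and the ceilings are hypothetical and are not taken from the paper.

```python
def yearly_deficits(prices, net_imports):
    """Deficit in each year t: the sum over commodities i of P_it * Z_it."""
    return [
        sum(p * z for p, z in zip(p_t, z_t))
        for p_t, z_t in zip(prices, net_imports)
    ]

def satisfies_trade_balance(prices, net_imports, tol=1e-9):
    """Overall balance: deficits summed over all years equal zero."""
    return abs(sum(yearly_deficits(prices, net_imports))) <= tol

def satisfies_fx_constraint(prices, net_imports, ceilings):
    """Year-by-year condition: each year's deficit is at most its ceiling e_t."""
    deficits = yearly_deficits(prices, net_imports)
    return all(d <= e for d, e in zip(deficits, ceilings))

# Hypothetical data: rows are years, columns are commodities.
prices = [[1.0, 2.0], [1.0, 2.0]]
net_imports = [[4.0, -1.0], [-4.0, 1.0]]  # deficit of 2, then surplus of 2

deficits = yearly_deficits(prices, net_imports)
balanced = satisfies_trade_balance(prices, net_imports)
tight_ok = satisfies_fx_constraint(prices, net_imports, ceilings=[1.0, 1.0])
loose_ok = satisfies_fx_constraint(prices, net_imports, ceilings=[3.0, 3.0])
```

In this example the plan balances over the two years but runs a first-year deficit of 2, so it violates a ceiling of 1 and satisfies a ceiling of 3. That is the sense in which the yearly condition restricts borrowing in addition to the overall balance requirement.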

To put the matter differently, the conventional planning literature
treats the "government" as if it were a single unit, the central planning
agency being the basic coordinating body. It simultaneously decides on
taxes, tariffs, and investment projects. Although this assumption has its
merits (it allows one to pose a number of questions in their most pristine
form), it is doubtful whether it makes for a good theory of government
action.⁶ To take an example, suppose that project X requires equipment
E as an input which it is optimal to import (since domestic production of
E is too costly). If E is in fact imported, then assume X is a desirable
project. But the government project evaluator may know that if X is
undertaken there will be powerful pressure groups at work forcing the
departments responsible for levying tariffs to raise the tariff on E so as to
enable private domestic producers to produce more of it domestically to
meet the augmented demand for it. What, then, should the decision be
with regard to X? It does not seem apparent to us, at least, whether,
in a closely interdependent economy, a nonoptimal tariff on one
commodity should affect the tariffs and shadow prices of other commodities.
Nevertheless, it has been argued that tariffs, whether optimally chosen
or not, should generally be neglected for the purpose of benefit-cost
analysis.⁷ One of the purposes of this paper is to investigate the kinds of
circumstances under which this argument is correct.


⁵ See Little and Mirrlees (1969) for a discussion of this.
⁶ See Sen (1972) for a lucid discussion of this range of issues.
⁷ See, in particular, Little and Mirrlees (1969).



In order to bring out the principles involved as clearly as possible, we
consider in the following sections a sequence of models, in each of which
the powers of the planner are somewhat more circumscribed than in the
previous. In section 2, the only constraint on the government is the
impossibility of lump-sum taxation. The government can impose 100
percent profit taxes and taxes on all production, consumption, and trade.
In section 3, we then consider the consequences of the government not
being able to impose 100 percent profit taxes. In section 4, we further
restrict the government in allowing it to impose only trade taxes. In
section 5, some of the trade taxes are assumed given, and quotas are
assumed to be imposed on some commodities, but the government must
still make decisions concerning the choice of investment projects and the
rates of tariffs on the remaining commodities. In section 6, the
consequences of trade policy partially dependent on project selection are
explored. In section 7, the consequences of a foreign exchange constraint
are considered, while in section 8 those of a budgetary constraint on the
government are investigated. In section 9, we indicate briefly how
distributional considerations may be integrated with our analysis.
Finally, in section 10, we summarize the major conclusions of the paper
in terms of a set of rules.⁸

Before setting out, we note three assumptions which will limit the
generality of our conclusions. (1) We shall assume that every commodity
is either tradeable at a fixed international price or not tradeable at all.
This not only simplifies the mathematics, but also allows us to separate
the tax revenue-welfare effects of concern here from the terms of trade
effects, which have been the focus of the optimal tariff literature so far.
(2) We shall assume that the only "distortions" in the economy are those
introduced by government tax policy; otherwise, prices of factors and
goods are competitively determined. Accordingly, there is, for instance,
no unemployment. The presence of other distortions, for example,
unemployment, undoubtedly would introduce discrepancies between
market prices and shadow prices, whether there were tariff distortions
or not.⁹ (3) For most of the analysis, we shall assume all individuals to
be identical. This means that we will not investigate the implications
for tariffs and shadow prices of the interaction between savings and
income distribution.

All of these raise important problems which we hope to pursue elsewhere.

⁸ The two authors who have come closest to discussing the issues on tariff policy
presented here are Meade (1955) and Ramaswami and Srinivasan (1968). The model
considered by Ramaswami and Srinivasan may be viewed as a special case of that
presented here, for they are concerned exclusively with an economy in which only
border taxes can be levied (a circumstance we discuss in section 4). Moreover, they do
not allow for the possibility of public production. Their conclusion that duties need
not be uniform in a second-best revenue tariff is reestablished here (section 4.1) and the
precise formulae for the optimum tariffs are given. For an excellent recent survey see
Bhagwati (1971).

⁹ For an analysis of this class of situations see Marglin (1972) and Stiglitz (1973,
in press).

2. The Fully Controlled Economy

In this section, we consider an economy in which there are no foreign
exchange constraints and no domestic distortions apart from
those which the government imposes. The only constraint
on the government is the unfeasibility of its employing
lump-sum taxes. The government's problem is to decide what taxes to impose
on domestic producers, what tariffs to levy on imports (both intermediate
and final commodities), what excise taxes to set on domestic consumption,
and which projects to undertake. It wishes to do this in such a way that all
markets clear, trade balances, and welfare is maximized.


2.1 Notation

We introduce the following notation:

$c_i$ = consumption of the $i$th commodity (for factors supplied, $c_i < 0$),¹⁰
$y_i$ = production of commodity $i$ by domestic private industry (for factors
used by private industry, $y_i < 0$),
$y_i^j$ = production of commodity $i$ by the $j$th private producer,
$q_i$ = consumption price of commodity $i$,
$p_i$ = producer price of commodity $i$. When different producers face
different prices for the same commodity, the price is denoted as $p_i^j$,
$z_i$ = net import of commodity $i$. If it is a net export, then $z_i < 0$,
$x_i$ = net output of commodity $i$ in the public sector. If it is a net input,
then $x_i < 0$,
$t_i$ = tax on the import of commodity $i$,
$\Gamma_i$ = tax on producers of commodity $i$ (when different producers face
different taxes, the tax is denoted by $\Gamma_i^j$),
$\tau_i$ = difference between the international price for commodity $i$ and the
domestic producer price.

Where there is no risk of ambiguity we shall denote a vector, say $(c_i)$,
simply by $c$. The scalar product of two vectors, say $q$ and $c$, will be
denoted by $q \cdot c$. Occasionally we shall denote by $[r_{ij}]$ a square matrix with
typical element $r_{ij}$.

¹⁰ These commodities may be time dated, so the results are valid for the intertemporal
case as well as the static case.

Not all commodities are tradeable, neither are all commodities
consumable. We shall denote by $A$ the set of all commodities in the economy;
by $T$ the set of all tradeable commodities; by $N$ the set of all nontradeable
commodities; by $D$ the set of all domestically produced commodities (some
of which still might be traded); by $I$ the set of all intermediate goods;
and by $C$ the set of all consumption goods. Thus $A = C \cup I = T \cup N$,
$C \cap I = \emptyset$, $T \cap N = \emptyset$, $N \subset D$. Our numeraire good will be indexed
as $i = 0$. For notational simplicity, we shall suppose, unless noted
otherwise, that it is at the same time traded, domestically produced, and
consumed. Our units are chosen so that the international price of every
traded commodity is unity.


2.2 Basic Relations

There are a few simple relations among the variables defined in section
2.1 which we set forth here. To begin with, the commodity balance
equations are

$$c_i = y_i + x_i + z_i, \qquad (2.1)$$

where it is understood that $z_i = 0$ for $i \in N$, $c_i = 0$ for $i \in I$, and
$x_i = y_i = 0$ for $i \notin D$.

The total output, $y$, from domestic private firms is

$$y_i = \sum_j y_i^j, \quad i \in D. \qquad (2.2)$$

Now domestic consumption prices equal international prices plus the
taxes on traded goods. We then have

$$q_i = p_i + \Gamma_i = 1 + t_i, \quad i \in C \cap T \cap D, \qquad (2.3a)$$

$$q_i = p_i + \Gamma_i, \quad i \in C \cap N, \qquad (2.3b)$$

$$q_i = 1 + t_i, \quad i \in C \cap T \text{ and } i \notin D, \qquad (2.3c)$$

$$p_i + \Gamma_i = 1 + t_i, \quad i \in T \cap D \cap I. \qquad (2.3d)$$

In other words,

$$\Gamma_i - t_i = \tau_i = 1 - p_i, \quad i \in T \cap D. \qquad (2.3e)$$

Thus, $t_i > 0$ for a tariff on an import or a subsidy on an export; $t_i < 0$
for a tariff on an export (i.e., an export tax) or a subsidy on an import;
$\tau_i > 0$ for a production tax in excess of the tariff; $-t_i < \tau_i < 0$ for a
production tax that is less than the tariff. If $\tau_i = -t_i$, domestic production
is neither taxed nor subsidized ($q_i = p_i$).
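The price relations for a traded, domestically produced consumption good can be checked with a short numerical sketch. It assumes, as in the text, an international price of unity; the function names `producer_price` and `wedges` and the tax values are hypothetical, chosen only to illustrate two of the sign cases described above.

```python
def producer_price(gamma, t):
    """Producer price implied by p + gamma = 1 + t (international price = 1)."""
    return 1.0 + t - gamma

def wedges(gamma, t):
    """Return (p, q, tau) for a traded, domestically produced consumption good.

    gamma is the tax on producers, t the tax on imports.
    """
    p = producer_price(gamma, t)
    q = p + gamma    # consumer price; equals 1 + t by construction
    tau = gamma - t  # gap between international and producer price; equals 1 - p
    return p, q, tau

# Case A (illustrative numbers): production tax above the tariff, so tau > 0.
p_a, q_a, tau_a = wedges(gamma=0.3, t=0.2)

# Case B: no production tax, so tau = -t and consumer price equals producer price.
p_b, q_b, tau_b = wedges(gamma=0.0, t=0.2)
```

In case A the producer price falls below the international price of one; in case B the tariff alone separates the domestic price from the border price, and consumers and producers face the same price.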



2.3 Consumer Behavior

Throughout the discussion (except in section 9) we shall suppose that all
individuals are identical. We shall, therefore, talk of the "representative"
individual. He is assumed to maximize his utility,

$$\max U(c), \qquad (2.4)$$

subject to his budget constraint,

$$q \cdot c = M, \qquad (2.5)$$

where $M$ is his income. If there are no lump-sum taxes or subsidies, then
$M$ is just equal to profits (pure rents) after tax. If $\pi^j$ denotes the net
profit of the $j$th firm, and $\tau_\pi$ denotes the tax rate on profits, then

$$M = \sum_j (1 - \tau_\pi)\pi^j = (1 - \tau_\pi)\pi, \quad \text{where } \pi = \sum_j \pi^j. \qquad (2.6)$$

The solution to (2.4) may be written as

$$V(q, M) = \max U, \qquad (2.7)$$

where $V$ is known as the consumer's indirect utility function.


2 i Behavior of Firm 

There arc m different private firms in the economy For simplicity, we 
assume that all firms face strictly concave production functions * ’ 

Ffy) = 0 j = ], , m (2 8 ) 

Thejth firm maximizes profits at the given price vector />•' 

max p-' y subject to F^{y) =0 (2 9 ) 

The solution to (2 9) is denoted by ^ Moreover, in equilibrium 

It will be the case that 




(2 9a) 


?hfdyi 

These equations can be solved for the supply of commodities (demands 
for factors) as a fumtion of ‘ ^ 

yl = yliP^) (2 9b) 

As we have shown elsewhere (Dasgupta and Stiglitz 1972), there is no important
loss of generality by assuming strict concavity. See Dasgupta and Stiglitz (1972) and
Mirrlees (1972) for a discussion of some possible anomalies.

It has become customary in the literature on general equilibrium to refer to $\pi^j$
as the net profit for the $j$th firm (see, in particular, Debreu [1959]), whereas, of course,
it denotes what has traditionally been referred to as “pure rents” accrued by the firm.
We stick to the modern usage here largely because it sounds more reasonable to say
that a firm is concerned with maximizing profits rather than its rents.

Note that, if we had a constant returns to scale function, then we could solve for
relative factor intensities, but not for total factor demands without knowing quantities
produced. See, however, Dasgupta and Stiglitz (1971).



BENEFIT-COST ANALYSIS 




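Indeed, it is a well-known result that

$$\frac{\partial \pi^j}{\partial p_i^j} = y_i^j. \qquad (2.9c)$$

One other fact will be of use. Differentiating the production constraint,
we obtain

$$\sum_k \frac{\partial F^j}{\partial y_k^j}\,\frac{\partial y_k^j}{\partial p_i^j} = 0. \qquad (2.9d)$$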
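
The two facts about the firm can be checked on a small example. The sketch below assumes, purely for illustration, a single firm producing output from labor with the strictly concave technology $y = 2\sqrt{l}$ (so net outputs are $(y, -l)$ at prices $(p, w)$); it takes the well-known result to be the envelope property $\partial \pi/\partial p_i = y_i$, and verifies both it and the price-weighted form of the differentiated production constraint, $\sum_k p_k\,\partial y_k/\partial p_i = 0$. The function names are hypothetical.

```python
def supply(p, w):
    """Profit-maximizing plan for the assumed technology y = 2*sqrt(l).

    The first-order condition p / sqrt(l) = w gives l = (p / w)**2.
    Returns (output, labor input).
    """
    labor = (p / w) ** 2
    return 2.0 * p / w, labor

def profit(p, w):
    """Maximized profit pi(p, w) = p*y - w*l at the optimal plan."""
    output, labor = supply(p, w)
    return p * output - w * labor

def check(p, w, h=1e-6):
    """Return three gaps that should all be close to zero.

    1. d(pi)/dp - y        (envelope result for the output)
    2. d(pi)/dw + l        (same result for the input, a negative net output)
    3. p*dy/dp - w*dl/dp   (price-weighted supply responses sum to zero)
    """
    dpi_dp = (profit(p + h, w) - profit(p - h, w)) / (2 * h)
    dpi_dw = (profit(p, w + h) - profit(p, w - h)) / (2 * h)
    output, labor = supply(p, w)
    dy_dp = (supply(p + h, w)[0] - supply(p - h, w)[0]) / (2 * h)
    dl_dp = (supply(p + h, w)[1] - supply(p - h, w)[1]) / (2 * h)
    return dpi_dp - output, dpi_dw + labor, p * dy_dp - w * dl_dp

if __name__ == "__main__":
    print(check(3.0, 2.0))
```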


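2.5 Trade Balance

The condition for the balance of trade may be written as

$$B \equiv \sum_{k \in T} z_k = \sum_{k \in T}\Bigl(c_k - \sum_j y_k^j - x_k\Bigr) = 0. \qquad (2.10)$$

Thus, we can view $B$ as a function of $q$, $p$, and $x$:

$$\frac{\partial B}{\partial q_i} = \sum_{k \in T} \frac{\partial c_k}{\partial q_i}, \qquad (2.10a)$$

$$\frac{\partial B}{\partial p_i^j} = y_i^j \sum_{k \in T} \frac{\partial c_k}{\partial \pi} - \sum_{k \in T} \frac{\partial y_k^j}{\partial p_i^j}, \qquad (2.10b)$$

$$\frac{\partial B}{\partial x_i} = -1, \quad i \in T. \qquad (2.10c)$$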


2.6 The Government

The government’s production function we shall denote by

$$G(x) = 0. \qquad (2.11)$$

As equation (2.11) indicates, we are tacitly supposing that the public
sector is concerned with the transformation of the same set of goods as the
domestic private sector is. We assume this simply for notational ease.

We are supposing, then, that there are no public goods in this economy. If we were
to assume that the public sector produces public goods as well we would obtain one set
of first-order conditions in addition to the sets of conditions obtained below. This
additional set would dictate the optimum supply of public goods. For a discussion of the
implications of such a condition see Diamond and Mirrlees (1971) and Stiglitz and
Dasgupta (1971). Our concern here is with obtaining shadow prices of private goods,
and it is trivial to show that the results derived here apply without modification to the
case where public goods are also produced by the government. For this reason we ignore
public goods here. Similarly, there may be several production units in the public sector.
But the result derived earlier in Boiteux (1956), Diamond and Mirrlees (1971), and
Stiglitz and Dasgupta (1971), that the public sector should always be productively
efficient, remains valid. To keep the notation simple we have therefore amalgamated
all public sector production units into one single unit.




We shall suppose that the government controls trade and private
production, but only indirectly, through its policies. The
problem before the planning agency is clear: it wishes to maximize
the welfare $V$ of the representative individual subject to the private sector
production possibility curves, the public sector production possibility
curve (2.11), the balance of trade condition (2.10), and the
market clearing equations. The government’s
controls are direct public production, tariffs, and consumption,
production, and profits taxes. But it is easy to establish (using [2.3]) that
controlling taxes and tariffs is essentially equivalent to controlling producer
and consumer prices, and it turns out that the latter are conceptually
and analytically easier to use. Similarly, although controlling $p$,
$q$, and $x$ determines $z$, so that the only constraint on the traded goods
is the balance of payments constraint (i.e., we do not need to impose
separate market clearing constraints on each of the traded commodities),
it is convenient to formulate the problem as if the $z_k$ were control variables
and impose the additional market clearing equation for each of the
traded goods. Finally, we note (see Stiglitz and Dasgupta 1971) that
optimality requires the government to set $\tau_\pi = 1$, provided the government
requires some resources to be raised by distortionary taxes. In view
of the fact that the analysis is trivial in those cases where the government
does not need to impose distortionary taxes to raise its revenue, we set
$\tau_\pi = 1$ whenever it is assumed to be feasible. For the remainder of this
section we suppose that it is.

Finally, we normalize by setting $q_0 = 1$ and $p_0^j = 1$ for all $j$.


In other words, the problem


max V[g. t(\ - rj] + ^ f Y (x, + - r.) 


and the problem 


+ T + V. - r,) + pC + X VjFH/} 


max ir(l-rj] 

{x,p,q,i] 


^ Y. + 2 pkizk + + j** - ffc) -f -I- y 

ke4 


are equivalent. When the outputs and inputs are viewed as functions of producer prices,
the production constraints (2.8) are already embedded in the problem (through 2.9),
and hence we could omit the term

$$\sum_j \nu^j F^j(y^j)$$

in our Lagrangian expression. In the centralized problem, where outputs and inputs
are considered as direct controls, this term must be retained. For purposes of symmetry
with the centralized problem, we retain the production constraints explicitly within our
Lagrangian expression.

For a discussion of these normalizations, see Dasgupta and Stiglitz (1972).




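2.7 Optimal Public Policy

We now form the Lagrangian of the problem:

$$\mathcal{L} = V(q, 0) - \lambda B + \mu G(x) + \sum_{j=1}^{m} \nu^j F^j(y^j) + \sum_{k \in A} \rho_k \Bigl(\sum_{j=1}^{m} y_k^j + x_k + z_k - c_k\Bigr). \qquad (2.12)$$

We obtain the first-order conditions

$$\frac{\partial V}{\partial q_i} - \sum_{k \in C} \rho_k \frac{\partial c_k}{\partial q_i} = 0, \quad i \in C \text{ and } i \neq 0, \qquad (2.13)$$

$$\sum_{k \in D}\Bigl(\nu^j \frac{\partial F^j}{\partial y_k^j} + \rho_k\Bigr)\frac{\partial y_k^j}{\partial p_i^j} = 0, \quad i \in D \text{ and } i \neq 0, \; j = 1, \ldots, m, \qquad (2.14)$$

$$\mu \frac{\partial G}{\partial x_i} + \rho_i = 0, \quad i \in D, \qquad (2.15)$$

and

$$-\lambda + \rho_i = 0, \quad i \in T. \qquad (2.16)$$

From equations (2.14)-(2.16) we obtain the now-familiar result

$$\frac{\partial G/\partial x_i}{\partial G/\partial x_0} = \frac{\partial F^j/\partial y_i^j}{\partial F^j/\partial y_0^j} = \frac{p_i^j}{p_0^j} = 1, \quad i \in T \cap D, \; j = 1, \ldots, m, \qquad (2.17a)$$

and

$$\frac{\partial G/\partial x_i}{\partial G/\partial x_0} = \frac{\partial F^j/\partial y_i^j}{\partial F^j/\partial y_0^j} = \frac{p_i^j}{p_0^j}. \qquad (2.17b)$$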


Equations (2.17a) and (2.17b) describe the desirability of overall production
efficiency for the economy. All production units in the economy
ought to use international prices in selecting their optimum techniques,
and there ought to be no taxes or tariffs on intermediate goods.

It is then apparent that, at least in this case, Pigou’s (1947) conclusion,
that domestic production ought to be taxed at a higher rate than imports,
is incorrect. No differentiation between domestic and foreign production
should be made.


We recall the convention adopted in discussing (2.1) that it is understood that
$z_i = 0$ for $i \in N$, $c_i = 0$ for $i \notin C$, and $y_i^j = 0$ and $x_i = 0$ for $i \notin D$.

The second equality follows from observing that, for given $j$, we have, with (2.9d),
as many equations of the form (2.14) as commodities; since we thus have a system of $n$
homogeneous equations in $n$ unknowns, we require $\nu^j\,\partial F^j/\partial y_k^j + \rho_k = 0$. See Dasgupta
and Stiglitz (1971) for a more detailed discussion.

This is the central result in Diamond and Mirrlees (1971). For a thorough
exploration of the application of this result see Little and Mirrlees (1969).




Turning to equation (2.13), it is now simple to show that it reduces to

$$\sum_{k \in C} \frac{t_k}{c_i}\left(\frac{\partial c_i}{\partial q_k}\right)_{\bar U} = -\theta, \quad i \in C. \qquad (2.18)$$

It follows from equation (2.18) that for small taxes the consumption of all
commodities is reduced by the same amount (from what it would have
been had producer prices been charged).


3. Limited Profit Tax on Private Firms


3.1 General Considerations

For a number of reasons, no government imposes a 100 percent tax on
profits. This requires a good deal of modification in the analysis. Now
profits enter directly into the utility function, and changes in profits
affect the demands for various goods. For simplicity, we consider the


To confirm the validity of equation (2.18) one uses (2.3a), (2.3b), (2.16), and
(2.17b) in equation (2.13), and noting that $\partial V/\partial q_i = -c_i(\partial V/\partial M)$, one obtains


— c, ciV 

~rdM' 


I (v.-Op- 

ketTn 7 Ji€ 


y (7.-r.)^ = o 

c^N ^gt 


I e C 


On using Slutsky’s equation, the fact that the Slutsky matrix is symmetric, and the fact
that


= 0 , 

ker ^VQ,/n 


the foregoing equation reduces, on collecting terms, to the form 


ke 



keCnN ^^gj v 


BVfdM 

X 


+ 


km^n N 



which immediately yields equation (2.18).

Note the difference between this result and the underlying notions of the effective
tariff literature. There, the focus is on a tariff structure which reduces production of
each commodity by an equal percentage from the pretax situation, and the analysis
often ignores the change in factor prices of nontraded inputs; here, we reduce consumption
by an equal percentage from what it would have been with the new producer
prices; for traded goods, the only change in production results from the change in
prices of nontraded inputs.





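polar case where no tax on profits can be imposed. The intermediate cases
are trivial to work out.

The Lagrangian now takes the form

$$\mathcal{L} = V\Bigl[q, \sum_{j=1}^{m}\pi^j(p^j)\Bigr] - \lambda\sum_{k\in T} z_k + \mu G(x) + \sum_{j=1}^{m}\nu^j F^j(y^j) + \sum_{k\in A}\rho_k\Bigl(\sum_{j=1}^{m} y_k^j + x_k + z_k - c_k\Bigr). \qquad (3.1)$$

The first-order conditions now read as

$$\frac{\partial V}{\partial q_i} - \sum_{k\in C}\rho_k\frac{\partial c_k}{\partial q_i} = 0, \quad i \in C, \; i \neq 0, \qquad (3.2)$$

$$\frac{\partial V}{\partial M}\frac{\partial \pi^j}{\partial p_i^j} + \sum_{k\in D}\Bigl(\nu^j\frac{\partial F^j}{\partial y_k^j} + \rho_k\Bigr)\frac{\partial y_k^j}{\partial p_i^j} - \sum_{k\in C}\rho_k\frac{\partial c_k}{\partial M}\frac{\partial \pi^j}{\partial p_i^j} = 0, \quad j = 1, \ldots, m, \; i \in D \text{ and } i \neq 0, \qquad (3.3)$$

$$\mu\frac{\partial G}{\partial x_i} + \rho_i = 0, \quad i \in D, \qquad (3.4)$$

$$-\lambda + \rho_i = 0, \quad i \in T. \qquad (3.5)$$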


3.2 All Commodities Tradeable

Consider, for simplicity, the case where all goods are tradeable. Using
equation (3.5), we can write equation (3.2) as

$$\frac{\partial V}{\partial q_i} - \lambda\sum_{k\in C}\frac{\partial c_k}{\partial q_i} = 0, \quad i \in C \text{ and } i \neq 0. \qquad (3.2a)$$

Equation (3.2a) has an immediate interpretation. By raising tariffs (consumer
prices), we improve the balance of payments by discouraging
imports. The “cost” of this is, of course, a decrease in utility: higher prices
for consumer goods make one worse off. There is, therefore, a trade-off
between “trade surplus” and welfare. Equation (3.2a) reflects the fact
that the marginal rate of substitution between the two should be the same
regardless of which price we vary, that is, the loss in welfare per unit
gain in the balance of payments surplus should at the margin be the
same for all commodity prices. Turning to equation (3.3), by using
(2.9c) and the fact that

$$\sum_{k\in D} p_k^j\,\frac{\partial y_k^j}{\partial p_i^j} = 0 \quad \text{for all } i,$$


Exactly the same reasoning applies to exports. An export tax lowers the domestic
price, increases welfare, and, because it increases consumption of the given commodity,
makes the balance of payments situation worse.



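we obtain

$$\frac{\partial V}{\partial M}\,y_i^j + \lambda\Bigl[\sum_{k\in D}\frac{\partial y_k^j}{\partial p_i^j} - y_i^j\sum_{k\in C}\frac{\partial c_k}{\partial M}\Bigr] = 0. \qquad (3.3a)$$
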
Equation (3.3a) has an interpretation similar to (3.2a). An increase in
the producer price of an output increases profits and makes the consumer
better off. It has two effects on the balance of payments: by increasing
the supply of the commodity it is improved, but by increasing the demand
(as a result of the higher income from the higher profits) it is deteriorated.
Equation (3.3a) reflects the fact that at the optimal point the latter effect
on the balance of payments must dominate the former, and indeed, the
marginal rate of substitution of gains in welfare and the worsening of the
balance of payments must be the same, regardless of which producer
price is varied.

Equations (3.4) and (3.5) have the following similar interpretation:
the marginal rate of transformation between two commodities in public
production must be equal to their relative marginal effects on the balance
of payments at fixed producer and consumer prices. Thus, since $|\partial B/\partial x_i| = 1$,
the government project evaluator should use international prices in selecting
public sector projects.

To see more precisely what equations (3.2a) and (3.3a) imply for
the structure of tariffs and taxes, we begin by noting that equation (3.2a)
reduces to the form

$$\sum_{k\in C}\frac{t_k}{c_i}\left(\frac{\partial c_i}{\partial q_k}\right)_{\bar U} = -\theta < 0, \quad \text{where } i \in C. \qquad (3.6)$$

This is the familiar formula for the optimal tax structure obtained earlier
as equation (2.18). It implies that taxes (tariffs) ought to be such that
consumption of all commodities is reduced by the same percentage along
the compensated demand curve from what it would have been had international
prices been charged.

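To see what equation (3.3a) implies, we note that, from it, we can
derive the following relationship:

$$\sum_{k\in D}\frac{\tau_k^j}{y_i^j}\frac{\partial y_k^j}{\partial p_i^j} = -\sum_{k\in C}\frac{t_k}{c_l}\left(\frac{\partial c_l}{\partial q_k}\right)_{\bar U} = \theta > 0, \quad j = 1, \ldots, m, \; i \in D \text{ and } l \in C. \qquad (3.7)$$

Substituting (3.2a) into (3.3a), and using the fact that $\partial V/\partial q_l = -c_l\,\partial V/\partial M$,
we obtain

$$\frac{1}{y_i^j}\sum_{k\in D}\frac{\partial y_k^j}{\partial p_i^j} = \frac{1}{c_l}\sum_{k\in C}\frac{\partial c_k}{\partial q_l} + \sum_{k\in C}\frac{\partial c_k}{\partial M}.$$

Using the Slutsky equation, we obtain

$$\frac{1}{y_i^j}\sum_{k\in D}\frac{\partial y_k^j}{\partial p_i^j} = \frac{1}{c_l}\sum_{k\in C}\left(\frac{\partial c_k}{\partial q_l}\right)_{\bar U}.$$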



Equation (3.7) implies that the percentage change in the $j$th firm’s output
of the $i$th commodity or input of the $i$th factor should be the same for
all firms, commodities, and factors. Notice that we have allowed for the
possibility of intermediate goods. Therefore, as far as producer taxation
is concerned, one observes that intermediate goods are treated in no way
different from final consumer goods.

To see more clearly what equation (3.7) implies for the structure of
producer taxes, consider the case where we treat all the private producers
identically and, hence, can aggregate them together. Define

$$\hat\tau_k \equiv \frac{\tau_k}{1 - \tau_k} \quad\text{and}\quad \eta_{ik} \equiv \frac{p_k}{y_i}\frac{\partial y_i}{\partial p_k}. \qquad (3.8)$$

Then equation (3.7) can be written in the matrix form

$$\hat\tau\cdot[\eta_{ik}] = \theta E, \quad\text{or}\quad \hat\tau = \theta E[\eta_{ik}]^{-1}, \qquad (3.9)$$

where $E = (1, \ldots, 1)$, a vector with unity everywhere.

Equation (3.9) provides a simple expression for calculating the optimal
structure of producer taxes. Notice that only in the limiting case of
infinite supply elasticities (constant returns to scale) is it optimal to
impose no tax on domestic production.
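
The matrix formula lends itself to a short numerical sketch. The code below is illustrative only: it assumes the system behind (3.9) is $\sum_k \eta_{ik}\hat\tau_k = \theta$ for every $i$, where $\hat\tau_k$ is the producer tax expressed as a fraction of the producer price and $\eta_{ik}$ is the elasticity of supply of good $i$ with respect to price $k$. The helper names and the elasticity values are hypothetical.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("elasticity matrix is singular")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x

def producer_tax_rates(eta, theta):
    """Return tau_hat solving sum_k eta[i][k] * tau_hat[k] = theta for each i."""
    return solve_linear(eta, [theta] * len(eta))

if __name__ == "__main__":
    # Independent supply curves: the rate is theta / own elasticity.
    print(producer_tax_rates([[2.0, 0.0], [0.0, 0.5]], 0.1))
    # Cross-price effects change the rates but not the procedure.
    print(producer_tax_rates([[2.0, -0.5], [-0.25, 1.0]], 0.1))
```

With a diagonal elasticity matrix the rates fall as the own supply elasticity rises, and they tend to zero as the elasticities become infinite, which is the limiting case noted in the text.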

Since a tax-tariff structure affects only relative prices, whether there
is a tax or subsidy on a particular commodity depends on what we choose
as our numeraire; that is, the price of a given commodity will rise relative
to that of some other commodities, fall relative to still others. It follows
that Pigou’s (1947) conjecture that the tax on the output of a given commodity
by a domestic producer ought to be higher than the tax on the
import of the same commodity, is not, in fact, a well-formulated one.
A natural choice of numeraire for a small country with a principal export
crop is that export crop. Then, if the final good is produced by the intermediate
good alone, $t_f = 0$ (by definition), $\tau_i = -\theta/\eta_f(1 - \pi/p_f y_f) < 0$
(where $\eta_f$ is the supply elasticity of the final good, $p_f$ and $y_f$ are its price
and output), that is, a production subsidy (relative to the tariff on the
good) is imposed on the intermediate good.

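Finally, since

$$\sum_{k\in C} q_k\left(\frac{\partial c_k}{\partial q_l}\right)_{\bar U} = 0 \quad\text{and}\quad \sum_{k\in D} p_k^j\,\frac{\partial y_k^j}{\partial p_i^j} = 0$$

when $l \in C$ and $i \in D$, and $\tau_k = 1 - p_k$, $k \in D$, $t_k = q_k - 1$, $k \in C$, using (2.3), we
obtain (3.7).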

Our general formulation allows us to tax imports of intermediate goods at a different
rate from domestic production of intermediate goods, and this in fact will in general be
desirable, since there will be different elasticities associated with each.

The question may be reformulated as follows: Is it true that $q_i/q_0 > 1$ implies
$p_i/p_0 < 1$? Consider a change of numeraire to $j$. Assume that $q_i/q_0 > 1$, $p_i/p_0 < 1$.
Then the signs of $(q_i/q_j) - 1$ and $(p_i/p_j) - 1$ are ambiguous. In particular, it is possible
for $q_i/q_j > 1$ while $p_i/p_j > 1$.




3.3 Social Valuation of Private Projects

In many developing countries, the government also authorizes the construction
of projects in the private sector. Much of benefit-cost analysis
is directed to an evaluation of these “private” projects. Assume that no
restrictions are imposed on the government in its tax treatment (except
the levying of profit taxes). Then, from (3.3a), we note that whether the
$k$th small project increases welfare depends on the sign of

$$\pi^k + \frac{\lambda}{\partial V/\partial M}\Bigl(\sum_i \Delta y_i^k - \sum_i \Delta c_i^k\Bigr), \qquad (3.10)$$

where $\Delta c^k$ is the induced increase in consumption. The first term is the
profits, evaluated at domestic prices; the second term is the effect of the
project on the balance of payments. Thus, profits at domestic prices must
exceed the total foreign exchange cost of the project evaluated at the
shadow price of foreign exchange.

This may be reformulated, since

$$\sum_i q_i\,\Delta c_i^k = \pi^k,$$

$$\sum_i \Delta y_i^k - \sum_i \Delta c_i^k = \sum_i \Delta y_i^k - \pi^k + \sum_i t_i\,\Delta c_i^k = \sum_i \tau_i\,\Delta y_i^k + \sum_i t_i\,\Delta c_i^k = R^k,$$

where $R^k$ is the change in government revenue from trade taxes from the
project. Thus, we require $\pi^k + [\lambda/(\partial V/\partial M)]R^k \ge 0$. Profits at domestic
prices, plus changes in government trade tax revenues, evaluated at the
shadow price of foreign exchange, $[\lambda/(\partial V/\partial M)] > 1$, must be nonnegative.
This makes clear that the criteria sometimes advocated,
namely, $\pi + R > 0$, or indeed the criterion $\sum \Delta y^k > 0$ (that profits
evaluated at international prices be nonnegative), are both overly
restrictive. The point is that profits and government revenue should not
be evaluated at the same set of shadow prices.
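
The acceptance rule can be made concrete with a small calculation. This is a sketch under stated assumptions rather than the authors' procedure: international prices are normalized to one, so the producer tax on good $i$ is taken to be $\tau_i = 1 - p_i$ and the tariff $t_i = q_i - 1$; `fx_premium` stands for the shadow price of foreign exchange $\lambda/(\partial V/\partial M)$; and all names and figures are hypothetical.

```python
def project_terms(p, q, dy, dc):
    """Return (profit at domestic prices, change in trade-tax revenue R).

    p, q   : producer and consumer prices (international prices equal one)
    dy, dc : the project's net outputs and the induced consumption changes
    """
    profit = sum(pi * dyi for pi, dyi in zip(p, dy))
    revenue = sum((1.0 - pi) * dyi for pi, dyi in zip(p, dy))
    revenue += sum((qi - 1.0) * dci for qi, dci in zip(q, dc))
    return profit, revenue

def accept_project(p, q, dy, dc, fx_premium):
    """Accept when profit + fx_premium * R is nonnegative."""
    profit, revenue = project_terms(p, q, dy, dc)
    return profit + fx_premium * revenue >= 0.0

if __name__ == "__main__":
    p, q = [1.0, 0.8], [1.0, 1.25]
    dy = [-6.3, 6.0]   # uses 6.3 units of good 0 to make 6 units of good 1
    dc = [-1.5, 0.0]   # consumption falls so that q . dc equals the profit
    print(project_terms(p, q, dy, dc))         # profit about -1.5, R about 1.2
    print(accept_project(p, q, dy, dc, 1.5))   # accepted with a premium of 1.5
    print(accept_project(p, q, dy, dc, 1.0))   # rejected by the rule pi + R > 0
```

The example project loses money at domestic prices and fails the unweighted test, yet passes once revenue is valued at a foreign exchange premium above one, which is the sense in which the unweighted criterion is too restrictive.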


3.4 Nontradeables

When there are nontradeables, we must include in our calculations the
change in consumption and production of these commodities. As equations
(3.4) and (3.5) make clear, nontradeables are to be evaluated in
terms of their marginal foreign exchange costs. That is,

$$\rho_i = \lambda\,\frac{\partial G/\partial x_i}{\partial G/\partial x_k}, \quad k \in T \text{ and } i \in N. \qquad (3.11)$$





The shadow price of a nontradeable (in utility numeraire) is, therefore,
equal to the value (at international prices) of the foreign exchange which
would be generated were one unit less of it produced domestically, and
instead more units of the tradeables were produced, valued at the shadow
price of foreign exchange (in utility numeraire).

The rest of the analysis proceeds easily. One obtains from equations
(3.2) and (3.3) tax formulae which are similar to equations (3.6) and
(3.7), except now, for the nontradeables,

$$t_i = q_i - \frac{\rho_i}{\lambda}, \quad i \in N \cap C, \qquad (3.12a)$$

$$\tau_i = \frac{\rho_i}{\lambda} - p_i, \quad i \in N, \qquad (3.12b)$$

which give the difference between the consumer price and the shadow
price (eq. [3.12a]) and the difference between the shadow price and the
producer price (eq. [3.12b]). If we now suppose that there is a single
producer of a given nontraded good (or that the producers of the good
can be aggregated) and also assume that the demand and supply curves
are independent of other prices, we find that the tax levied on a unit of a
nontraded good is

$$q_i - p_i = t_i + \tau_i = \theta\left[\frac{y_i}{\partial y_i/\partial p_i} - \frac{c_i}{(\partial c_i/\partial q_i)_{\bar U}}\right], \quad i \in N \cap C. \qquad (3.13)$$

The formula given in (3.13) is a generalization of the result in Ramsey
(1927).

3.5 Foreign Firms

In the foregoing analysis we assumed that profits accrued to the nationals
of the country in which production occurred. In most developing countries,
a significant part of the profits is repatriated to some foreign country.
Consider then the case where all profits of a given firm are repatriated.
These profits do not affect the welfare of the representative consumer
and, indeed, contribute to the deterioration of the balance of payments
situation. In this situation, the balance of payments condition reads as

$$\sum_{k\in T} z_k + \sum_{j\in F}\pi^j = 0,$$

where $F$ is the set of foreign firms. Suppose again, for ease of exposition,
that all goods are tradeable. For $j \in F$ we note that the first-order condition

It would perhaps have been more natural to choose as our numeraire labor, a
nontraded “good.” Then, assume we have two methods of obtaining a commodity,
one by producing it with labor, the other by importing. This is the case that Pigou
(1947) probably had in mind, and a simple recasting of our results yields the conclusion
that a surtax is imposed on the domestic production of the commodity.





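(3.3) can be written as

$$-\lambda\frac{\partial\pi^j}{\partial p_i^j} + \sum_{k\in D}\Bigl(\nu^j\frac{\partial F^j}{\partial y_k^j} + \rho_k\Bigr)\frac{\partial y_k^j}{\partial p_i^j} = 0, \quad j \in F \text{ and } i \in A. \qquad (3.14)$$

On using equations (3.4), (3.5), and (2.9a) in equation (3.14) one obtains

$$-\lambda y_i^j + \nu^j\sum_{k\in D}\frac{\partial F^j}{\partial y_k^j}\frac{\partial y_k^j}{\partial p_i^j} + \lambda\sum_{k\in D}\frac{\partial y_k^j}{\partial p_i^j} = 0,$$

or, using (2.9d),

$$\sum_{k\in D}\frac{\partial y_k^j}{\partial p_i^j} - y_i^j = 0, \quad i \in D \text{ and } j \in F.$$

An increase in the producer price has, therefore, two effects: it increases
domestic production, thus reducing the need for imports and improving
the balance of payments, and it increases profits, which worsens the
balance of payments (when repatriated). Optimality requires at the
margin that the two effects offset each other.
Again using (2.9d) and (2.3), we obtain

$$\sum_{k\in D}\frac{\tau_k^j}{y_i^j}\frac{\partial y_k^j}{\partial p_i^j} = 1, \quad i \in D \text{ and } j \in F. \qquad (3.15)$$

Writing $\hat\tau_k^j = \tau_k^j/p_k^j$ and $\eta_{ik}^j = (p_k^j/y_i^j)(\partial y_i^j/\partial p_k^j)$, one obtains from equation
(3.15) the formula

$$\hat\tau^j = E[\eta_{ik}^j]^{-1}. \qquad (3.16)$$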

One should observe that, as before, industries which have more inelastic
supply curves should be taxed at a higher rate. More interesting and more
important is the fact that equation (3.16) implies that the tax rates are
independent of the government’s need for revenue. (This is in contrast
with the case of domestically owned firms, where the tax rates depend on
$\theta$.) That is to say, the tax rates on the commodities produced by foreign-owned
firms are determined completely by balance of payments considerations.
This results in the tax rates depending only on the properties of the
production function of the industry.

The cost-benefit criterion for accepting projects of foreign-owned
corporations must similarly be modified. The criterion should be simply
whether $\lambda(-\pi^k + \sum_i \Delta y_i^k) = \lambda\sum_i(1 - p_i)\Delta y_i^k = \lambda\sum_i \tau_i\,\Delta y_i^k > 0$, that
is, whether the surtax (subsidy) on domestic production yields net tax
revenue. (This assumes that the project will not affect prices facing other
firms.)
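
For a foreign-owned project the test collapses to a single sum. The sketch below assumes, as in the all-tradeable case, international prices normalized to one, so that the producer tax on good $i$ is $1 - p_i$; the function name and the numbers are hypothetical.

```python
def foreign_project_gain(p, dy):
    """Net foreign exchange gain of a foreign-owned project.

    Equals output valued at international prices (all equal to one) minus
    the profit repatriated at domestic producer prices p, which reduces to
    sum_i (1 - p_i) * dy_i, the producer-tax revenue on the project.
    """
    return sum((1.0 - pi) * dyi for pi, dyi in zip(p, dy))

if __name__ == "__main__":
    p = [1.0, 0.8]
    dy = [-4.0, 6.0]
    repatriated_profit = sum(pi * dyi for pi, dyi in zip(p, dy))
    gain = foreign_project_gain(p, dy)
    print(gain, sum(dy) - repatriated_profit)  # the two numbers coincide
    print(gain > 0)                            # accept when the gain is positive
```

Because $\lambda > 0$ scales the whole expression, only the sign of the sum matters for the decision.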


Compare equation (3.7).





4. Inability of Taxing Domestic Production


4.1 No Profit Taxation


We now modify the model somewhat to assume that the government cannot
impose producer taxes. We suppose that the only taxes that are
feasible are trade taxes. Domestic producers then face the same price
vector as does the consumer. To keep the notation simple, we suppose
initially that all goods are tradeable, consumable, and domestically
produced. That is, $A = D = T = C$. We can again formulate the problem
as if the controls at the disposal of the government are the domestic
price vector, $q$, the vector of public production, $x$, and the vector of trade
flows, $z$. (Note that we must have $q - p = t + \tau = 0$.) Using the Lagrangian
expression (3.1), we now obtain the first-order conditions as



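$$\frac{dV}{dq_i} - \sum_{k\in A}\rho_k\frac{dc_k}{dq_i} + \sum_{j=1}^{m}\sum_{k\in A}\Bigl(\nu^j\frac{\partial F^j}{\partial y_k^j} + \rho_k\Bigr)\frac{\partial y_k^j}{\partial q_i} = 0, \quad i \in A \text{ and } i \neq 0, \qquad (4.1)$$

$$\mu\frac{\partial G}{\partial x_i} + \rho_i = 0, \quad i \in A, \qquad (4.2)$$

$$-\lambda + \rho_i = 0, \quad i \in A, \qquad (4.3)$$

where $dV/dq_i$ and $dc_k/dq_i$ are to be regarded as total derivatives. Now,
equations (4.2) and (4.3) together imply once again that international
prices ought to be used in benefit-cost analysis of public sector projects.
Using this fact we can rewrite equation (4.1) as

$$\frac{dV}{dq_i} - \lambda\Bigl[\sum_{k\in A}\frac{dc_k}{dq_i} - \sum_{k\in A}\frac{\partial y_k}{\partial q_i}\Bigr] = 0, \quad i \in A \text{ and } i \neq 0, \qquad (4.1a)$$

where

$$y_k = \sum_{j=1}^{m} y_k^j.$$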


The interpretation of equation (4.1a) is as before: the marginal cost
in welfare per unit gain in the balance of payments should be the same
regardless of which price (tariff) is varied. Now, since the government
can be viewed as controlling both trade and public production, and since
the shadow prices in the public sector are equal to the international prices,
we can treat trade flow and public production indifferently. Writing
$e_k = c_k - y_k = x_k + z_k$ as the “excess demand” or “imports” (from

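The actual controls are tariffs, $t$, and government production, $x$. See above, p. 10.
That is,

$$\frac{dV}{dq_i} = \frac{\partial V}{\partial q_i} + \frac{\partial \pi}{\partial q_i}\frac{\partial V}{\partial M}, \qquad \frac{dc_k}{dq_i} = \frac{\partial c_k}{\partial q_i} + \frac{\partial \pi}{\partial q_i}\frac{\partial c_k}{\partial M}.$$

The evaluation of projects in the private sector follows as in the previous section.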




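public production and foreign trade) for commodity $k$, we obtain from
equation (4.1a) the fact that

$$\sum_{k\in A}\frac{t_k}{e_i}\left[\left(\frac{\partial c_i}{\partial q_k}\right)_{\bar U} - \frac{\partial y_i}{\partial q_k}\right] = -\theta,$$

where

$$\theta = 1 - \frac{\partial V/\partial M}{\lambda} - \sum_{k\in A} t_k\frac{\partial c_k}{\partial M}. \qquad (4.4)$$

That is to say, the percentage reduction in “excess demand” should be the
same for all commodities.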
Now clearly 



Therefore, equation (4 4) can be rewritten (on writing = 

1,/1 -t- t,) as 

+ »?■'] j = eE (4 5) 

(where / is the identity matrix and E = (1, 1, 1, , 1)). In equation 

(4 5) (_>>/«) denotes a vector an element of which is We can re- 

express equation (4 5) to obtain the optimal tanff structure as 

i = 6E [i/T* = 0£ [[i,-*] 4- [r,‘ + !/-]]-> (4 6) 

We should contrast equation (4.5) with the results of section 3.2, as
embodied in equation (3.7).


i W] 




We shall use the quotation marks on “excess demand” or “imports” to signify this.
Equation (4.1a) can be rewritten as




i5» 

dM 


+ Ari Z (?s - It) ^ 
*«A SM 


y sc V i 

h >, “ sfa * ad/ “ ‘ >1 


dV 
A dM 


m -e 


r, if«, 0, 





In the case of independent demand and supply curves, equation (4.6)
reduces readily to

$$\frac{t_i}{1 + t_i} = -\theta\left[\frac{q_i}{e_i}\left(\frac{\partial e_i}{\partial q_i}\right)_{\bar U}\right]^{-1}, \quad i \in A. \qquad (4.7)$$

The formula given in (4.7) expresses the fact that, at the optimum, the tax
rate (as a percentage of the consumer price) is inversely proportional to
the elasticity of “excess demand.” Notice also that equation (4.7) implies
an export tax for exports whose net supply curve is positively sloped, and a
subsidy for exports whose net supply curve is backward bending. It is clear
then that duties need not be uniform in this second-best revenue tariff.
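
The sign pattern described above can be traced through a short calculation. This sketch assumes the inverse-elasticity form $t_i/(1 + t_i) = -\theta/\xi_i$, where $\xi_i = (q_i/e_i)(\partial e_i/\partial q_i)$ is the own-price elasticity of “excess demand”; the symbol $\xi_i$, the function name, and the numerical values are illustrative assumptions, not taken from the source.

```python
def tariff_rate(theta, elasticity):
    """Return (t_hat, t) with t_hat = t / (1 + t) = -theta / elasticity.

    elasticity is (q/e) * (de/dq). It is negative for an import whose excess
    demand falls with price, and positive for an export (e < 0) whose net
    supply rises with price.
    """
    if elasticity == 0:
        raise ValueError("elasticity of excess demand must be nonzero")
    t_hat = -theta / elasticity
    if t_hat >= 1.0:
        raise ValueError("implied rate is not feasible (t_hat must be below 1)")
    return t_hat, t_hat / (1.0 - t_hat)

if __name__ == "__main__":
    print(tariff_rate(0.1, -2.0))  # import: positive tariff
    print(tariff_rate(0.1, 1.5))   # export, rising net supply: t < 0, an export tax
    print(tariff_rate(0.1, -0.5))  # export, backward-bending net supply: a subsidy
```

A negative $t$ on an export means the domestic price sits below the international price, which is what an export tax does; a positive $t$ on an export corresponds to a subsidy.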

We turn now to some other cases. For instance, if $y_i = 0$, which
implies that the commodity does not enter into domestic production, then
the tariff is identical to that described in section 3.2, as embodied in
equation (3.7).

For intermediate goods, $y_i = -e_i$, so the tariffs/taxes are chosen to
make the producers’ prices the same as discussed in section 3.2 (cf. eq.
[3.7]). Note that this implies a tariff structure with different rates on
different commodities, but such that the production of all intermediate
goods is increased by the same percentage; imports of intermediate goods
not produced domestically are reduced by the same percentage.

But not even this simplicity of structure obtains if firms are foreign
owned. For foreign-owned firms, equation (4.4) must be modified to
read



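$$\sum_{k\in A}\frac{t_k}{e_i}\left[\left(\frac{\partial c_i}{\partial q_k}\right)_{\bar U} - \frac{\partial y_i}{\partial q_k}\right] = -\theta + \frac{y_i}{e_i}\Bigl(\frac{1}{\lambda}\frac{\partial V}{\partial M} + \sum_{k\in A} t_k\frac{\partial c_k}{\partial M}\Bigr). \qquad (4.8)$$

Since raising the tariff increases profits, which accrue in this case to
foreigners and hence hurt the balance of payments, tariffs on commodities
produced by foreign-owned firms should be lower than on those produced
by domestically owned firms (for the same elasticities of the excess demand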

See Ramaswami and Srinivasan (1968).
Equation (4.1a) becomes


or 


Hence, 




(*i +j'i) 


IL 

au 





curves), the difference in the percentage reduction in "imports” from 
what they would have been without tariffs is proportional to the ratio of 
domestic production to “imports.” 

4.2 Possibility of Taxing Profits at 100 Percent

There is one further case that deserves brief mention. We now suppose
that the government can (and therefore, does) set the tax on pure profits
at 100 percent. Then, (4.1) becomes

$$\sum_{k\in A}\frac{t_k}{e_i}\left[\left(\frac{\partial c_i}{\partial q_k}\right)_{\bar U} - \frac{\partial y_i}{\partial q_k}\right] = -\theta\,\frac{c_i}{e_i}. \qquad (4.9)$$

This implies that the relative reduction in “excess demand” (“imports”)
is simply inversely proportional to the ratio of “excess demand” (“imports”)
to consumption. In particular, in the case of intermediates used to
produce a nonconsumed export good, note that no tariffs should be
imposed on imports. Thus, the conclusion of Ramaswami and Srinivasan
(1968), that “inputs used in export production must be free of duty” is
only partially correct: if the tax authorities can impose only import
duties and 100 percent profits taxes, then only if the good is also not
consumed should it be exempt from duties. If it can impose 100 percent
profits taxes and consumption-production taxes as well as tariffs, no intermediate,
whether used for producing exports or import substitutes,
should be taxed. If profits are not taxed at 100 percent, and if the same
tax rate must be imposed on domestic production as on imported intermediates,
then, even though the good is not consumed, a tariff-production
tax ought to be imposed.


5. Benefit-Cost Analysis with Given Taxes

5.1 All Taxes Fixed

We now consider the case where the government project evaluator needs
to take the tariff and tax structure as given and fixed. The question is
what shadow prices ought he to use in project evaluation? Notice that the
number of controls in the planning exercise is now drastically reduced.
The Lagrangian of the planning problem is still expression (3.1), but now
the set of controls to be chosen are the vector of public production, $x$,


The introduction of nontradeables does not change these results. Letting $\tau_k \equiv
(\rho_k/\lambda) - p_k$, $k \in N$, be the difference between the shadow price and the private producer
price of a nontradeable, (4.9) can be rewritten



-Oc, 


Compare Ramaswami and Srinivasan (1968, p. 576).





and the volumes of trade, $z$. If all goods were traded, fixing all taxes
would fix all consumer and producer prices. Since our focus in this section
is explicitly on public production, it is convenient to assume there is a
single public good; resource savings from greater efficiency in the public
sector can be used to increase the expenditure on the public good. Letting
$g$ denote the public good, our Lagrangian can now be formulated:

$$\mathcal{L} = V(q, g) - \lambda\sum_{k\in T} z_k + \sum_{k\in A}\rho_k\Bigl(x_k + z_k + \sum_j y_k^j - c_k\Bigr) + \mu G(x, g).$$

The first-order conditions now read as

$$\frac{\partial V}{\partial g} + \mu\frac{\partial G}{\partial g} = 0, \qquad (5.1)$$

$$-\lambda + \rho_k = 0, \quad k \in T, \qquad (5.2)$$

$$\rho_k + \mu\frac{\partial G}{\partial x_k} = 0, \quad k \in D. \qquad (5.3)$$

From equations (5.2) and (5.3), we note that for traded goods the marginal
rates of transformation in the public sector must equal the international
price ratios, unity. That is, the project evaluator ought to use international
prices in project evaluation. Notice that we have not assumed
that the fixed tax rates are in any sense optimal. For nontradeables,
we note a result similar to that obtained in section 3.3: the shadow price
of a nontraded good is its marginal foreign exchange cost. (It is obvious
that this set of results continues to hold if profits can be taxed up to
100 percent.)

Turning to the “evaluation” of a private domestically owned project,
the analysis for arbitrary but fixed tariffs and taxes is identical to that
of section 3 where it was supposed that tariffs and taxes were optimally
chosen.


5.2 Some Taxes Controllable


How does the fact that some tariff rates are unalterable affect the choice
of the remaining tariff rates? Assume that, for one set of commodities,
say $R$, $q_i$ can be chosen by the government but that, for the remaining
goods, they are given as fixed. Assume for notational ease that all goods
are tradeable, that is, $A = T$. It is then trivial to confirm, on repeating
the analysis of section 3.1, that

$$\sum_{k\in R} t_k\left(\frac{\partial c_i}{\partial q_k}\right)_{\bar U} + \sum_{k\notin R} t_k\left(\frac{\partial c_i}{\partial q_k}\right)_{\bar U} = -\theta c_i, \quad i \in R. \qquad (5.4)$$


We assume for notational simplicity in this formulation that the public good is
not traded, and all other goods are. The reader can easily extend the analysis for the
situation where there are nontraded private goods in the economy.




It follows that the consumption of all freely taxable goods should be 
reduced by the same percentage. This implies that, if demand elasticities 
are assumed independent, the tax rates on those commodities for which 
we can choose tax rates are unaffected. 


5.3 Quotas

It is not surprising that the presence of quotas alters both the tariff
structure of the remaining commodities and the shadow prices to be
used in the public sector for those commodities on which quotas have been
imposed. To the basic Lagrangian (3.1) we now add an extra term to
obtain

$$\mathcal{L} = V\Bigl[q, \sum_{j=1}^{m}\pi^j(p^j)\Bigr] - \lambda\sum_{k\in T} z_k + \mu G(x) + \sum_{j=1}^{m}\nu^j F^j(y^j) + \sum_{k\in A}\rho_k\Bigl(\sum_{j} y_k^j + x_k + z_k - c_k\Bigr) + \gamma(\bar z_s - z_s), \qquad (5.5)$$

where $\bar z_s$ is the quota imposed on commodity $s$. It is immediate that the
first-order conditions (3.4) and (3.5) continue to hold here for all $i \in D$
and $i \neq s$. If $s$ enters into domestic production, then the first-order
conditions pertaining to its shadow price in the public sector read

$$\mu\frac{\partial G}{\partial x_s} + \rho_s = 0, \qquad -\lambda - \gamma + \rho_s = 0, \qquad (5.6)$$

where

$$\gamma \ge 0 \quad\text{and}\quad \gamma(\bar z_s - z_s) = 0, \qquad (5.7)$$

from which we obtain the fact that $(\partial G/\partial x_s)/(\partial G/\partial x_0) = 1 + \gamma/\lambda \ge 1$.
Generally speaking then, the shadow price of $s$ in the public sector is
equal to unity only when the quota does not bite. Otherwise it is greater
than unity. Turning to the structure of optimum taxes we note simply that
both the tariffs and the production taxes are affected. For instance, if
we assume that tariffs may be chosen on all commodities and that $s$ is
consumable, then the equation (corresponding to eq. [3.6]) yielding the
structure of taxes on consumer goods reads as


keC Ci\dqnJjj 


(5 8) 


In the case of independent demand curves, we notice from equation (5.8)
that the tariff of only commodity $s$ is altered, and it is set so as to capture
the profits which would otherwise accrue to the importers of $s$. Complements
of $s$ have their consumption reduced proportionately more than is
the consumption of substitutes. That is to say, the tax structure ought
to be designed so as to encourage the consumption of substitutes for the
commodity on which a quota has been imposed. This is clearly what
intuition suggests.





6. Trade Policy Dependent on Project Selection


Quite often a decision maker realizes that the presence of a particular
project in the country is likely to give rise to changes in the tariff policy.
In particular, there is a long experience of newly established industries
requesting and obtaining protection, even when on “purely economic”
grounds this protection is questionable. To attempt to capture this kind
of situation we postulate for simplicity of notation that for some commodity
$f$ the tariff $t_f$ is an increasing function of the level of imports of
$f$. That is, $t_f = t_f(z_f)$, $t_f' > 0$. Assume also for simplicity of notation that
the model is otherwise the same as that in section 3. The drastic assumption
that we make here, as the reader will immediately recognize, is the
assuming away of any game theoretic problems that this kind of tariff
“response” obviously has built within it. We are thus assuming that the
tariff on $f$ responds passively to the level of imports of $f$. It is then simple
to see that the equations corresponding to equations (3.4) and (3.5) read
here as




dxt 


+ 


Pi = 


— A + Pi — 0 


dqt dzt 


— X + Pf — 


0 leD, 

(6 1) 

t # / and » e T, 

(6 2) 

V' ft 

*Tc dZi 

(6 3) 


It follows that, for all tradeable goods except f, shadow prices in the
public sector are their international prices. So far as commodity f is
concerned, we can reexpress equation (6.3) as

From equation (6.4), we can readily verify that the shadow price of
commodity f is

dh 


1 -r, 


-[»+ E-f-ll/l • (6-5) 

dZf I i.cCf i/pj/ 5-5# 


dVjdM 

X 


, V . I X' ' A ^ I 

‘ ft- Or'- 

A ^ ^ 8M Vafc/j; 

+ ft-*)&-- 

'A / V 

This analysis assumes all goods are tradeable. The modifications required if some goods
are not traded are obvious.


dVI3M 
A 

SVIdM 





It follows from expression (6.5) that the shadow price of commodity f is
below or above the international price of f depending on whether the given
tax structure reduces its consumption by less or more than is optimal. In
particular, if demand curves are independent it depends simply on whether

w(i + 1,) ^';/(! + '*) = shf, 

More generally, it is simple to demonstrate (Dasgupta and Stiglitz
1971) that, for tradeable commodities, if the tariff on some commodity k
depends on the level of imports of some other commodity k' as well as
on, say, the government’s net production of yet some other commodity
k'', then the shadow prices of only k' and k'' differ from their international
prices. It follows from this that the fact that the tariff on commodity k
responds unoptimally to the import (or the public production) of some
other commodity k' is no argument for setting the shadow price of k (or
for that matter any commodity other than k') different from its international
price. We have found it surprising that this should be so in such
an interdependent system as we have been considering.

If demand curves are interdependent, then the tariffs on other goods
will, however, be affected by the constraint on the tariff on commodity f.
Just as in the case of quotas, substitutes for commodity f will have their
production reduced by less than complements.


7. Foreign Exchange Constraint 

Although the most natural interpretation of the models presented thus
far is in terms of the conventional models of static international trade,
there is no reason why the different commodities could not be treated
as well as dated commodities. As we noted earlier in section 1.2, a foreign
exchange constraint may be interpreted as a limitation on the amount a
country can borrow in any period. To make the interpretation clearer,
we let $c_{it}$, $y_{it}$, $x_{it}$, $z_{it}$ be, respectively, the consumption, net output in the
private sector, net output in the public sector, and net imports of commodity
i at time t. We assume that all relative international prices (within
each period) are constant, so that we normalize all at unity. Let $r_t$ be the
international rate of interest at t. Then the trade balance condition
can be written as

$\sum_t \sum_i \frac{z_{it}}{1 + r_t} = 0$. (7.1)

The borrowing constraint at some year t' reads as

$\sum_i z_{it'} \le e_{t'}$. (7.2)


That is, 1/(1 + r_t) is the price today of a promise to deliver $1.00 of foreign exchange
at time t.

The basic Lagrangian (3.1) now has an extra term, and it reads as


vU, ^ j E + /■ow + |j 

Put + *« + 

+ Z r»- ^8.' - 5 • ( 


where $\gamma_{t'} \ge 0$ and $\gamma_{t'}\left(e_{t'} - \sum_i z_{it'}\right) = 0$.


The first-order conditions, corresponding to equations (3.4) and (3.5), 
now read as 


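$\mu \frac{\partial G}{\partial x_{it}} + \rho_{it} = 0$, (7.4)

$\rho_{it} - \frac{\lambda}{1 + r_t} = 0$, $i \in T$, (7.5)

$\rho_{it'} - \frac{\lambda}{1 + r_{t'}} - \gamma_{t'} = 0$, $i \in T$. (7.6)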


Equations (7.4)-(7.6) yield the important implication that within any period
international prices ought to be used in the evaluation of public sector
projects. But if the constraint on foreign exchange is binding at t', the
rate of interest (the rate of discount) at t' should not be $r_{t'}$; rather, it
should be somewhat higher than that. On the other hand, for nontraded
goods, the shadow price is simply the value at international prices
of the foreign exchange that could have been generated if the production of
the nontraded good had been decreased by a unit and the production
of traded goods increased.
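
The discount-rate claim can be illustrated numerically. The sketch below is not from the paper: it assumes that the shadow price of a traded good dated t is lam / (1 + r_t), plus a multiplier gamma in a period where the borrowing constraint binds, and that r_t is the cumulative rate implied by the price today of foreign exchange delivered at t. All function names and numbers are hypothetical.

```python
def shadow_price(lam, r, gamma=0.0):
    """Assumed shadow price of a traded good dated at a period whose world
    discount factor is 1 / (1 + r); gamma is that period's borrowing-constraint
    multiplier (zero when the constraint does not bind)."""
    return lam / (1.0 + r) + gamma


def interest_factor(lam, r_early, r_late, gamma_early=0.0):
    """Units of the good at the later date that exchange, at shadow prices,
    for one unit at the earlier date."""
    return shadow_price(lam, r_early, gamma_early) / shadow_price(lam, r_late)


lam = 1.0                    # multiplier on the trade balance condition (illustrative)
r_tp, r_next = 0.10, 0.21    # cumulative world rates: 10 percent per period

world = interest_factor(lam, r_tp, r_next)
constrained = interest_factor(lam, r_tp, r_next, gamma_early=0.05)

print(round(world, 4), round(constrained, 4))
```

With the constraint slack, the factor between t' and the following period equals the world factor; once gamma is positive the implied factor is larger, which is the sense in which the discount rate at t' exceeds the world rate.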

If profits are taxed at 100 percent, then it is easy to confirm that the
private sector ought to face the same set of prices as the public sector.
That is to say, at the optimum the economy is productively efficient.
With less than 100 percent profit tax this is no longer true, and indeed the
rate of discount that should be used for different nontraded goods may
well be different.

Turning to the structure of taxes, we note once again that consumption
of commodities in periods which are substitutes for consumption in periods
in which the constraint is binding is reduced by less than the consumption


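** $(\partial G/\partial x_{it})/(\partial G/\partial x_{kt}) = \rho_{it}/\rho_{kt} = 1$; $(\partial G/\partial x_{it})/(\partial G/\partial x_{it'}) = \rho_{it}/\rho_{it'} = [\lambda/(1 + r_t)]/\{[\lambda/(1 + r_{t'})] + \gamma_{t'}\}$.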




in periods that are complements. That is to say, if the constraint is binding
for initial years, the tariff structure should be designed so as to encourage
savings in the initial periods (e.g., by having higher taxes on complements
of leisure in order to encourage work in initial periods) and a gradual
lowering of tariffs over time (the substitution effect encouraging the
postponement of consumption). We do not reproduce the formal argument
here, since it is pretty straightforward.


8. Budget Constraint 

In this section we consider a constraint on the government budget
deficit in any given year. For a closed economy, this constraint was looked
at originally by Boiteux (1956), and his analysis was extended in Stiglitz
and Dasgupta (1971).

The constraint may be written as

S ^ K- (8 1) 

itD 


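If $\delta_t$ is the dual associated with the constraint (8.1) then we have, on
writing

$\lambda_t = \frac{\lambda}{\prod_{\tau \le t}(1 + r_\tau)}$,

the simple result

$\frac{\partial G/\partial x_{it}}{\partial G/\partial x_{kt}} = \frac{\lambda_t + \delta_t q_{it}}{\lambda_t + \delta_t q_{kt}}$, $i, k \in D \cap T$. (8.2)

Without loss of generality, let $q_{it} > q_{kt}$. Then

$\frac{q_{it}}{q_{kt}} \ge \frac{\partial G/\partial x_{it}}{\partial G/\partial x_{kt}} \ge 1$. (8.3)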


The shadow prices to be used in the public sector lie between the
international prices and the domestic prices, regardless of whether the domestic
prices are determined by optimal or nonoptimal tariffs and taxes.
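
A small numerical check of this bound may help. The sketch assumes that the relative shadow price takes the form (lam_t + delta_t * q_i) / (lam_t + delta_t * q_k), with world prices normalized to unity so that the international price ratio is 1 and the domestic price ratio is q_i / q_k. The function names and parameter values are illustrative, not taken from the paper.

```python
def shadow_price_ratio(lam_t, delta_t, q_i, q_k):
    """Assumed relative shadow price of goods i and k under a budget constraint."""
    return (lam_t + delta_t * q_i) / (lam_t + delta_t * q_k)


def lies_between(lam_t, delta_t, q_i, q_k):
    """True when the ratio falls between the world ratio (1) and q_i / q_k."""
    ratio = shadow_price_ratio(lam_t, delta_t, q_i, q_k)
    low, high = sorted((1.0, q_i / q_k))
    return low <= ratio <= high


# Nonnegative multipliers, domestic prices q_i = 1.5 and q_k = 1.2 (made up).
checks = [
    lies_between(lam, delta, 1.5, 1.2)
    for lam in (0.5, 1.0, 2.0)
    for delta in (0.0, 0.3, 5.0)
]
print(all(checks))
```

When delta_t is zero the budget constraint is slack and the ratio collapses to the international ratio; as delta_t grows relative to lam_t the ratio approaches the domestic price ratio.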
Similarly, 


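$\frac{\partial G/\partial x_{it}}{\partial G/\partial x_{i,t+1}} = \frac{\lambda_t + \delta_t q_{it}}{[\lambda_t/(1 + r_{t+1})] + \delta_{t+1} q_{i,t+1}}$. (8.4)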


** In writing (8.1), we have implicitly assumed that D = C; for intermediate goods,
we have to use production prices. This does not change the results at all.



If we were to assume $q_{it} = q_{i,t+1}$, then

$\frac{\partial G/\partial x_{it}}{\partial G/\partial x_{i,t+1}} \gtrless (1 + r_{t+1})$ as $\frac{\delta_t}{\delta_{t+1}(1 + r_{t+1})} \gtrless 1$. (8.5)


Condition (8.5) gives a qualitative rule for choice of the rate of discount 
in the public sector.** 


9. Distributional Objectives 

Thus far we have been assuming the existence of a representative individual
whose welfare the government is maximizing. It is plain, however,
that considerations of income distribution ought to weigh heavily both in
the determination of the structure of taxes and tariffs and in the selection
of investment projects in the public sector. Assume that the government
is interested in maximizing the individualistic social welfare function of
the form $W = W(U_1, \ldots, U_r)$ where there are r nonidentical individuals
in the economy. It is then easy to show that corresponding to W there is
an indirect social welfare function

$V = V(q, M_1, \ldots, M_r)$, (9.1)

where, if $\beta_{hj}$ is the share of the jth firm owned by individual h,

Afs = E fiy 

j 

Then, in the analysis of sections 1-8 we simply replace the indirect utility
function of the representative individual by the function defined in
expression (9.1). It is then easy to show (see Dasgupta and Stiglitz 1971)
that, in this instance, even though the structure of optimal taxes and tariffs
is affected by distributional considerations, the basic qualitative propositions
concerning public investment criteria (i.e., the nature of the shadow prices
to be used) remain unchanged. That is to say, the government will still use
international prices to evaluate public projects in those cases where, when
we ignored distributional objectives, it was optimal to do so.**

10. Conclusions 

We now summarize the basic results obtained in this paper in the form 
of a few simple rules. 

** It is, perhaps, of some interest to note that if profits were taxed at 100 percent and
the only constraint facing the government were the budget constraint (8.1), the optimum
tax structure would be such that the prices of tradeable goods faced by the private sector
would be unity. That is to say, at the optimum $t_i = 0$ for all i (see Dasgupta and Stiglitz).

** For a fairly extensive discussion of the implications of distributional objectives for
the structure of taxation in a closed economy, see Atkinson and Stiglitz (1973).



10.1. Public Investment Criteria


Rule 1. The public sector ought to use a uniform set of shadow prices
in all its projects.

Rule 2. The shadow price of a tradeable commodity is its international
price unless (a) there is a government budgetary constraint, (b) there is a
foreign exchange constraint, (c) there is a quota on that commodity, or
(d) the level of net imports (or net public production) of that particular
commodity influences unoptimally some control (e.g., some tariff) at the
disposal of the government.

Rule 3. When there is a government budgetary constraint, the shadow
price of a tradeable lies between its world price and its domestic price.

Rule 4. When there is a foreign exchange constraint, the relative prices
of commodities within each period are equal to the world price ratios,
but the rate of interest used in project evaluation is not equal to the world
rate of interest.

Rule 5. When there is a quota on a particular commodity, its shadow
price should be greater than its international price if the quota is binding.

Rule 6. When the net level of imports of a commodity influences its
tariff level, its shadow price should be greater or less than its international
price depending on whether, as a consequence of the tariff
response, the consumption of the commodity is reduced (from what it
would have been had the tariff been directly controllable and had been
at its optimal level) or increased.

Rule 7. Except under those exceptions noted in rule 2, nontradeables
ought to be valued at their “foreign exchange” equivalent. That is the
value of the foreign exchange that would be earned if one less unit of
the given nontradeable were produced and the resources diverted to the
production of tradeables.

What is particularly useful about these results is that they show the
project evaluator what to do even in “second-best” (optimally chosen
tariffs and taxes) or “third-best” (nonoptimally chosen tariffs and taxes)
situations. They show that, even when there are quotas (or similar
restrictions), although one does not use international prices for the
commodities on which there is a quota, for other commodities one does.

10.2. Tariffs and Taxes

All the following rules are predicated on the assumption that lump-sum 
taxation is unfeasible, so that the government has to resort to distortionary 

** This does not, of course, pertain to projects at different locations in an economy where
there are substantial transport costs. As in usual general equilibrium analysis, one would
then wish to expand the commodity space to account for locational differences.

** There is a sense in which (c) may be viewed as a special case of (d). A quota is
like a discontinuous tariff imposition.





taxes to raise revenue. These rules are not dependent on the assumption
of a representative consumer. (Although our analysis employed this
assumption, a careful reading of the proofs shows that it nowhere entered
the arguments for any of these results, or any of the results cited above.)

Rule 1. In a centrally controlled economy, or in an economy where
100 percent profit taxes may be levied, and in which consumption, trade,
and production taxes may be imposed, no taxes (either on trade or on
domestic production) should be levied on intermediates. Only general
consumption taxes ought to be imposed.

Rule 2. Under the same circumstances as in rule 1, if the only taxes
which can be levied are trade taxes, then the output of intermediate goods
should not be changed from what it would be at international prices.
Goods which are used both as inputs into production and as consumption
goods should be taxed (if it is impossible to treat the same good differently
according to use).

Rule 3. In an economy where profits are not completely taxed away,
both consumption and trade taxes should be employed. Imported intermediate
goods should not be taxed if they can be treated differently
from the (same) domestically produced intermediate goods; otherwise
they should be.

Rule 4. If profits are not taxed at 100 percent, and firms are foreign
owned, then the production tax ought to be such as to reduce the output
of all commodities by the same percentage; the tax is independent of the
desire of the government for tax revenue (see eq. [3.16]).

The following rules provide the detailed form of the tax and tariff
structure (for small taxes) under the assumption of a “representative”
consumer. The exact formulae that are valid regardless of the size of the
revenue are given by the equations in parentheses in the text.

Rule 5. If there is no budget constraint, no foreign exchange constraint,
no quotas, and no commodity whose tariff responds unoptimally
to the level of import or public production, and if it is feasible to impose
both consumption and production taxes, then the tariffs and export
duties should be such that the consumption of all commodities is reduced
(along the compensated demand curve) by the same percentage from
what it would have been had international prices been charged (eqq.
[2.18] and [3.6]).

Rule 6. If profits are not taxed at all and all firms are domestically
owned, then there should be a tax on domestic production such that the
output of all commodities is reduced exactly by the same percentage as
that of the consumption of all commodities due to the trade taxes** (see
eq. [3.8]).

The same qualifications apply to rules 6 and 7.

** As we noted earlier, intermediate goods are treated just like final goods. Recall
that we are treating outputs and inputs symmetrically.




Rule 7. If only trade can be taxed and there is no profit tax, then the
trade taxes ought to be chosen so as to reduce “excess demand” of each
commodity by the same percentage (again along the compensated demand
curve) from what it would have been had international prices been
charged (eq. [4.4]). If firms are foreign owned, the relative reduction in
excess demand should be smaller the smaller is the ratio of excess demand
to domestic private production (eq. [4.8]). If there is 100 percent profit
taxation, then the reduction in excess demand should be inversely
proportional to the ratio of excess demand to consumption** (eq. [4.8a]).

Rule 8. If some tariffs (export duties) are fixed, then the remaining
duties ought to be chosen with a view to seeing that the consumption of
commodities for which trade taxes can be varied is reduced by the same
percentage (along the compensated demand curve) from what it would
have been had international prices for all commodities been charged.**

Rule 9. If there is a quota on commodity k or if the level of tariff on
k is an increasing function of the net import of k, then the tariffs are chosen
so as to ensure that the consumption of complements of k is reduced by
more than is the consumption of substitutes.

The rules for the evaluation of projects in the private sector are
considerably more complicated. Typical of the kind of results available is
the following: for the case where trade taxes are given and profits are
untaxed, the profits, valued at domestic prices, must be greater than the
foreign exchange cost (valued at international prices) times the shadow
price of foreign exchange in terms of domestic consumption.


References 

Atkinson, A. B., and Stiglitz, J. E. “The Structure of Indirect Taxation and
Economic Efficiency.” J. Public Econ. 1 (April 1972): 97-119.

-. “Alternative Approaches to the Redistribution of Income.” Mimeographed.
1973.

Bhagwati, J. N. “Generalized Theory of Distortions and Welfare.” In Trade,
Balance of Payments and Growth, edited by J. N. Bhagwati, R. W. Jones, R.
Mundell, and J. Vanek. Amsterdam: North-Holland, 1971.

Boiteux, M. “Sur la gestion des monopoles publics astreints à l’équilibre
budgétaire.” Econometrica 24 (January 1956): 22-40.

Corden, W. M. “The Structure of a Tariff System and the Effective Protective
Rate.” J.P.E. 74 (June 1966): 221-37.

Dasgupta, P.; Marglin, S. A.; and Sen, A. K. Guidelines for Project Evaluation.
New York: United Nations, 1972.

Dasgupta, P., and Stiglitz, J. E. “Valuation of Tradeables in Benefit-Cost
Analysis.” Mimeographed. 1971.


** In particular, this means that output of intermediate goods is not reduced at all.

** Rules 8 and 9 are predicated on there being both consumption (trade) and production
taxes for the commodities in question. Moreover, we assume that profits are
not taxed.




-. “On Optimal Taxation and Public Production.” Rev. Econ. Studies 39
(January 1972): 87-104.

Debreu, G. The Theory of Value. New York: Wiley, 1959.

Diamond, P. A., and Mirrlees, J. A. “Optimal Taxation and Public Production.”
A.E.R. 61 (March-June 1971): 8-27, 261-78.

Little, I. M. D., and Mirrlees, J. A. Manual of Industrial Project Analysis in Developing
Countries. Paris: Org. Econ. Cooperation and Development, 1969.

Marglin, S. A. “Value and Price in the Labour Surplus Economy.” Mimeographed.
Cambridge, Mass.: Harvard Univ., 1972.

Meade, J. E. Trade and Welfare. Oxford: Oxford Univ. Press, 1955.

Mirrlees, J. A. “On Producer Taxation.” Rev. Econ. Studies 39 (January 1972):
105-12.

Pigou, A. C. A Study in Public Finance. New York: Macmillan, 1947.

Ramaswami, V. K., and Srinivasan, T. N. “Optimal Subsidies and Taxes When
Some Factors Are Traded.” J.P.E. 76 (July/August 1968): 569-82.

Ramsey, F. P. “A Contribution to the Theory of Taxation.” Econ. J. 37 (March
1927): 47-61.

Ruffin, R. J. “Tariffs, Intermediate Goods, and Domestic Protection.” A.E.R. 59
(January 1969): 29-69.

Sen, A. K. “Control Areas and Accounting Prices: An Approach to Project
Evaluation.” Econ. J. 82 (March 1972): 486-501.

Stiglitz, J. E. “Alternative Theories of Wage Determination and Unemployment
in L.D.C.’s. I. The Labor Turnover Model.” Q.J.E. (in press).

-. “Alternative Theories of Wage Determination and Unemployment.
II. The Efficiency Wage Hypothesis.” Discussion Paper no. 357, Cowles
Found., Yale Univ., 1973.

Stiglitz, J. E., and Dasgupta, P. “Differential Taxation, Public Goods, and
Economic Efficiency.” Rev. Econ. Studies 38 (April 1971): 151-74.



Hedonic Prices and Implicit Markets:
Product Differentiation in Pure
Competition


Sherwin Rosen

University of Rochester and Harvard University


A class of differentiated products is completely described by a vector of
objectively measured characteristics. Observed product prices and the
specific amounts of characteristics associated with each good define a set
of implicit or “hedonic” prices. A theory of hedonic prices is formulated
as a problem in the economics of spatial equilibrium in which the entire
set of implicit prices guides both consumer and producer locational
decisions in characteristics space. Buyer and seller choices, as well as the
meaning and nature of market equilibrium, are analyzed. Empirical
implications for hedonic price regressions and index number construction
are pointed out.


I. Introduction and Summary 

This paper sketches a model of product differentiation based on the
hedonic hypothesis that goods are valued for their utility-bearing attributes
or characteristics. Hedonic prices are defined as the implicit prices
of attributes and are revealed to economic agents from observed prices of
differentiated products and the specific amounts of characteristics associated
with them. They constitute the empirical magnitudes explained
by the model. Econometrically, implicit prices are estimated by the first-step
regression analysis (product price regressed on characteristics) in the
construction of hedonic price indexes. With few exceptions, structural


The substance of this paper arose from conversations with H. Gregg Lewis several years
ago. A multitude of other people have contributed advice and criticism. Among them are
William Brock, Stanley Engerman, Robert J. Gordon, Zvi Griliches, Robert E. Lucas, Jr.,
Michael Mussa, and the referee. Remaining errors are my own responsibility. Financial
support from the Center for Naval Analysis and the National Institute of Education is
gratefully acknowledged.



interpretations of the hedonic method are not available.¹ Therefore, our
primary goal is to exhibit a generating mechanism for the observations
in the competitive case and to use that structure to clarify the meaning
and interpretation of estimated implicit prices. It will be shown that these
data generally contain less information than is commonly supposed. However,
the model suggests a method that often can identify the underlying
structural parameters of interest. Also, as a general methodological point,
it is demonstrated that conceptualizing the problem of product differentiation
in terms of a few underlying characteristics instead of a large number
of closely related generic goods leads to an analysis having much in
common with the economics of spatial equilibrium and the theory of
equalizing differences.
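
The first-step regression mentioned above (product price regressed on characteristics) can be sketched in a few lines. This is only an illustration under strong assumptions: a single characteristic, a linear functional form, and made-up observations. The helper `fit_line` is hypothetical and is not a procedure given in the paper.

```python
def fit_line(z, p):
    """Ordinary least squares of price p on a single characteristic z.

    Returns (intercept, slope); the slope is the estimated implicit price
    of one more unit of the characteristic under the assumed linear form.
    """
    n = len(z)
    z_bar = sum(z) / n
    p_bar = sum(p) / n
    cov = sum((zi - z_bar) * (pi - p_bar) for zi, pi in zip(z, p))
    var = sum((zi - z_bar) ** 2 for zi in z)
    slope = cov / var
    return p_bar - slope * z_bar, slope


# Made-up observations: each model offers z units of one characteristic at price p.
z_obs = [1.0, 2.0, 3.0, 4.0, 5.0]
p_obs = [12.0, 14.0, 16.0, 18.0, 20.0]

intercept, implicit_price = fit_line(z_obs, p_obs)
print(intercept, implicit_price)
```

The linear form is chosen purely for brevity; with several characteristics or a nonlinear price schedule the same first step would need a multivariate or flexible specification.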

The model itself amounts to a description of competitive equilibrium
in a plane of several dimensions on which both buyers and sellers locate.
The class of goods under consideration is described by n objectively measured
characteristics. Thus, any location on the plane is represented by a
vector of coordinates $z = (z_1, z_2, \ldots, z_n)$, with $z_i$ measuring the
amount of the ith characteristic contained in each good. Products in the
class are completely described by numerical values of z and offer buyers
distinct packages of characteristics. Furthermore, existence of product
differentiation implies that a wide variety of alternative packages are
available. Hence, transactions in products are equivalent to tied sales
when thought of as bundles of characteristics, suggesting applicability of
the principle of equal advantage for analyzing market equilibrium.

In particular, a price $p(z) = p(z_1, z_2, \ldots, z_n)$ is defined at each point
on the plane and guides both consumer and producer locational choices
regarding packages of characteristics bought and sold. Competition
prevails because single agents add zero weight to the market and treat
prices p(z) as parametric to their decisions. In fact the function p(z) is
identical with the set of hedonic prices (“equalizing differences”) as
defined above, and is determined by some market clearing conditions:
Amounts of commodities offered by sellers at every point on the plane
must equal amounts demanded by consumers choosing to locate there.
Both consumers and producers base their locational and quantity decisions
on maximizing behavior, and equilibrium prices are determined so
that buyers and sellers are perfectly matched. No individual can improve
his position, and all optimum choices are feasible. As usual, market clearing
prices, p(z), fundamentally are determined by the distributions of
consumer tastes and producer costs. We show how it is possible to recover,


¹ Excellent summaries of the hedonic technique are available in Griliches (1971,
chap. 1) and Gordon (1973). Major exceptions to the statement in the text are those
studies dealing with depreciation and obsolescence (see Griliches 1971, chaps. 7 and 8)
and some recent models based on markup pricing (e.g., Ohta and Griliches 1972).




or identify, some of the parameters of these underlying distributions by a 
suitable transformation of the observations. 

An early contribution to the problem of quality variation and the theory
of consumer behavior has been made by Houthakker (1952). His analysis
is designed to take account of the fact that consumers purchase truly
negligible fractions of all goods available to them without having to deal
with a myriad of corner solutions required by conventional theory. That
virtue of Houthakker’s treatment is preserved in the present model. More
recently Becker (1965), Lancaster (1966), and Muth (1966) have extended
Houthakker’s methods to more explicit consideration of utility-bearing
characteristics. Again, the emphasis is on consumer behavior and
properties of market equilibrium have not been worked out, a gap we
hope to fill, in part, here. The spirit of these recent contributions is that
consumers are also producers. Goods do not possess final consumption
attributes but rather are purchased as inputs into self-production functions
for ultimate characteristics. Consumers act as their own “middlemen,”
so to speak. In contrast, the model presented below interposes a
market between buyers and sellers. Producers themselves tailor their goods
to embody final characteristics desired by customers and receive returns
for serving economic functions as intermediaries. These returns arise from
economies of specialized production achieved by specialization and
division of labor through market transactions not available outside
organized markets with self-production.

Section II discusses individual choices in the market and the nature of
market equilibrium. Some simple examples of analytic solutions for
general equilibrium are given in Section III. Section IV presents an
empirical method for identifying the underlying structure from the
observations, while Section V applies the model to price index number
construction in the presence of legislated restrictions. To highlight essential
features, the simplest possible specifications are chosen throughout.
As a further appeal to intuition, use is made of geometrical constructions
wherever possible.


II. Market Equilibrium

Consider markets for a class of commodities that are described by n
attributes or characteristics, $z = (z_1, z_2, \ldots, z_n)$. The components of z
are objectively measured in the sense that all consumers’ perceptions or
readings of the amount of characteristics embodied in each good are
identical, though of course consumers may differ in their subjective valuations
of alternative packages. The terms “product,” “model,” “brand,”
and “design” are used interchangeably to designate commodities of given
quality or specification. It is assumed that a sufficiently large number of
differentiated products are available so that choice among various combinations
of z is continuous for all practical purposes. That is, there is a
“spectrum of products” among which choices can be made. As will be
apparent, this assumption represents an enormous simplification of the
problem. It is obviously better approximated in some markets than others,
and there is no need to belabor its realism.² To avoid complications of
capital theory, possibilities for resale of used items in secondhand markets
are ignored, either by assuming that secondhand markets do not exist, or
alternatively, that goods represent pure consumption.

Each product has a quoted market price and is also associated with a
fixed value of the vector z, so that products markets implicitly reveal a
function $p(z) = p(z_1, \ldots, z_n)$ relating prices and characteristics. This
function is the buyer’s (and seller’s) equivalent of a hedonic price regression,
obtained from shopping around and comparing prices of brands
with different characteristics. It gives the minimum price of any package
of characteristics. If two brands offer the same bundle, but sell for different
prices, consumers only consider the less expensive one, and the identity
of sellers is irrelevant to their purchase decisions. Adopt the convention
of measuring each $z_i$ so that they all may be treated as “goods” (i.e., so
that consumers place positive rather than negative marginal valuations
on them) in the neighborhood of their minimum technically feasible
amounts. Then firms can alter their products and increase z only by use
of additional resources, and $p(z_1, \ldots, z_n)$ must be increasing in all its
arguments. Assume p(z) possesses continuous second derivatives. Since a
major goal of the analysis is to present a picture of how p(z) is determined,
it is inappropriate to place too many restrictions on it at the outset. However,
note that there is no reason for it to be linear as is typically the case.
The reason is that the differentiated products are sold in separate, though
of course highly interrelated, markets. This point is spelled out in some
detail below.

A buyer can force p(z) to be linear if certain types of arbitrage activities
are allowed. Let $z_a$, $z_b$, and $z_c$ be particular values of the vector z.
(i) Suppose $z_a = (1/t)z_b$, and $p(z_a) < (1/t)p(z_b)$, where t is a scalar and
t > 1. Then t units of a model offering $z_a$ yield the same amount of
characteristics as a model offering $z_b$, but at less cost, ruling out transactions
in convex portions of p(z). (ii) Suppose $z_a < z_b < z_c$ and
$p(z_b) > \delta p(z_a) + (1 - \delta)p(z_c)$, where $0 < \delta < 1$ and is defined by
$z_b = \delta z_a + (1 - \delta)z_c$. Then characteristics in amount of $z_b$ could be achieved
by purchasing $\delta$ units of a model containing $z_a$ and $(1 - \delta)$ units of a
model containing $z_c$ at lower cost than by direct purchase of a brand
containing $z_b$, and products in concave portions of p(z) would be
uneconomical. Arbitrage is assumed impossible in what follows (at this point

² This assumption was first employed by L. M. Court (1941) and allows the use of
marginal analysis rather than the programming methods required by Lancaster's (1966)
formulation. Following the general rule, it is not without its costs, however (see below).





we depart from Lancaster [1966]) on the assumption of indivisibility.
This amounts to an assumption that packages cannot be untied. For
example, in terms of one characteristic, two 6-foot cars are not equivalent
to one 12 feet in length, since they cannot be driven simultaneously (case
[i]), while a 12-foot car for half a year and a 6-foot car for the other half
is not the same as 9 feet all year round (case [ii]). Similarly, assume sellers
cannot repackage existing products in this manner or do not find it
economical to do so, as might not be the case with perfect rental markets
and zero transactions and reassembly costs.
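
The two arbitrage cases can be made concrete with a scalar characteristic. The sketch below uses the car lengths from the example (6, 9, and 12 feet) and two hypothetical price schedules, one concave and one convex. The function names and the schedules are illustrative assumptions, not part of the original analysis.

```python
def mixing_weight(z_a, z_b, z_c):
    """delta solving z_b = delta * z_a + (1 - delta) * z_c for scalar z."""
    return (z_c - z_b) / (z_c - z_a)


def mixture_is_cheaper(p, z_a, z_b, z_c):
    """Case (ii): is a delta / (1 - delta) mix of z_a and z_c cheaper than z_b?"""
    delta = mixing_weight(z_a, z_b, z_c)
    return delta * p(z_a) + (1 - delta) * p(z_c) < p(z_b)


def bundle_is_cheaper(p, z_a, t):
    """Case (i): are t units of z_a cheaper than one unit of z_b = t * z_a?"""
    return t * p(z_a) < p(t * z_a)


def concave_price(z):
    return z ** 0.5      # hypothetical concave schedule


def convex_price(z):
    return z ** 2        # hypothetical convex schedule


print(mixture_is_cheaper(concave_price, 6.0, 9.0, 12.0))
print(bundle_is_cheaper(convex_price, 6.0, 2))
```

If untying were feasible, mixing would undercut the middle model under the concave schedule and bundling would undercut the larger model under the convex one, so only a linear schedule would survive. The indivisibility assumption is what rules both operations out.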


A. The Consumption Decision

To begin, suppose consumers purchase only one unit of a brand with a particular value of $z$. Write the utility function as $U(x, z_1, z_2, \ldots, z_n)$, assumed strictly concave, in addition to the other usual properties, where $x$ is all other goods consumed. It would not be difficult to treat $z$ as intermediate goods and relate them to yet more ultimate commodities through self-production functions, but that complication is ignored. Set the price of $x$ equal to unity and measure income, $y$, in terms of units of $x$: $y = x + p(z)$. Maximization of utility subject to the nonlinear budget constraint requires choosing $x$ and $(z_1, \ldots, z_n)$ to satisfy the budget and the first-order conditions $\partial p/\partial z_i = p_i = U_{z_i}/U_x$, $i = 1, \ldots, n$. Optimality is achieved by purchasing a brand offering the desired combination of characteristics. Second-order conditions are fulfilled on the usual assumptions regarding $U$, so long as $p(z)$ is not sufficiently concave (for a general statement of these conditions under a nonlinear constraint see Intriligator [1971]).
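A minimal numerical sketch of the first-order condition may help fix ideas. It rests on assumptions that are not in the paper: a single characteristic, utility $U = \ln x + \alpha \ln z$, and a convex price schedule $p(z) = cz^2$. The function name `optimal_quality` and the parameter values are hypothetical.

```python
import math


def optimal_quality(y, alpha, c, tol=1e-12):
    """Solve p'(z) = U_z/U_x with x = y - p(z).

    Assumed forms (illustrative only): U = ln(x) + alpha*ln(z), p(z) = c*z**2.
    """
    def gap(z):
        x = y - c * z * z                       # budget: y = x + p(z)
        marginal_price = 2.0 * c * z            # p'(z)
        marginal_rate_of_substitution = alpha * x / z   # U_z / U_x
        return marginal_price - marginal_rate_of_substitution

    # gap is increasing in z: negative near z = 0, positive where x = 0.
    lo, hi = 1e-9, math.sqrt(y / c)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if gap(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


z_star = optimal_quality(y=100.0, alpha=0.5, c=1.0)
```

For these particular functional forms the condition can also be solved by hand, $z^* = \sqrt{\alpha y/[c(2 + \alpha)]}$, which provides a check on the bisection.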

To stress the essential spatial context of the problem, define a value or bid function $\theta(z_1, \ldots, z_n; u, y)$ according to

$$U(y - \theta, z_1, \ldots, z_n) = u. \qquad (1)$$

The expenditure a consumer is willing to pay for alternative values of $(z_1, \ldots, z_n)$ at a given utility index and income is represented by $\theta(z; u, y)$. It defines a family of indifference surfaces relating the $z_i$ with "money" (i.e., with $x$ foregone), and has been widely used in urban economics (e.g., see Alonso 1964). Differentiate (1) to obtain

$$\theta_{z_i} = U_{z_i}/U_x > 0, \quad \theta_u = -1/U_x < 0, \quad \text{and} \quad \theta_y = 1, \qquad (2)$$

$$\theta_{z_i z_i} = (U_x^2 U_{z_i z_i} - 2U_x U_{z_i} U_{x z_i} + U_{z_i}^2 U_{xx})/U_x^3 < 0, \qquad (3)$$

where the inequality in (3) follows from the assumptions about the bordered Hessian matrix of $U$. Also, strict concavity of $U$ implies that $\theta$ is concave in $z$. Equations (2) and (3) show that the value function is increasing in $z_i$ at a decreasing rate. Alternatively, $\theta_{z_i}$ is the marginal rate of substitution between $z_i$ and money, or the implicit marginal valuation the consumer places on $z_i$ at a given utility index and income. It indicates his reservation demand price for an additional unit of $z_i$, which is decreasing in $z_i$.
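The derivatives of the value function follow from implicit differentiation of (1). The steps are sketched below, using the fact that $x = y - \theta$ along an indifference surface, so that $\partial x/\partial z_i = -\theta_{z_i}$ when $u$ and $y$ are held fixed. The layout is ours; only the results appear in the text.

```latex
% Differentiate U(y - \theta, z) = u with respect to z_i, u, and y in turn.
\begin{align*}
-U_x\,\theta_{z_i} + U_{z_i} &= 0 &&\Rightarrow\ \theta_{z_i} = U_{z_i}/U_x,\\
-U_x\,\theta_u &= 1 &&\Rightarrow\ \theta_u = -1/U_x,\\
U_x\,(1 - \theta_y) &= 0 &&\Rightarrow\ \theta_y = 1.
\end{align*}
% Differentiate \theta_{z_i} = U_{z_i}/U_x once more, with dx/dz_i = -U_{z_i}/U_x.
\begin{align*}
\theta_{z_i z_i}
 &= \frac{U_x\,\bigl(U_{z_i z_i} - U_{x z_i}U_{z_i}/U_x\bigr)
        - U_{z_i}\,\bigl(U_{x z_i} - U_{xx}U_{z_i}/U_x\bigr)}{U_x^{2}}\\
 &= \frac{U_x^{2}U_{z_i z_i} - 2U_xU_{z_i}U_{x z_i} + U_{z_i}^{2}U_{xx}}{U_x^{3}}.
\end{align*}
```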

The amount the consumer is willing to pay for $z$ at a fixed utility index and income is $\theta(z; u, y)$, while $p(z)$ is the minimum price he must pay in the market. Therefore, utility is maximized when $\theta(z^*; u^*, y) = p(z^*)$ and $\theta_{z_i}(z^*; u^*, y) = p_i(z^*)$, $i = 1, \ldots, n$, where $z^*$ and $u^*$ are optimum quantities. In other words, optimum location on the $z$-plane occurs where the two surfaces $p(z)$ and $\theta(z; u^*, y)$ are tangent to each other. One dimension of consumer equilibrium is illustrated in figure 1, where the surfaces have been projected onto the $\theta$-$z_1$ plane cut at $(z_2^*, \ldots, z_n^*)$. A family of indifference curves, of which only one member (at $u^*$) is shown, is defined by $\theta(z_1, z_2^*, \ldots, z_n^*; u, y)$. Two different buyers are shown in the figure, one with value function $\theta^1$ and the other with $\theta^2$. The latter purchases a brand offering more $z_1$.*

In general, far less can be said than in the standard analysis about comparative statics, because the budget constraint is nonlinear. Differentiate $\theta_{z_i}$ with respect to $u$; the numerator of the resulting expression, $U_x U_{x z_i} - U_{z_i} U_{xx}$, is recognized as determining the sign of the income elasticity of demand for "good" $z_i$ in standard theory when the other components of $z$ are "held constant." If all these derivatives are positive ($z_i$ is "normal" in this restricted sense for all $i$), the gradient of $\theta$ unambiguously increases as $u$ increases. Additional income always increases maximum attainable utility. Hence if $p(z)$ is convex and sufficiently regular everywhere, we might expect higher income consumers to purchase greater amounts of all characteristics. Only in that case would it be true that larger income leads to an unambiguous increase in the overall "quality" consumed, and differentiated products' markets would tend to be stratified by income. However, in general there is no compelling reason why overall quality should always increase with income. Some components may increase and others decrease (cf. Lipsey and Rosenbluth 1971). Be that as it may, a clear consequence of the model is that there are natural tendencies toward market segmentation, in the sense that consumers with similar value functions purchase products with similar specifications. This is a well-known result of spatial equilibrium models. In fact, the above specification is very similar in spirit to Tiebout's (1956) analysis of the implicit market for neighborhoods, local public goods being the "characteristics" in this case. He obtained the result that neighborhoods tend to be segmented by distinct income and taste groups (also, see Ellickson 1971). That result holds true for other differentiated products too.

* Lewis (1969) employs a similar construction in analyzing the problem of hours of work as a tied sale. Jobs offer a fixed wage-hour package, which varies from job to job. The market establishes a function relating wages and hours on which both workers and employers base their decisions.
Allowing a parameterization of tastes across consumers, the utility function may be written $U(x, z_1, \ldots, z_n; \alpha)$, where $\alpha$ is a parameter that differs from person to person. Equilibrium value functions depend on both $y$ and $\alpha$. A joint distribution function $F(y, \alpha)$ is given in the population at large, and equilibrium of all consumers is characterized by a family of value functions whose envelope is the market hedonic or implicit price function.

The model is easily expanded to include several quantities, so long as consumers are restricted to purchasing only one model. Following Houthakker (1952), the utility function becomes $U(x, z_1, \ldots, z_n, m)$, where $m$ is the number of units consumed of a model with characteristics $z$. The constraint is $y = x + mp(z)$, and necessary conditions become

$$\partial U/\partial m = -p(z)U_x + U_m = 0, \qquad (4)$$

$$\partial U/\partial z_i = -mp_i(z)U_x + U_{z_i} = 0. \qquad (5)$$

The value function is still defined as the amount a consumer is willing to pay for $z$ at a fixed utility index but now with the proviso that $m$ is optimally chosen. That is, $\theta(z_1, \ldots, z_n)$ is defined by eliminating $m$ from

$$u = U(y - m\theta, z_1, \ldots, z_n, m),$$

$$U_m/U_x = \theta.$$

Again, $\theta_{z_i}$ is proportional to $U_{z_i}/U_x$. The logic underlying figure 1 remains intact, and it can just as well serve for this case. However, second-order conditions are now more complex. For example, convexity of $p(z)$ is no longer sufficient for a maximum as it was in the case where $m$ was restricted to be unity. Also, it is necessary to employ stronger assumptions than those used above if the value function $\theta$ is to be concave.

Note there is no question of monopsony involved here. Consumers act competitively in spite of the fact that marginal cost of quality, $p_i(z)$, is not necessarily constant (it is increasing in figure 1), because as many units as desired of any brand can be purchased without affecting prices. The function $p(z)$ is the same for all buyers and independent of $m$.

B. The Production Decision

Having set up the formal apparatus above, we give a symmetrical and consequently brief account of producers' locational decisions. What package of characteristics is to be assembled? Let $M(z)$ denote the number of units produced by a firm of designs offering specification $z$. The discussion is limited to the case of nonjoint production, in which each production establishment within the firm specializes in one design, and there are no cost spillovers from plant to plant. Thus a "firm" is an arbitrary collection of atomistic production establishments, each one acting independently of the others. Analytical difficulties arising from true joint production are noted in passing.

Total costs in an establishment are $C(M, z; \beta)$, derived from minimizing factor costs subject to a joint production function constraint relating $M$, $z$, and factors of production. The shift parameter $\beta$ reflects underlying variables in the cost minimization problem, namely, factor prices and production function parameters. Assume $C$ is convex with $C(0, z) = 0$ and $C_M$ and $C_{z_i} > 0$. There are no production indivisibilities, and marginal costs of producing more units of a model of given design are positive and increasing. Similarly, marginal costs of increasing each component of the design are also positive and nondecreasing. (Ordinarily, there will be some technological constraints that limit the set of feasible locations on the plane.) Each plant maximizes profit $\pi = Mp(z) - C(M, z_1, \ldots, z_n)$ by choosing $M$ and $z$ optimally, where unit revenue on design $z$ is given by the implicit price function for characteristics, $p(z)$.* Again, firms are competitors and not monopolists even though marginal costs of attributes $p_i(z)$ are not necessarily constant, because all establishments observe the same prices and cannot affect them by their individual production decisions: $p(z)$ is independent of $M$.

* Our inability to treat joint production nontrivially yet simply stems from the spectrum-of-commodities assumption. If a finite number (say $s$) of packages is available, it would be straightforward formally to specify a standard $s$-component multiple product cost function for the firm, and proceed on that basis. In the present case, firms engage in joint production only insofar as they own establishments specializing in different packages. However, genuine joint production requires cost dependencies between production units within the firm; the firm must choose a function $M(z)$ describing an entire "product line" offered in the market. The entire function $M(z)$ is an argument in each plant's costs, and total costs in turn are the sum (or integral) over all production establishment costs. A complete treatment requires use of functional analysis and is beyond the scope of this paper.


Optimal choice of $M$ and $z$ requires

$$p_i(z) = C_{z_i}(M, z_1, \ldots, z_n)/M, \quad i = 1, \ldots, n, \qquad (6)$$

$$p(z) = C_M(M, z_1, \ldots, z_n). \qquad (7)$$

At the optimum design, marginal revenue from additional attributes equals their marginal cost of production per unit sold. Furthermore, quantities are produced up to the point where unit revenue $p(z)$ equals marginal production cost, evaluated at the optimum bundle of characteristics. As above, convexity of $C$ does not assure second-order conditions due to nonlinearity of $p(z)$, and some stronger conditions, assumed to be satisfied in what follows, are required (see Intriligator 1971).
Symmetrically with the treatment of demand, define an offer function $\phi(z_1, \ldots, z_n; \pi, \beta)$ indicating unit prices (per model) the firm is willing to accept on various designs at constant profit when quantities produced of each model are optimally chosen. A family of production "indifference" surfaces is defined by $\phi$. Then $\phi(z_1, \ldots, z_n; \pi, \beta)$ is found by eliminating $M$ from

$$\pi = M\phi - C(M, z_1, \ldots, z_n) \qquad (8)$$

and

$$C_M(M, z_1, \ldots, z_n) = \phi, \qquad (9)$$

and solving for $\phi$ in terms of $z$, $\pi$, and $\beta$. Differentiate (8) and (9) to obtain $\phi_{z_i} = C_{z_i}/M > 0$ and $\phi_\pi = 1/M > 0$.
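As an illustration only, the elimination of $M$ can be carried out in closed form for the quadratic cost function $C = (a/2)M^2 z_1^2$ that the paper uses in its short-run example. Applying that cost function to the offer function is an assumption made for this sketch. Here $\pi$ denotes profit throughout.

```latex
% Offer function for the illustrative cost C(M, z_1) = (a/2) M^2 z_1^2.
\begin{align*}
\text{(9):}\quad & \phi = C_M = a M z_1^{2},\\
\text{(8):}\quad & \pi = M\phi - C = a M^{2} z_1^{2} - \tfrac{a}{2} M^{2} z_1^{2}
                  = \tfrac{a}{2} M^{2} z_1^{2}
                  \ \Rightarrow\ M = \sqrt{2\pi/a}\,\big/\,z_1,\\
\text{hence}\quad & \phi(z_1;\pi) = z_1\sqrt{2a\pi},\\
& \phi_{z_1} = \sqrt{2a\pi} = C_{z_1}/M, \qquad
  \phi_{\pi} = z_1\sqrt{a/(2\pi)} = 1/M .
\end{align*}
```

Both derivatives agree with the general expressions $\phi_{z_i} = C_{z_i}/M$ and $\phi_\pi = 1/M$. In this special case the offer function happens to be linear in $z_1$.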

The marginal reservation supply price for attribute $i$ at constant profit, assumed increasing in $z_i$, is $\phi_{z_i}$. Again, convexity of $C$ does not always guarantee $\phi_{z_i z_i} > 0$. Since $\phi$ is the offer price the seller is willing to accept on design $z$ at profit level $\pi$, while $p(z)$ is the maximum price obtainable for those models in the market, profit is maximized by an equivalent maximization of the offer price subject to the constraint $p = \phi$. Thus maximum profit and optimum design satisfy $p_i(z^*) = \phi_{z_i}(z_1^*, \ldots, z_n^*; \pi^*, \beta)$, for $i = 1, \ldots, n$, and $p(z^*) = \phi(z_1^*, \ldots, z_n^*; \pi^*, \beta)$. Producer equilibrium is characterized by tangency between a profit-characteristics indifference surface and the market characteristics-implicit price surface.

One dimension of the solution is depicted in figure 2, where $\phi(z_1, z_2^*, \ldots, z_n^*; \pi, \beta)$ defines a family of curves on the $z_1$-$\phi$ plane cut through the indifference surface at the optimum values of the other attributes. Only one member is shown in the figure. The curve labeled $\phi^1$ refers to a production unit possessing production and cost conditions making it well suited to produce lesser amounts of $z_1$, while the one labeled $\phi^2$ refers to a firm with a comparative advantage at producing higher values of $z_1$. That is, the two plants have distinct values of the parameter $\beta$. More generally, there is a distribution of $\beta$ across all potential sellers. Let $G(\beta)$ represent that distribution. Then producer equilibrium is characterized by a family of offer functions that envelop the market hedonic price functions.

What is the empirical content of $\beta$? It is anything that shifts cost conditions among firms. Thus, differences in factor prices are one possibility. For example, many products are produced in several countries and are traded on national markets (for examples, see Griliches [1971], chap. 5). There is no reason to assume equalization of factor prices in these cases. More generally, anything allowing identification of conventional multiproduct production functions in cross-section data serves to provoke differences in $\beta$. Factor price differences across states or regions within a country often serve this purpose and do so here as well. Second, differences in "technology," as reflected by typically unmeasured, firm-specific factors of production, also act as supply shifters across firms. For example, agricultural production function research often treats education of the farm operator in this manner. Firm-specific R&D expenditure as well as the phenomena of progress-function learning also serve these purposes.




C. What Do Hedonic Prices Mean?

An answer to the question is an immediate application of the above analysis. Superimpose figure 2 onto figure 1. In equilibrium, a buyer and seller are perfectly matched when their respective value and offer functions "kiss" each other, with the common gradient at that point given by the gradient of the market clearing implicit price function $p(z)$. Therefore, observations $p(z)$ represent a joint envelope of a family of value functions and another family of offer functions. An envelope function by itself reveals nothing about the underlying members that generate it; and they in turn constitute the generating structure of the observations. Some qualifications are necessary, however. (a) Suppose there is no variance in $\beta$ and all firms are identical. Then the family of offer functions degenerates to a single surface, and $p(z)$ must be everywhere identical with a unique offer function. Price differences between various packages are exactly equalizing among sellers because offer functions are constructed at constant profit. A variety of packages appear on products markets to satisfy differences in preferences among consumers, and the situation persists because no firm finds it advantageous to alter the quality content of its products. (b) Suppose sellers differ, but buyers are identical. Then the family of value functions collapses to a single function and is identical with the hedonic price function. Observed price differences are exactly equalizing across buyers, and $p(z)$ identifies the structure of demand.


III. Existence of Market Equilibrium

Analysis of consumer and producer decisions has proceeded on the assumption of market equilibrium. This section demonstrates some details of equilibrium price and quantity determination. Market quantity demanded for products with characteristics $z$ is $Q^d(z)$, and $Q^s(z)$ is market quantity supplied with those attributes. It is necessary to find a function $p(z)$ such that $Q^d(z) = Q^s(z)$ for all $z$, when buyers and sellers act in the manner described above. The fundamental difficulty posed by this problem is that $Q^d(z)$ and $Q^s(z)$ depend on the entire function $p(z)$. For example, suppose quantities demanded and supplied at a particular location do not match at prevailing prices. The effect of a change in price at that point is not confined to models with those particular characteristics but induces substitutions and locational changes everywhere on the plane. A very general treatment of the problem is found in Court (1941), and our discussion is devoted to some examples. These examples have been chosen for their simplicity but illuminate the problem and illustrate most of the basic issues. In contrast to the rest of the paper, discussion is specialized to the case where goods are described by exactly one attribute (i.e., $n = 1$). Therefore $z_1$ represents an unambiguous measure of "quality." When $n = 1$, the location surface degenerates to a line rather than a plane, and products are unequivocally ranked by their $z$ content.


A. Short-Run Equilibrium

Consider a short-run equilibrium in which firms have geared up for the quality ($z_1$) of goods they can produce and are only capable of varying quantities. The horizon is sufficiently short so that new entry is precluded, and the distribution of firms by quality is given as an initial condition. The market reveals an implicit price function $p(z_1)$, and each firm determines the quantity it supplies to the market according to condition (7). Market supply in a small interval $dz_1$ near quality $z_1$ is found by weighting firm supply by the quality distribution function. Consumers differ in tastes and income, but all determine optimal quality and quantity as in (4) and (5). Market demand near any quality $z_1$ is found by using the conditions of consumer equilibrium to transform the distribution of tastes and income into a distribution of qualities demanded and weighting individual quantities demanded by the resulting distribution of qualities. Finally, setting demand equal to supply yields a differential equation in $p$ and $z_1$ that must be satisfied by market equilibrium, subject to some boundary conditions.

To be specific, assume that $C(M, z) = (a/2)M^2 z_1^2$ for all firms. Also, suppose firms are uniformly distributed by the characteristic $z_1$: $g(z_1)dz_1 = k\,dz_1$ for $z_{1s} \le z_1 \le z_{1l}$, where $k$ is a constant and $z_{1l}$ and $z_{1s}$ are exogenously determined upper and lower limits of the product line. Apply equation (7) to obtain firm supply $M(z_1) = p/(az_1^2)$, since qualities cannot be varied by assumption. Therefore,

$$Q^s(z_1)dz_1 = g(z_1)M(z_1)dz_1 = [(k/a)(p/z_1^2)]dz_1. \qquad (10)$$

Assume a fixed number of consumers in the population and that only one unit per customer of the optimal model is purchased. Consumers have the same income, and utility is linear in $x$ and $z_1$, with the marginal rate of substitution, $\rho$, varying from person to person. Maximize $U(x, z_1) = x + \rho z_1$ subject to $y = x + p(z_1)$. Each consumer purchases a brand for which $dp/dz_1 = p'(z_1) = \rho$. In this case the value functions of figure 1 are straight lines with a different slope, $\rho$, for each person. The marginal condition characterizes consumer choice so long as $p'' > 0$, which will be shown to be true. Suppose $\rho$ is distributed uniformly, $f(\rho)d\rho = b\,d\rho$ for $\rho_s \le \rho \le \rho_l$, where $b$ is a constant and $\rho_l$ and $\rho_s$ are, respectively, the largest and smallest marginal rates of substitution in the population. Use the marginal condition $p' = \rho$ to transform $f(\rho)d\rho$ into a distribution of $z_1$. Then

$$Q^d(z_1)dz_1 = f(z_1)\left|\frac{d\rho}{dz_1}\right|dz_1 = b\,p''(z_1)dz_1. \qquad (11)$$


Price must clear the market at every quality. Equating (10) and (11), $p(z_1)$ must satisfy the differential equation

$$(k/ba)\,p/z_1^2 = d^2p/dz_1^2. \qquad (12)$$

Equation (12) is a special case of what is called "Euler's equation" and has a known solution of the form

$$p = c_1 z_1^r + c_2 z_1^s, \qquad (13)$$

where $c_1$ and $c_2$ are constants determined by the boundary conditions and $r$ and $s$ are the two roots of $r^2 - r - (k/ab) = 0$: $r = (1 + \sqrt{1 + 4k/ab})/2$ and $s = (1 - \sqrt{1 + 4k/ab})/2$. The parameters $r$ and $s$ are real numbers and $r > 0$ and $s < 0$. Furthermore, $p'(z_1)$ would not be positive throughout its range unless $c_1 > 0$ and $c_2 < 0$, and consumers could not be interior at those points. Equation (13) is graphed in figure 3 on that assumption. Note that $p$ in (13) exhibits an inflection point at $z_{10} = (-c_2/c_1)^{1/(r-s)}$, and it so happens that $p(z_{10}) = 0$. Therefore $p'' > 0$ for $z_1 > z_{10}$.
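The solution can be checked numerically. Substituting the trial form $p = z_1^r$ into (12) gives $r(r - 1) = k/(ab)$, so the exponents are the roots of $r^2 - r - k/(ab) = 0$. The sketch below evaluates the residual of (12) and the inflection point for arbitrary constants. The function names and parameter values are hypothetical and chosen only for illustration.

```python
import math


def euler_exponents(a, b, k):
    """Roots of r**2 - r - k/(a*b) = 0: r > 1 and s < 0."""
    disc = math.sqrt(1.0 + 4.0 * k / (a * b))
    return (1.0 + disc) / 2.0, (1.0 - disc) / 2.0


def price(z, c1, c2, r, s):
    """Candidate price function p(z) = c1*z**r + c2*z**s, equation (13)."""
    return c1 * z ** r + c2 * z ** s


def price_second_derivative(z, c1, c2, r, s):
    """Analytic p''(z) for equation (13)."""
    return (c1 * r * (r - 1.0) * z ** (r - 2.0)
            + c2 * s * (s - 1.0) * z ** (s - 2.0))


def ode_residual(z, a, b, k, c1, c2):
    """Left side minus right side of (12): (k/(b*a))*p/z**2 - p''."""
    r, s = euler_exponents(a, b, k)
    lhs = (k / (b * a)) * price(z, c1, c2, r, s) / z ** 2
    return lhs - price_second_derivative(z, c1, c2, r, s)


def inflection_point(c1, c2, r, s):
    """z10 = (-c2/c1)**(1/(r - s)); requires c1 > 0 > c2."""
    return (-c2 / c1) ** (1.0 / (r - s))
```

With, say, `a, b, k = 2.0, 0.5, 3.0` and `c1, c2 = 1.3, -0.4`, the residual is zero up to rounding error at any positive $z_1$, and both $p$ and $p''$ vanish at the inflection point.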

Boundary conditions. Competition requires there be no masses of consumers at any quality, for there are few sellers located at any point and they would otherwise add nonzero weight to the market. As seen in Section II, consumers with high values of $\rho$ buy higher-quality models, and it must be true that those for whom $\rho = \rho_l$ purchase the highest quality available. Otherwise prices of quality $z_{1l}$ would fall, a great mass of consumers would switch over to them, driving the price back up and causing those buyers to relocate again. Therefore, one boundary condition is $p'(z_{1l}) = \rho_l$, or

$$\rho_l = rc_1 z_{1l}^{r-1} + sc_2 z_{1l}^{s-1}. \qquad (14)$$

The other boundary condition is found by examining the lower end of the line. The following three cases cover all relevant possibilities.

1. $z_{1s} = 0$ and $\rho_s > 0$. Firms choose not to sell at negative prices (see fig. 3) and all plants geared to produce qualities less than $z_{10}$ (to be determined) shut down. On the other hand, all consumers value $z_1$ at least as much as its minimum supply price (i.e., zero) and it must be true that they all buy some value of $z_1$. Individuals for whom $\rho = \rho_s$ consume the lowest qualities appearing on the market, for if they chose qualities greater than $z_{10}$, prices of models in the neighborhood of $z_{10}$ would fall to zero, inducing low-$\rho$ customers to relocate there and driving their prices back up. Thus a second boundary condition is $p'(z_{10}) = \rho_s$, or

$$\rho_s = rc_1 z_{10}^{r-1} + sc_2 z_{10}^{s-1}. \qquad (15)$$

The parameters $z_{10}$, $c_1$, and $c_2$ are determined by equations (14) and (15) plus the definition of $z_{10}$. It can be shown that $c_1 > 0$ and $c_2 < 0$, as required by the second-order conditions of consumer equilibrium. Therefore, the equilibrium hedonic price function appears as a portion of the curve in figure 3 in the interval $(z_{10}, z_{1l})$.

2. If $\rho_s = 0 = z_{1s}$, all producers must be in the market, and it follows that $z_{10} = 0$. This is possible only if $p'(0) = \rho_s = 0$, and $c_2$ must be zero. In this case price is a log-linear function of quality.

3. $z_{1s} > 0$ and $\rho_s = 0$. Now some consumers do not value $z_1$ very highly, and there is a definite limit to the smallest amount available. Clearly, $p(z_{1s})$ must exceed zero and some consumers must be driven out of the market, finding it optimal not to consume the product at all. If not, consumers with small values of $\rho$ would mass on $z_{1s}$ (there would be a corner solution there), adding finite weight to the market and causing $p(z_{1s})$ to explode. Using the budget constraint, the market rate of exchange between not buying at all and buying $z_{1s}$ is $[y - p(z_{1s})]/z_{1s}$, and must equal the slope of the value function for buyers at that (extensive) margin. That is, the condition $[y - p(z_{1s})]/z_{1s} = p'(z_{1s})$ replaces equation (15), after substituting for $p$ and $p'$ from (13), in the determination of $c_2$ and $c_1$. The hedonic price function also can be illustrated in figure 3 as the portion of the curve between the points such as those marked $A$ and $z_{1l}$. Again, $c_1$ and $c_2$ have the correct signs and the second-order conditions are fulfilled.
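For the first of the three cases the boundary conditions can be solved numerically. The sketch below is one way to do so, not the paper's procedure. It uses the definition of the inflection point, $c_2 = -c_1 z_{10}^{\,r-s}$, to reduce (14) and (15) to a single equation in $t = z_{10}/z_{1l}$, which is then solved by bisection. The function names, the argument `z_top` (standing for $z_{1l}$), and the parameter values are hypothetical.

```python
import math


def euler_exponents(a, b, k):
    """Roots r > 1 and s < 0 of r**2 - r - k/(a*b) = 0."""
    disc = math.sqrt(1.0 + 4.0 * k / (a * b))
    return (1.0 + disc) / 2.0, (1.0 - disc) / 2.0


def solve_case1(a, b, k, rho_s, rho_l, z_top, tol=1e-12):
    """Solve (14), (15) and p(z10) = 0 for (c1, c2, z10) when rho_s > 0."""
    if not (0.0 < rho_s < rho_l):
        raise ValueError("need 0 < rho_s < rho_l")
    r, s = euler_exponents(a, b, k)

    # With c2 = -c1 * z10**(r - s), dividing (14) by (15) leaves
    # g(t) = rho_l / rho_s in t = z10 / z_top, with g decreasing on (0, 1].
    def g(t):
        return (r * t ** (1.0 - r) - s * t ** (1.0 - s)) / (r - s)

    target = rho_l / rho_s
    lo, hi = 1e-12, 1.0          # g(lo) is very large, g(1) = 1 < target
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if g(mid) > target:
            lo = mid
        else:
            hi = mid
    z10 = 0.5 * (lo + hi) * z_top
    c1 = rho_s / ((r - s) * z10 ** (r - 1.0))   # from (15)
    c2 = -c1 * z10 ** (r - s)                   # definition of z10
    return c1, c2, z10, r, s


c1, c2, z10, r, s = solve_case1(a=1.0, b=1.0, k=2.0,
                                rho_s=1.0, rho_l=4.0, z_top=1.0)
```

In this example $k/(ab) = 2$, so $r = 2$ and $s = -1$. The solution returns $c_1 > 0$ and $c_2 < 0$, in line with the sign pattern the text requires.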

A second type of short-run equilibrium could be considered in which existing firms can alter qualities as well as quantities of their products. When there is a distribution of cost functions, it is necessary to proceed analogously to the treatment of demand in the example above. For example, costs might be described by $(a/2)M^2 z_1^{\lambda}$ with $\lambda$ varying across firms. Then $(\lambda/2) = z_1 p'/p$ is used to transform the distribution of $\lambda$ into a distribution of qualities supplied. The resulting distribution weights firm quantities supplied in the determination of market supply at any quality. A little experimentation will show that the differential equation resulting from setting $Q^d(z_1) = Q^s(z_1)$ is nonlinear in most cases, and closed solutions are not always feasible.
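The transformation condition used in this second example follows from conditions (6) and (7). A short derivation is sketched below, on the reading that the cost function is $C = (a/2)M^2 z_1^{\lambda}$.

```latex
\begin{align*}
\text{(6):}\quad p'(z_1) &= C_{z_1}/M = \tfrac{a}{2}\,\lambda\,M\,z_1^{\lambda-1},\\
\text{(7):}\quad p(z_1) &= C_M = a\,M\,z_1^{\lambda},\\
\frac{p'(z_1)}{p(z_1)} &= \frac{\lambda}{2 z_1}
 \quad\Longrightarrow\quad \frac{\lambda}{2} = \frac{z_1\,p'(z_1)}{p(z_1)}.
\end{align*}
```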


B. Long-Run Equilibrium

Firms may vary qualities at will and also construct establishments of optimum size. No entry restrictions imply the absence of profit ($\pi^* = 0$), and long-run offer price for each firm must satisfy $\phi(z; \beta) = C(M, z; \beta)/M$. Plants are constructed to produce models of quality $z$ at minimum cost. Hence scale economies are exhausted under competition and the optimum production unit occurs where $C(M, z; \beta)$ is linear in $M$, variations of quantity being achieved by changes in the number of establishments. Let $h(z; \beta)$ represent minimum average cost of $z$ for an establishment of optimum size. Then $C(M, z; \beta) = Mh(z; \beta)$ in the long run. Therefore $\phi = h(z; \beta)$, and $p(z) = h(z; \beta)$ is the equilibrium condition for maximum profit, and $p(z)$ is completely determined by supply, or by the envelope of the family $h(z; \beta)$ with respect to $\beta$. Generalization to $n$ characteristics is obvious in this case.


IV. An Identification Problem 

Section III demonstrated that complete solutions for $p(z)$ and the distribution of qualities traded sometimes can be obtained if sufficient a priori structure is imposed on the problem. However, it is not always possible to proceed in that manner. In general, the differential equation defining $p(z)$ is nonlinear and it may not be possible to find closed solutions. Moreover, a great deal of structure must be imposed. For example, the distribution of income follows no simple law throughout its range, making it difficult to specify the problem completely. Finally, partial-differential equations must be solved when there is more than one characteristic. This section sketches an alternative and more efficient procedure, based on the analysis of Section II.






As shown above, derivatives of a consumer's value function, $\theta_{z_i}$, are proportional to marginal rates of substitution. They are reservation-demand prices for additional amounts of $z_i$ at a constant utility index. Therefore $\{\theta_{z_i}(z)\}$ are the inverses of a set of ordinary compensated demand functions for the $z_i$'s. The marginal cost of $z_i$ to the consumer is $p_i(z)$, and optimal $z_i$ is determined where marginal costs equal marginal values. One dimension of these marginal concepts is illustrated in figure 4. The curves labeled $\theta^j_{z_1}$ are derivatives of $\theta^j$ in figure 1 and reflect compensated demand functions for various buyers. The dashed line labeled $p_1(z)$ is the common marginal cost confronting all buyers. Consumer choice is given by the intersection of demand and marginal cost. It should be emphasized that the functions $\theta_{z_i}(z)$ are compensated demand prices (real income held constant) and can only be derived once equilibrium is determined, as in Section II. For example, a new equilibrium resulting from an exogenous shift in $p$ would not always be given by the intersection of the new marginal costs and the initial compensated demand price functions. An exception occurs when $\theta_{z_i u} = 0$ and the family of surfaces $\theta(z, u)$, such as depicted in figure 1, are all parallel to each other: $\theta_{z_i u} = 0$ is equivalent to constant marginal utility of money, and $\theta_{z_i}$ is unique and independent of $u$ only in that case. If $\theta_{z_i u} \ne 0$, the shape and location of the $\theta_{z_i}$ functions are determined by the equilibrium conditions of Section II: tangency between $p(z)$ and $\theta(z; u^*)$.

A similar procedure applies to firms: $\phi_{z_i}$ is the reservation supply price of incremental $z_i$ and reflects a profit-compensated supply function for characteristic $z_i$; $p_i(z)$ is the marginal revenue function for $z_i$ facing each firm. One dimension of producer equilibrium is shown in figure 4 as the intersection of a set of compensated supply curves for various firms, $\phi^j_{z_1}$, with a common marginal revenue function, $p_1(z)$.

Figure 4 reiterates the major conclusion of Section II in terms of derivatives of $p(z)$. Equilibrium is described by the intersection of supply and demand functions. However, income effects have been removed, in distinction to the typical case. Observed marginal hedonic prices merely connect equilibrium reservation prices and characteristics and reveal little about underlying supply and demand functions.

However, figure 4 suggests a method that can be used for estimation. In principle, data are available on designs purchased by buyers and also on their incomes and taste variables such as age, education, etc. Denote these empirical counterparts of $\alpha$ by a vector $Y_1$. Data are also potentially available on the characteristics' content of models produced by sellers and factor price and specific technological differences among them. Denote the empirical counterparts of $\beta$ by a vector $Y_2$. Following figure 4, let $F^i(z, Y_1)$ represent the marginal demand price for $z_i$ and $G^i(z, Y_2)$ represent the marginal supply price. Ignoring random terms, the model to be estimated can be written as

$$p_i(z) = F^i(z_1, \ldots, z_n, Y_1) \quad \text{(demand)}, \qquad (16)$$

$$p_i(z) = G^i(z_1, \ldots, z_n, Y_2) \quad \text{(supply)}, \qquad (17)$$

for $i = 1, \ldots, n$, where $p_i$ and $z_i$ are all jointly dependent variables and $Y_1$ and $Y_2$ are exogenous demand and supply shift variables. The $2n$ equations determine the $2n$ endogenous variables $p_i$ and $z_i$. Estimation requires a two-step procedure. First, estimate $p(z)$ by the usual hedonic method, without regard to $Y_1$ and $Y_2$. That is, regress observed differentiated products' prices, $p$, on all of their characteristics, $z$, using the best fitting functional form. This econometrically duplicates the information acquired by agents in the market, on the basis of which they make their decisions. Denote the resulting estimate of the function $p(z)$ by $\hat{p}(z)$. Next, compute a set of implicit marginal prices, $\partial \hat{p}(z)/\partial z_i = \hat{p}_i(z)$, for each buyer and seller, evaluated at the amounts of characteristics (numerical values of $z$) actually bought or sold, as the case may be. Finally, use estimated marginal prices $\hat{p}_i(z)$ as endogenous variables in the second-stage simultaneous estimation of equations (16) and (17). Estimation of marginal prices plays the same role here as do direct observations on prices in the standard theory and converts the second-stage estimation into a garden-variety identification problem. There are four cases to consider:

1. There is no variance in $\beta$ and cost conditions are identical across firms. The variables $Y_2$ drop out of equation (17) and $\hat{p}(z)$ identifies the offer function. Similarly, the sample observations on $\hat{p}_i(z)$ and the $z_i$ identify compensated supply functions. Suppose several cross sections for different years are available and firms' production functions have been subject to technical change. Then within-year hedonic price regressions identify supply conditions for each year. Changes in marginal prices and qualities induced by changing technology and cost conditions between years approximately sweep out the structure of preferences and compensated demand functions (with due qualification for the nonconstancy of the marginal utility of money).

2. If buyers are identical, but sellers differ, $Y_1$ drops out of (16) and single cross-sectional observations trace out compensated demand functions.

3. If buyers are identical and so are sellers, offer and value functions are tangent at a single point, and only one quality appears on the market. The observations degenerate to a single point; there is no product differentiation and no problem.

4. In general there is both a distribution of buyers and another distribution of sellers. Both $Y_1$ and $Y_2$ have nonzero variance, and the usual identifying rank and order conditions apply. A necessary prior condition for estimation is that $p(z)$ be nonlinear at stage one. For if $\hat{p}(z)$ happens to be linear, $\hat{p}_i(z)$ are constants, independent of qualities traded, and display zero variance across sample observations. As shown above, linearity of $p(z)$ is unlikely so long as there is increasing marginal cost of attributes for sellers and it is not possible to untie packages. But it is obvious that the model does not apply if very few distinct products are actually traded.
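The first stage of the two-step procedure, and the computation of implicit marginal prices, can be sketched as follows. Everything specific here is an assumption made for illustration: a single characteristic, a quadratic functional form for $\hat{p}(z)$, synthetic noise-free data, and the hypothetical function names. The second stage, simultaneous estimation of (16) and (17), is not shown.

```python
def solve_linear_system(matrix, rhs):
    """Gaussian elimination with partial pivoting for a small square system."""
    n = len(rhs)
    aug = [list(row) + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda row: abs(aug[row][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(col + 1, n):
            factor = aug[row][col] / aug[col][col]
            for j in range(col, n + 1):
                aug[row][j] -= factor * aug[col][j]
    solution = [0.0] * n
    for row in range(n - 1, -1, -1):
        tail = sum(aug[row][j] * solution[j] for j in range(row + 1, n))
        solution[row] = (aug[row][n] - tail) / aug[row][row]
    return solution


def fit_quadratic_hedonic(prices, qualities):
    """First stage: least-squares fit of p on (1, z, z**2), normal equations."""
    rows = [[1.0, z, z * z] for z in qualities]
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xty = [sum(r[i] * p for r, p in zip(rows, prices)) for i in range(3)]
    return solve_linear_system(xtx, xty)


def marginal_prices(coef, qualities):
    """Implicit marginal price b1 + 2*b2*z at each observed quality."""
    _, b1, b2 = coef
    return [b1 + 2.0 * b2 * z for z in qualities]


# Synthetic, noise-free observations (illustrative only, not from the paper).
z_obs = [1.0 + 0.5 * i for i in range(9)]
p_obs = [2.0 + 0.5 * z + 0.25 * z * z for z in z_obs]

coef_hat = fit_quadratic_hedonic(p_obs, z_obs)
p_hat_1 = marginal_prices(coef_hat, z_obs)
```

Because the fitted function is nonlinear in $z$, the estimated marginal prices vary across observations, which is the prior condition for second-stage estimation noted in case 4. Had the fit been linear, every entry of `p_hat_1` would be the same constant.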


V. Price Indexes, Economic Welfare, and Legislated Restrictions 

This section uses the model to analyze the welfare consequences of quality-standards legislation, a problem not easily handled by conventional methods. The discussion clarifies issues in recent controversies regarding treatment of legislated standards in the construction of price indexes. For example, how should mandatory installation of seat belts and air bags affect the automobile price index? For expository convenience, discussion is confined to the case of one attribute. Generalization to several characteristics is immediate.

A minimum quality standard means that $z \geq \bar{z}$, and brands
containing less than $\bar{z}$ are prohibited from the market. Assume
constant returns to quantities (as in Section III B). Then the law is
irrelevant for all consumers previously purchasing packages containing
more than the legislated minimum. The situation for a buyer whose choice
is affected by the law is shown in figure 5: $z^*$ was the original
choice, whereas $\bar{z}$ is chosen after the law has been passed, since
$z^*$ is no longer available. The minimum attainable value function has
shifted upward, from the one tangent to $p(z)$ at $z^*$ to the one passing
through $p(\bar{z})$, and the consumer is worse off (see eq. [2]).

[Fig. 5]

Choose the distance $\Delta P = P_2 - P_1$ as a monetary measure of the
loss in welfare. Since $\partial\theta/\partial y = 1$, $\Delta P$ is the
bribe necessary for the consumer to purchase $\bar{z}$ when $z^*$ was
available. Clearly, this measure is not unique (i.e., if compensation is
evaluated at a different amount of $z$) unless $\theta_{zu} = 0$. The
welfare loss can be estimated from the implicit price and bid functions.
The distance $P_2 - P_0$ is given by

$$\int_{z^*}^{\bar{z}} p'(z)\,dz,$$

or the area under marginal cost from $z^*$ to $\bar{z}$, and is shown in
figure 6 as $z^* a b \bar{z}$. It represents the social opportunity cost
of additional resources necessary to produce $\bar{z}$ instead of $z^*$.
The integral

$$\int_{z^*}^{\bar{z}} \theta_z(z)\,dz,$$

or the area under a compensated demand function (compensated at the
original level of real income) between $z^*$ and $\bar{z}$ in figure 6
($z^* a e \bar{z}$), measures the amount the consumer would have paid for
the increment $(\bar{z} - z^*)$ at the unrestricted level of welfare. It
measures $P_1 - P_0$ in figure 5 and represents the benefit of the
restriction. The difference between costs and benefits is given by
$\Delta P$, or the difference between the areas under the marginal cost
and compensated demand functions, the shaded area in figure 6. In the
general case of several attributes, $\Delta P$ must be measured by a line
integral. Otherwise, everything else is unchanged.

[Fig. 6]
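
The cost-benefit calculation can be sketched numerically. The marginal cost and compensated demand schedules below are hypothetical straight lines, constructed so that they cross at an assumed original choice of z* = 2; the legislated minimum of 4 is likewise an assumption made only for illustration.

```python
def integrate(f, lower, upper, steps=1000):
    """Trapezoid-rule approximation of the integral of f over [lower, upper]."""
    width = (upper - lower) / steps
    total = 0.5 * (f(lower) + f(upper))
    for i in range(1, steps):
        total += f(lower + i * width)
    return total * width


def marginal_cost(z):
    """Hypothetical implicit marginal price p'(z), rising in z."""
    return 1.0 + 0.5 * z


def compensated_demand(z):
    """Hypothetical marginal bid at the original welfare level, falling in z."""
    return 3.0 - 0.5 * z


z_star, z_min = 2.0, 4.0   # original choice and legislated minimum (assumed)

resource_cost = integrate(marginal_cost, z_star, z_min)      # P2 - P0
benefit = integrate(compensated_demand, z_star, z_min)       # P1 - P0
delta_p = resource_cost - benefit                            # P2 - P1

print(resource_cost, benefit, delta_p)
```

In this example the area under marginal cost is 5, the area under compensated demand is 3, and the welfare loss is therefore 2, the counterpart of the shaded area in figure 6.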

When the marginal utility of money is constant, $\Delta P$ is unique and
the price restriction is equivalent to an additive increase in implicit
prices in amount $\Delta P$ everywhere. In figure 5, $\theta_{zu} = 0$
means that all value functions are parallel, and if the budget constraint
was $y = x + p(z) + \Delta P$ instead of $y = x + p(z)$, the consumer
would have arrived at exactly the same value function of his own free
choice. The real price of the characteristic has risen because choices
are restricted, and the price index should rise to reflect that fact. A
natural measure of the real price increase imposed by the law is a
weighted average of terms such as $\Delta P$ (including buyers for whom
$\Delta P = 0$), where the weights are expenditure shares among all
consumers.* This measure overstates the loss insofar as the restriction
actually forces some consumers completely out of the generic goods market,
since they escape the full loss $\Delta P$. Also, standard index number
problems arise when the marginal utility of money is not constant.

* A complete assessment of the law and its effect on the price index
requires balancing the costs calculated above against any
externality-induced social benefits of the restriction. In our judgment,
seat belts and air bags are in a different category than emission-control
devices. In regard to the latter, the apparatus above can be used easily
to analyze the effect of the European system of taxing engine
displacement. An ad valorem tax increases average and marginal costs of
packages with larger liter capacity, and the usual income and
substitution effects apply: packages with smaller amounts of this and
complementary characteristics (such as size of car) are purchased.
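
A minimal sketch of the proposed index adjustment follows. The three buyers, their expenditures, and their individual losses are hypothetical; the first buyer already purchased more than the legislated minimum and so bears no loss.

```python
def legislated_price_increase(buyers):
    """Expenditure-share-weighted average of each buyer's loss Delta P.

    `buyers` is a list of (expenditure, loss) pairs.
    """
    total_spending = sum(spending for spending, _ in buyers)
    return sum(spending / total_spending * loss for spending, loss in buyers)


# Hypothetical market: (expenditure on the good, Delta P for that buyer).
buyers = [
    (3000.0, 0.0),     # unaffected by the standard
    (2000.0, 50.0),
    (1000.0, 120.0),
]

average_increase = legislated_price_increase(buyers)
print(round(average_increase, 2))
```

Buyers for whom the loss is zero still enter the average through their expenditure weights, which is why they are included in the list.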


VI. Conclusions 

This paper has drawn out the observational consequences of the construct
of implicit markets for characteristics embodied in differentiated
products. When goods can be treated as tied packages of characteristics,
observed market prices are also comparable on those terms. The economic
content of the relationship between observed prices and observed
characteristics becomes evident once price differences among goods are
recognized as equalizing differences for the alternative packages they
embody. Here, as elsewhere, price differences generally are equalizing
only on the margin and not on the average. Hence, estimated hedonic
price-characteristics functions typically identify neither demand nor
supply. In fact, those observations are described by a joint-envelope
function and cannot by themselves identify the structure of consumer
preferences and producer technologies that generate them.

The formal analysis is complicated by the fact that budget constraints
are nonlinear. Consequently, it is not surprising that far weaker
theorems than usual apply. However, a feasible econometric procedure for
estimating the underlying generating structure has been derived through
the use of derivative transformations. When constraints are nonlinear,
marginal prices serve the same role as average prices do in the linear
case. Finally, the essential spatial context of the problem means that
substitution and income effects must be more carefully distinguished than
usual. Indeed, here is a major practical instance where compensated
demand and supply functions become the relevant fundamental concepts.
These compensated functions are estimated by the econometric method,
and measures of consumer and producer surplus can be derived directly
from them. We anticipate that the basic conceptual framework outlined
above will have a variety of applications to many practical problems
involving equilibrium in cross-section data.

The analysis has been simplified by assuming divisibility in production.
Generalization has to incorporate nonconvexities, and discontinuities
must result. When nonconvexities are not small relative to the market, it
is obvious that only isolated locations on the characteristics surface
will be filled. In other words, such a generalization will naturally
incorporate the case of monopolistic competition, and observed
“distances” (in terms of characteristics) between differentiated products
will be endogenously determined. The methods employed above do not carry
through because certain nonmarginal decisions must be analyzed, and far
more sophisticated techniques are required.






A Diagrammatic Exposition of the 
Harberger Model with One Immobile 
Factor 


Charles E. McLure, Jr. 

Rice University


A four-quadrant diagram is used to exposit the Harberger model of tax
incidence (Harberger 1962) in the case in which one of two factors is
immobile between the two sectors. Beyond its basically expository
purpose, the paper offers two original contributions. First, the general
equilibrium “supply curve” relating percentage changes in output to
percentage changes in relative product prices is developed explicitly.
The roles of various production parameters and factor immobility are
thus highlighted. Second, the diagrammatic exposition makes explicit
one danger of using the Harberger model, written in terms of
differentials, to examine tax changes of finite size.


I. Introduction

In the past 10 years the incidence model presented by Arnold Harberger
in his famous article in the June 1962 Journal of Political Economy has
become the standard theoretical tool of economists concerned with the
incidence of taxes in a general equilibrium setting.¹ But many economists
may find this analysis largely unintelligible because of the mathematical
manipulations involved. Therefore a brief diagrammatic presentation of
one version of the Harberger model is offered here in the hope of making
that extremely useful model available to a wider audience. In this paper
only the case in which one factor is mobile between sectors and one factor
immobile is presented, primarily because the original Harberger model
with perfect mobility of both factors does not readily lend itself to the
kind of diagrammatic treatment presented for the imperfect mobility
case. Yet this exposition of the imperfect mobility case should assist in
the understanding of even the complete mobility case. Moreover, many
policy problems involve time horizons in which analysts cannot reasonably
assume that all factors are completely mobile, either between regions or
industries. This is especially true in the sphere of geographic incidence
analysis. In this paper both tax and expenditure incidence are examined.
In fact, it will simplify understanding of the model to consider
expenditure incidence before tax incidence.

This paper reports research related to AID contract csd-3302 to the
Program of Development Studies at Rice University on the Distribution of
Gains, Wealth, and Income from Development. The author wishes to
acknowledge the contributions Greg Ballentine, Ibrahim Eris, Paul Hanley,
Wayne Thirsk, and an unusually meticulous referee for this Journal made
through their careful reading and comments on an earlier draft. As
always, neither the Agency for International Development nor the
individuals mentioned above should be held accountable for the views
expressed here.

¹ See McLure (1972a) for a more detailed description and appraisal of the
original Harberger model, its extensions, and applications.

In addition to the basically expository objective described above, this
paper has two further purposes. First, the “supply curve” relating
percentage changes in the output of the taxed good to percentage changes
in relative product prices is developed explicitly. Use of this supply
curve, in conjunction with the “demand curve,” facilitates comparison
with traditional partial equilibrium incidence analysis. Moreover, we are
able to see quite clearly how the various parameters of the model
determine the supply response to changes in relative prices. By
comparison, none of the discussions of the Harberger model up until now
has developed an explicit product supply relation. Rather, they have
focused primarily upon responses in markets for factors. Of course, in
the mathematical treatment of the model, all the equations are solved
simultaneously. The point is that the demand relation has never been
juxtaposed against a supply relation derived from underlying conditions
in factor markets and production functions.²

Second, strictly speaking, the Harberger model is appropriate for the
analysis of only small changes in taxes, since all the equations are
written in terms of differentials. The diagrammatic exposition enables us
to appreciate one of the pitfalls that threatens one who uses the
Harberger model to analyze tax changes that are not small. In particular,
the model is developed initially under the assumption that the
linearization of economic relationships inherent in the Harberger
formulation in terms of differentials is acceptable. This is done solely
as a matter of expository convenience. But Section VI discusses the
propriety of linearization when discrete changes in taxes are involved.

² For a more detailed discussion of using the Harberger model to “go
behind the supply curve” of traditional partial equilibrium analysis, see
McLure (1972a).




The diagrams presented here are used primarily to explain the general
structure and workings of the model and to isolate the crucial parameters
of the analysis. No attempt is made to present the results of the analysis
in detail, except for polar cases, since anyone interested in the results
for intermediate cases would undoubtedly turn to the algebraic
formulation. Thus this paper may help readers to understand the more
detailed algebraic version, but it is no substitute for it. Nevertheless,
the link between the algebraic and diagrammatic expositions is made
explicit by the textual development of the equations plotted in each
diagram.

Because this paper is intended to be primarily expository, the
assumptions underlying the analysis are spelled out in detail. But for
the most part this is done as the various diagrams are developed, rather
than as a group at the outset. Thus the reader must be patient if he
suspects that some crucial assumption is being neglected; it should come
forth and be explained in due course. Nevertheless, it will be convenient
to describe several general features of the model at this point.

First, the model is one in which two factors which are inelastically
supplied in total (and by each individual) are employed in two competitive
industries under constant returns to scale to produce two goods, x and
y.³ Capital is assumed to be perfectly mobile between sectors. Therefore,
the price of capital services net of tax is the same in both sectors. On
the other hand, labor is assumed to be completely immobile between
sectors; thus wage rates net of tax may differ between sectors.

Second, in a model such as that employed here only changes in relative
prices (as opposed to changes in absolute prices) can be determined in the
absence of an explicit monetary equation. But this in no way limits the
usefulness of the model for incidence analysis, which is essentially a
question of changes in relative factor returns (incidence on the sources
side of income) and changes in relative product prices (incidence on the
uses side of income). Thus we are free to choose one good or factor (or
some weighted average) as the numeraire by setting the policy-induced
change in its price equal to zero. Though in general I prefer to use the
untaxed product as the numeraire (in which case $dP_y = 0$ is the
numeraire equation), the diagrammatic analysis is simplified by choosing
capital services as our numeraire (so that the numeraire equation is
$dP_K = 0$). Of course the outcome in terms of tax-induced changes in real
income is in no way dependent upon the choice of numeraire equation.
Moreover, it will be convenient, but not essential, to define units of
products and factors in such a way as to set all prices initially equal
to unity. Thus any specific tax is also an ad valorem tax, and any change
in price is also a fractional change.⁴

³ Throughout this paper reference is to two sectors or industries. In my
earlier work I have used the model to investigate interregional incidence
and related problems by dealing with two regions rather than two
industries (see McLure 1969, 1970a, 1971a). This and other applications
of the model are straightforward and hence are ignored in this paper.

Finally, the Harberger model is perhaps the comparative statics model
par excellence in the sense that it can be solved for tax-induced changes
from a preexisting (no-tax) equilibrium, but not for either the initial
(no-tax) or the final (cum-tax) equilibrium solution. Thus in the diagrams
of this paper all axes indicate percentage changes in variables, with the
initial equilibrium corresponding to zero rates of change of all
variables. These changes are given initially in differential terms,
following the original Harberger formulation. Extension of the model to
cases of discrete changes in variables is discussed in Section VI.


II. The Basic Diagram

The basic analytical tool developed here is the four-quadrant diagram
presented in figure 1. It is derived on the assumption that there are no
taxes and government spending in the economy.

In quadrant a is plotted the “demand curve,” D:

$$\frac{dQ_x}{Q_x} = -E\,(dP_x - dP_y). \qquad (1)$$

(Ignore curve [label illegible] for the moment; it is used in Section
III.) This description of demand differs from the standard demand curve
in very special ways that must be made explicit. First, the curve relates
the percentage change in the demand for good x to the percentage change
in the relative prices of goods x and y. Thus it passes through the
origin of the quadrant a diagram, and the reciprocal of the absolute
value of its slope is the elasticity of demand for good x with respect to
relative prices, E.⁵ Second, this elasticity of demand takes account of
only substitution effects on the demand for good x resulting from changes
in relative prices. Income effects are ignored initially by using the
very special assumption that at the margin all economic groups (including
the government) spend their incomes in exactly the same way. Thus E is
the market elasticity of demand for x with respect to relative prices,
excluding first-order income effects.

⁴ For more detailed discussions of the issues raised in this paragraph,
see Musgrave (1959, pp. 217-21 and chap. 15), Harberger (1962, pp. 219,
224), and McLure (197U, 19714, pp. 31-33).

⁵ As drawn, the curve shows a constant demand elasticity, but it need not
do so. This constancy of slope is not so bothersome in the specification
of the demand relationship or even of the $S_x$ curve of quadrant c,
where a constant slope implies a constant elasticity of factor
substitution. But for the curves in quadrants b and d and for the $S_y$
curve in quadrant c the constant slope is in general quite unrealistic.
This problem, which involves the analysis of differential changes, rather
than discrete changes, in the Harberger model, is discussed further in
Section VI below. For now the curves are all drawn as straight lines with
full knowledge that, strictly speaking, this is unacceptable for discrete
tax changes.



Fig. 1. Derivation of supply curve and analysis of expenditure incidence


The second curve in quadrant a, S, which corresponds to the supply
curve of traditional partial equilibrium analysis, is derived from the
remaining quadrants. The remainder of this section is devoted to
describing that derivation. The curve in quadrant b is described by the
following equation:

$$\frac{dQ_x}{Q_x} = f_K \frac{dK_x}{K_x} + f_L \frac{dL_x}{L_x}
  = f_K \frac{dK_x}{K_x}. \qquad (2)$$

It describes how output of good x changes if the amount of services of
the mobile factor, capital, combined with the fixed amount of services
of the immobile factor, labor, is changed. The curve $Q_x$, relating
$dQ_x/Q_x$, the percentage change in output of x, to $dK_x/K_x$, the
percentage change in capital services used in producing x, has a slope of
$1/f_K$, where $f_K$ may be interpreted alternatively as the elasticity
of output with respect to this input in industry x or as capital's
initial share of the output of industry x. Of course, this line, too,
passes through the origin.










If production exhibits constant returns to scale, as is assumed in the
Harberger model, factor proportions are uniquely related to the relative
costs of the services of the two factors. Stated differently, the
possibility of changing factor proportions in production in response to
changes in the relative costs of factor services is given by the
elasticity of factor substitution in production. In general, the
possibilities of factor substitution in the two industries are given by
the following two equations:

$$\frac{dL_x}{L_x} - \frac{dK_x}{K_x} = \sigma_x (dP_K - dP_{Lx}) \qquad (3)$$

and

$$\frac{dL_y}{L_y} - \frac{dK_y}{K_y} = \sigma_y (dP_K - dP_{Ly}), \qquad (4)$$

where $\sigma_x$ and $\sigma_y$ are the elasticities of factor
substitution in the two industries. Making use of the additional
assumptions (a) that labor is immobile between sectors, (b) that the
total supply of capital is fixed, and (c) that capital is chosen as the
numeraire, so that $dP_K = 0$, we can rewrite equations (3) and (4) as

$$\frac{dK_x}{K_x} = \sigma_x\, dP_{Lx} \qquad (3')$$

and

$$\frac{dK_x}{K_x} = -\sigma_y \frac{K_y}{K_x}\, dP_{Ly}. \qquad (4')$$

These equations are plotted in quadrant c as $S_x$ and $S_y$. As written,
they show the changes in the utilization of capital services in industry
x that would result from a given change in the cost of labor in each of
the two industries, given that the price of capital services is held
constant. The slopes of these curves are $\sigma_x$ and
$-\sigma_y K_y/K_x$, respectively, and both curves pass through the
origin.

The fourth quadrant converts changes in the costs of factor services in
the two industries into changes in product prices. The basic equations
are

$$dP_x = f_L\, dP_{Lx} + f_K\, dP_K \qquad (5)$$

and

$$dP_y = g_L\, dP_{Ly} + g_K\, dP_K. \qquad (6)$$

In commonsense terms, these equations state that the percentage change
in the price of a product is the weighted average of the percentage
changes in the prices of the services of the two factors used to produce
it. The weights are the initial shares of output of the good in question
accruing to the two factors. Since capital has been chosen as the
numeraire, we can reduce these two equations to the following:

$$dP_x = f_L\, dP_{Lx} \qquad (5')$$

and

$$dP_y = g_L\, dP_{Ly}. \qquad (6')$$

These are plotted in quadrant d as $P_x$ and $P_y$. As is true of all the
curves in the initial equilibrium without taxes, these curves pass
through the origin.

Having derived the curves in quadrants b-d from the basic equations
of the Harberger model, we can now use them to derive the supply curve
for x with respect to relative prices. The derivation is a
straightforward exercise in the use of the diagrams. First, we choose an
arbitrary point, M, on the horizontal axis in quadrant a as a starting
point. Moving vertically to quadrant b we see that if output of good x is
to increase by the fraction OM, the amount of capital services employed
in industry x must increase by the fraction ON. But this reallocation of
capital implies a rise in the wage rate in industry x (given $dP_K = 0$)
of $OV_x$, and a fall in the wage rate in industry y of $OV_y$, as
indicated by curves $S_x$ and $S_y$ in quadrant c.

These changes in wage rates in the two industries are transmitted to
product prices in the way described by curves $P_x$ and $P_y$ in quadrant
d. Thus the fractional increase in the wage rate of $OV_x$ in industry x
produces a fractional rise in the price of good x of $OW_x$. Similarly,
the price of good y falls by the fraction $OW_y$ in response to the wage
rate in that industry falling by the fraction $OV_y$. Thus we have seen
that the fractional increase in the output of good x of OM implies a
change in the relative prices of goods x and y of $W_yW_x$.⁶ This change
in relative prices can be plotted as OW in quadrant a against OM to
produce A, one point on the supply curve of good x with respect to
relative product prices. Repeating the procedures in this and the
previous paragraph for other points on the horizontal axis in quadrant a
produces the entire supply curve.
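
The clockwise tour through the quadrants can be mimicked in a few lines of code. The sketch below takes the relations of quadrants b, c, and d to be $dQ_x/Q_x = f_K\,dK_x/K_x$, $dK_x/K_x = \sigma_x\,dP_{Lx}$, $dK_x/K_x = -\sigma_y (K_y/K_x)\,dP_{Ly}$, $dP_x = f_L\,dP_{Lx}$, and $dP_y = g_L\,dP_{Ly}$, with $dP_K = 0$. The parameter values are illustrative assumptions, not values taken from the text.

```python
def supply_point(dq, f_k, f_l, g_l, sigma_x, sigma_y, kx_over_ky):
    """Trace one point on curve S, moving clockwise from quadrant a.

    dq is the fractional change in output of x (distance OM); the return
    value is the implied change in relative prices dP_x - dP_y (distance OW).
    """
    dk = dq / f_k                           # quadrant b: OM -> ON
    dw_x = dk / sigma_x                     # quadrant c: wage rise in x (OV_x)
    dw_y = -kx_over_ky * dk / sigma_y       # quadrant c: wage fall in y (OV_y)
    dp_x = f_l * dw_x                       # quadrant d: price rise of x (OW_x)
    dp_y = g_l * dw_y                       # quadrant d: price fall of y (OW_y)
    return dp_x - dp_y


# Illustrative (assumed) parameters.
params = dict(f_k=0.4, f_l=0.6, g_l=0.8, sigma_x=0.7, sigma_y=0.7,
              kx_over_ky=1.0)

dq = 0.02                                   # a 2 percent rise in output of x
relative_price_change = supply_point(dq, **params)
implied_elasticity = dq / relative_price_change
print(relative_price_change, implied_elasticity)
```

Because every relation is linear, repeating the call for other values of `dq` gives points on the same straight line through the origin.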

Before we consider how taxes and expenditures affect the curves and
values of the variables in the model, it will be convenient to digress
briefly at this point to examine the determinants of the shape of the
supply curve. First, we can describe the supply curve in general terms in
the following way:

$$\frac{dQ_x}{Q_x} = S\,(dP_x - dP_y). \qquad (7)$$

⁶ Some readers may object that this derivation reverses the direction of
causality in our usual way of thinking about supply curves, which is in
terms of a given change in relative product prices giving rise to a
certain change in output. The clockwise derivation described implies
nothing about causality. It was chosen simply because it is substantially
easier than a counterclockwise derivation, as the reader can easily
ascertain for himself.




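Of course, S is the elasticity of supply of good x with respect to
relative prices. To obtain an expression for the elasticity of supply
that utilizes information about conditions in factor markets and the
production function we can repeat the reasoning of the last two
paragraphs. Thus

$$
\begin{aligned}
\frac{dK_x}{K_x} &= \frac{dQ_x/Q_x}{f_K}, \\
dP_{Lx} &= \frac{dK_x/K_x}{\sigma_x}, \\
dP_{Ly} &= -\frac{K_x\,(dK_x/K_x)}{\sigma_y K_y}, \\
dP_x &= f_L\, dP_{Lx}, \\
dP_y &= g_L\, dP_{Ly}, \\
dP_x - dP_y &= f_L\, dP_{Lx} - g_L\, dP_{Ly} \\
 &= \frac{f_L}{\sigma_x}\frac{dK_x}{K_x}
    + \frac{g_L K_x}{\sigma_y K_y}\frac{dK_x}{K_x} \\
 &= \left(\frac{f_L}{\sigma_x} + \frac{g_L K_x}{K_y \sigma_y}\right)
    \frac{dK_x}{K_x} \\
 &= \left(\frac{f_L}{\sigma_x} + \frac{g_L K_x}{K_y \sigma_y}\right)
    \frac{dQ_x/Q_x}{f_K}.
\end{aligned}
$$

From the last expression we can calculate the elasticity of supply with
respect to relative prices as

$$S = \frac{f_K \sigma_x \sigma_y}
           {f_L \sigma_y + g_L \sigma_x K_x/K_y}. \qquad (8)$$

Under the linearization assumption followed through Section V, S is a
constant. Thus curve S in quadrant a is a straight line.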
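
As a check on the algebra, the closed form for S can be compared with the chain of substitutions that precedes it. The sketch below reads equation (8) as $S = f_K\sigma_x\sigma_y/(f_L\sigma_y + g_L\sigma_x K_x/K_y)$; the parameter values are illustrative assumptions chosen by us, not values quoted from the text.

```python
def supply_elasticity(f_k, f_l, g_l, sigma_x, sigma_y, kx_over_ky):
    """Elasticity of supply S in the closed form of eq. (8)."""
    return (f_k * sigma_x * sigma_y
            / (f_l * sigma_y + g_l * sigma_x * kx_over_ky))


def supply_elasticity_stepwise(f_k, f_l, g_l, sigma_x, sigma_y, kx_over_ky):
    """The same S, rebuilt from the last line of the derivation."""
    # Relative price change per unit of fractional output change.
    price_response = (f_l / sigma_x + g_l * kx_over_ky / sigma_y) / f_k
    return 1.0 / price_response


# Illustrative (assumed) parameters.
args = (0.4, 0.6, 0.8, 0.7, 0.7, 1.0)

closed_form = supply_elasticity(*args)
stepwise = supply_elasticity_stepwise(*args)
print(round(closed_form, 4), round(stepwise, 4))
```

Both routes give an elasticity of about .20 for these assumed values, so the closed form and the step-by-step derivation agree.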

The determinants of the elasticity of supply can be seen directly from
equation (8) to be (a) the elasticities of factor substitution in the two
industries, (b) the allocation of the total capital stock between the two
industries, and (c) labor's initial share of output in the two
industries. Partial differentiation of equation (8) reveals that ceteris
paribus the supply of good x is more elastic the easier is factor
substitution in either industry, the smaller is the fraction of the total
capital stock devoted to production of good x, and the smaller is labor's
share of output in each industry. The same thing is revealed by
inspection of figure 1. The steeper are curves $S_x$ and $S_y$ in
quadrant c, the less is any change in the allocation of capital between
the two industries reflected in relative factor prices and hence in
relative product prices. Thus the greater are $\sigma_x$, $\sigma_y$, and
$K_y/K_x$, the greater is the elasticity of product supply with respect
to relative product prices. Similarly, ceteris paribus, the flatter are
curve $Q_x$ in quadrant b and curves $P_x$ and $P_y$ in quadrant d, the
less effect a given change in the quantity of good x has upon relative
product prices. But flatness in any of these curves implies a low labor
share in one industry.
The economics behind these results is easy to appreciate. First, if
factor substitution in industry x is easy, capital can be absorbed with
relatively little rise in the wage rate in that industry and hence little
rise in the price of good x. Similarly, if labor can easily be
substituted for capital in industry y, the outflow of capital necessary
to expand production of good x will not greatly lower the wage rate in
industry y and the price of good y. Essentially the same result occurs if
relatively little of the total capital stock is used in industry x. In
such a case the capital inflow needed to expand production of good x can
occur without greatly affecting factor proportions, relative factor
returns, and the product price in industry y.⁷ Finally, the greater is
capital's share in industry x, the smaller is the capital flow induced by
a given fractional change in the output of that industry. This, in turn,
is reflected in a smaller change in wage rates and product prices in the
two industries. And the smaller is labor's share in either industry, the
less responsive is the price of the product of that industry to a given
change in the relevant wage rate.

It may be worthwhile, to carry this reasoning further, to note the value
of the supply elasticity for extreme values of the parameters just
discussed. If factor substitution is impossible in either industry,
supply is completely inelastic. Given the immobility of labor, output of
x cannot expand unless factor substitution is possible in both
industries.

If in industry x factors are perfect substitutes, the elasticity of
supply depends upon the elasticity of substitution in the other industry,
factor shares in both industries, and the split of capital between the
industries. If perfect substitution characterizes production in industry
y, the supply elasticity depends on the ease of factor substitution and
relative factor shares in industry x. Finally, if virtually all of the
capital stock is used in industry x, the supply curve for that industry
is quite inelastic. But if almost none of the capital stock is used in
the industry, the elasticity of supply depends crucially upon the
elasticity of substitution and factor shares in the industry, just as
when $\sigma_y$ is infinite.
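
These polar cases can be checked with a short sketch. It writes equation (8) in the equivalent reciprocal form $S = f_K/(f_L/\sigma_x + g_L (K_x/K_y)/\sigma_y)$ so that an infinite elasticity of substitution can be passed in directly; the shares and elasticities used are illustrative assumptions.

```python
import math


def supply_elasticity(f_k, f_l, g_l, sigma_x, sigma_y, kx_over_ky):
    """Eq. (8) in reciprocal form, allowing zero or infinite sigmas."""
    if sigma_x == 0 or sigma_y == 0:
        return 0.0                               # no substitution: S = 0
    wage_term_x = f_l / sigma_x                  # response routed through x
    wage_term_y = g_l * kx_over_ky / sigma_y     # response routed through y
    return f_k / (wage_term_x + wage_term_y)


shares = dict(f_k=0.4, f_l=0.6, g_l=0.8)         # assumed factor shares

no_substitution = supply_elasticity(sigma_x=0.0, sigma_y=0.7,
                                    kx_over_ky=1.0, **shares)
perfect_in_x = supply_elasticity(sigma_x=math.inf, sigma_y=0.7,
                                 kx_over_ky=1.0, **shares)
perfect_in_y = supply_elasticity(sigma_x=0.7, sigma_y=math.inf,
                                 kx_over_ky=1.0, **shares)
little_capital_in_x = supply_elasticity(sigma_x=0.7, sigma_y=0.7,
                                        kx_over_ky=1e-9, **shares)
most_capital_in_x = supply_elasticity(sigma_x=0.7, sigma_y=0.7,
                                      kx_over_ky=1e6, **shares)

print(no_substitution, perfect_in_x, perfect_in_y,
      little_capital_in_x, most_capital_in_x)
```

With perfect substitution in industry y, or with almost no capital in industry x, the result collapses to $f_K\sigma_x/f_L$, which depends only on conditions in industry x.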

Finally, it is instructive to calculate the elasticity of product supply
for various values of the crucial parameters. To simplify matters, one
can calculate the elasticity assuming that virtually none of the capital
stock is being utilized in industry x, since inspection of equation (8)
reveals that this assumption yields the largest possible value of S,
given the values of the other parameters. Table 1 gives the values of S
in the limit as $K_x/K_y$ approaches zero, for the stated values of
$\sigma_x$ and factor shares in industry x. One is immediately struck by
the fact that these elasticities are all quite low; only at the extreme
limits of the acceptable range of values for $\sigma_x$ and factor shares
does the elasticity of product supply equal or exceed unity.

⁷ The numeraire assumption $dP_K = 0$ is not equivalent to an elastic
supply of capital to industry x. This elasticity, which depends upon the
values of $\sigma_y$ and $K_x/K_y$, is a matter of response to changes in
relative factor returns and is independent of the choice of numeraire.





TABLE 1

Elasticity of Product Supply with Respect to Relative Prices,
for Various Values of $\sigma_x$ and $f_L$
(Limit as $K_x/K_y \to 0$)

                                  Value of $\sigma_x$
 f_K    f_L    f_K/f_L     0.2    0.4    0.6    0.8    1.0    1.2    1.5

 0.1    0.9     0.11      0.02   0.04   0.07   0.09   0.11   0.13   0.17
 0.2    0.8     0.25      0.05   0.10   0.15   0.20   0.25   0.30   0.38
 0.3    0.7     0.43      0.09   0.17   0.26   0.34   0.43   0.51   0.64
 0.4    0.6     0.67      0.13   0.27   0.40   0.53   0.67   0.80   1.00
 0.5    0.5     1.0       0.20   0.40   0.60   0.80   1.00   1.20   1.50

Formula: for $K_x/K_y \to 0$, $S = f_K \sigma_x / f_L$.

And for the most realistic assumptions of values of the parameters
($.6 \leq \sigma_x \leq 1.0$ and $.6 \leq f_L \leq .8$), the elasticity
of supply falls within a range of .15-.67. All things considered, it
seems likely that in industries and regions characterized by the kind of
immobilities assumed here the elasticity of supply is probably no greater
than [illegible].⁸ The elasticity of supply is, of course, lower than
shown in table 1 if $K_x/K_y$ is not near zero. For example, if
$f_L = g_L$ and $\sigma_x = \sigma_y$, the elasticity of supply is only
one-half as great as shown in table 1 if $K_x = K_y$, and it is only
one-fourth as great if $K_x = 3K_y$.
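
The entries of table 1 and the two scaling remarks above can be reproduced from the limiting formula $S = f_K\sigma_x/f_L$ and from equation (8) read as $S = f_K\sigma_x\sigma_y/(f_L\sigma_y + g_L\sigma_x K_x/K_y)$. The sketch below is only a check on the arithmetic; the printed values are those of the table as reconstructed here.

```python
def limiting_elasticity(f_k, f_l, sigma_x):
    """S in the limit as K_x/K_y approaches zero."""
    return (f_k / f_l) * sigma_x


def supply_elasticity(f_k, f_l, g_l, sigma_x, sigma_y, kx_over_ky):
    """Eq. (8) for a general split of the capital stock."""
    return (f_k * sigma_x * sigma_y
            / (f_l * sigma_y + g_l * sigma_x * kx_over_ky))


sigmas = [0.2, 0.4, 0.6, 0.8, 1.0, 1.2, 1.5]
printed = {                       # rows keyed by (f_K, f_L)
    (0.1, 0.9): [0.02, 0.04, 0.07, 0.09, 0.11, 0.13, 0.17],
    (0.2, 0.8): [0.05, 0.10, 0.15, 0.20, 0.25, 0.30, 0.38],
    (0.3, 0.7): [0.09, 0.17, 0.26, 0.34, 0.43, 0.51, 0.64],
    (0.4, 0.6): [0.13, 0.27, 0.40, 0.53, 0.67, 0.80, 1.00],
    (0.5, 0.5): [0.20, 0.40, 0.60, 0.80, 1.00, 1.20, 1.50],
}

# Largest gap between the formula and the printed two-decimal entries.
max_gap = max(
    abs(limiting_elasticity(f_k, f_l, s) - value)
    for (f_k, f_l), row in printed.items()
    for s, value in zip(sigmas, row)
)

# Equal shares and elasticities: K_x = K_y halves S, K_x = 3 K_y quarters it.
base = limiting_elasticity(0.3, 0.7, 0.8)
half = supply_elasticity(0.3, 0.7, 0.7, 0.8, 0.8, 1.0)
quarter = supply_elasticity(0.3, 0.7, 0.7, 0.8, 0.8, 3.0)
print(max_gap <= 0.0051, half / base, quarter / base)
```

Every printed entry agrees with the formula to within rounding at two decimals.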

In constructing the diagrams in this paper, what seem to be more or
less reasonable values of the parameters are used. In particular, we use
$\sigma_x = \sigma_y = .7$, [remaining parameter values illegible].
(Different labor shares in the two industries are employed so that curves
$P_x$ and $P_y$ in quadrant d do not coincide, and $K_x/K_y$ is not
assigned a value near zero so that curve $S_y$ in quadrant c will not be
vertical.) This gives a value of S of .20.


⁸ If we reverse the mobility assumptions to make capital the immobile
factor and labor the mobile factor, as in the usual examples of the short
run in price theory, the elasticity of supply is greater than shown in
table 1 by a factor of $(f_L/f_K)^2$. But one can expect elasticities of
factor substitution to be considerably higher ex ante or in long-run
equilibrium (when capital is mobile) than they are once the capital stock
is given. If both factors are mobile between sectors, the elasticity of
supply of good x with respect to relative prices can be written as
follows:

[equation illegible]

From this expression we can derive the usual results that in the perfect
mobility case the supply response to a change in relative product prices
can range from zero (if the two industries utilize the two factors in
different proportions and factor substitution is completely impossible in
both industries) to infinitely elastic (if the two factors are perfect
substitutes in either industry or if the two industries utilize the two
factors in the same proportions).







III. Expenditure Incidence

Thus far only the supply and demand curves and the derivation of the
former have been given; no change from the initial equilibrium position
has been analyzed. In this and the next two sections the analytical
framework developed in the last section is used for the comparative statics
analysis of the distributional implications of tax, expenditure, and
balanced-budget policies. As noted earlier, it will be convenient to begin
the investigation with the infrequently examined, but analytically simpler,
question of the incidence of government expenditures. Then we can
proceed to the more complex questions of tax incidence and balanced-budget
incidence in Sections IV and V.

“Expenditure incidence” refers to the way government spending (as
opposed to taxation) affects the distribution of income available for private
use, by affecting relative factor and product prices.⁹ Government spending,
financed in a neutral way,¹⁰ will have no effect on relative product prices,
relative prices of factor services, and the distribution of income, unless at
the margin the government spending pattern differs from the private
pattern. But if the government devotes a larger (smaller) share of its
budget to good x than would have occurred at the margin in the private
sector, the demand curve shifts to the right (left), and relative prices and
the distribution of income change.¹¹ We can trace this through the
four-quadrant diagram.

Consider an increase in government spending of $dG$. In such a case the
demand curve shifts by $(m_g - m_p)\,dG/Q_x$, where $m_g$ and $m_p$ are the public
and private marginal propensities to consume good x. (If $m_g$ and $m_p$ are
equal, no shift occurs.) Let us denote this shift by the shorthand notation
$dQ_G/Q_x$. The shift in the demand curve $D$ is shown in figure 1, $D_G$ being
the shifted curve. The intersection of the supply curve $S$ and the shifted

⁹ This concept of expenditure incidence is based upon the discussion in Musgrave
(1959, pp. 213-15) and is discussed more fully in McLure (1972b). It might be noted
first that expenditure incidence, defined in terms of the effects of government spending
on relative prices and real incomes, is closely akin to the concept of pecuniary externalities
familiar to cost-benefit analysts, and second that the term “expenditure incidence” does
not refer to the question of who benefits from government services. That question is not
generally amenable to the type of analysis presented here. In this paper only the incidence
of government purchases of final products is considered. The much more complicated
problem of the incidence of direct governmental employment of factors, which is
considered in McLure (1972b), is relatively difficult to fit into the diagrammatic analysis,
though it is not impossible.

¹⁰ Reference to neutral financing can be interpreted to mean financing through a
general sales or income tax (which in this model are equivalent). That such a tax is
distributionally neutral is shown in the next section. For a more complete discussion of
distributionally neutral financing and expenditure patterns, see McLure (1971b, pp.
30-31, and 1972b, pp. 435-36).

¹¹ There are, of course, polar cases in which relative prices do not change and in which
changes in relative prices do not affect the distribution of income. Qualifying remarks of
this type are not repeated in the discussion that follows.




demand curve yields a new equilibrium position A, in which output of
good x has increased by the fraction $OM$ and the price of x has risen relative
to that of y by $OW$.

We can use some simple algebra to learn more about this outcome. First,
the expression for the supply curve in quadrant a is $dQ_x/Q_x = S(dP_x - dP_y)$,
as noted above. The shifted demand curve can be written as follows:

$dQ_x/Q_x = -E(dP_x - dP_y) + dQ_G/Q_x$, (1a)

where $dQ_G/Q_x$ is the fractional amount by which demand shifts toward
good x because of public procurement, at the initially prevailing prices.
Solving these two equations simultaneously yields the following:

$dP_x - dP_y = \dfrac{dQ_G/Q_x}{S + E}$ (9)

and

$dQ_x/Q_x = \dfrac{S\,(dQ_G/Q_x)}{S + E}$. (9a)

The more elastic is either supply or demand, the less the price of x rises
relative to the price of y as demand shifts toward good x. Similarly, the
more price elastic is demand, the less the quantity of good x rises as
demand shifts to good x. But the more elastic is supply, the more output
responds to the shift in demand.
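
Equations (9) and (9a) can be evaluated directly. The following is a minimal sketch; the function name and the sample values of $S$, $E$, and the demand shift are illustrative assumptions, not figures taken from the text.

```python
def expenditure_incidence(S, E, shift):
    """Solve supply q = S*p together with shifted demand q = -E*p + shift.

    S     : elasticity of supply of good x (positive)
    E     : elasticity of demand for good x (positive)
    shift : dQ_G/Q_x, the fractional demand shift at initial prices
    Returns (p, q), where p = dP_x - dP_y and q = dQ_x/Q_x.
    """
    p = shift / (S + E)        # equation (9)
    q = S * shift / (S + E)    # equation (9a)
    return p, q


# Hypothetical values chosen only for illustration.
p, q = expenditure_incidence(S=0.2, E=1.0, shift=0.06)

# A more elastic supply curve: smaller price rise, larger output response.
p_elastic, q_elastic = expenditure_incidence(S=0.5, E=1.0, shift=0.06)

print(p, q, p_elastic, q_elastic)
```

The returned pair lies on both the supply curve and the shifted demand curve, which is the algebraic counterpart of the intersection in quadrant a.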

Figure 1 can be used to derive the same result. First, note that $OM$ in
figure 1 equals $OW \cdot S$ and $ME$ equals $OW \cdot E$. But $OM + ME = OE$.
Thus, $OE = OW \cdot (S + E)$, and $OW = OE/(S + E)$. But because
$OW$ is simply $dP_x - dP_y$ and $OE$ is $dQ_G/Q_x$, or $(m_g - m_p)\,dG/Q_x$, this
yields the result reported in equation (9). Similarly, $OM = S \cdot OE/(S + E)$,
as reported in equation (9a) in slightly different terminology.

Quadrant a can tell us the incidence on the side of uses of income of
a shift in purchases toward product x. But to learn about incidence on the
side of sources of income it is necessary to “go behind” the supply curve
to learn how relative factor prices are affected. This can easily be done,
of course, since the supply curve was derived from the more fundamental
relationships described in quadrants b-d. In particular, for the policy in
question we need only note that the indicated increase in output of good x
implies an inflow of capital of $ON$, which in turn raises the wage rate in x
by the fraction $OV_x$. By the same token, the wage rate in industry y falls
¹² A second curve could be added to quadrant b to show how any change in capital
utilization is reflected in the output of good y through the following equation:

$dQ_y/Q_y = -g_K\,(K_x/K_y)\,(dK_x/K_x)$. (2a)

This equation does not, of course, figure in the final solution, except to give the change in
output of good y.




by $OV_y$, as capital flows from that industry.¹³ Finally, though strictly
speaking the question is of no interest from the standpoint of incidence
analysis, since in that context only changes in relative product prices and
relative factor prices are relevant, we might want to solve for the changes
in the prices of goods x and y, given that $dP_K = 0$.¹⁴ This we can do in
quadrant d; the price of good x rises by $OW_x$, and that of good y falls by
$OW_y$. Of course, $W_yW_x$ equals $OW$ in quadrant a by construction.

IV. Tax Incidence

The model presented here accommodates analysis of taxes levied on nine
distinct tax bases: value added, retail sales, or income (1); only one of
the two products (2); the services of only one of the two factors in all its
uses (2); and the services of one of the two factors in one of the two
industries (4). But in the neoclassical world of this model we need examine
explicitly the incidence of only four of the nine taxes. The incidence
results for the rest of the nine follow directly by subtraction.¹⁵ Yet it will
be useful for the present purpose to examine the incidence of more than
the minimum number of taxes. In particular, we will examine the
incidence of the general sales or income tax, the tax on product x, the tax on
both uses of the mobile factor, the tax levied on the mobile factor in one
industry, and the tax levied on the immobile factor in one industry. In
each case we assume that tax proceeds are used to finance a distributionally
neutral increase in public spending. This last assumption is relaxed in
Section V, in which we consider the possibility of using these taxes to
finance an increase in public spending that is not neutral.

¹³ The results given in equations (9) and (9a) and equations (2)-(4) can be used to
solve for these capital flows and price changes. Thus

$dK_x/K_x = \dfrac{S\,(dQ_G/Q_x)}{f_K (S + E)}$, (9b)

$dP_{Lx} = \dfrac{S\,(dQ_G/Q_x)}{\sigma_x f_K (S + E)}$, (9c)

and

$dP_{Ly} = -\dfrac{S K_x\,(dQ_G/Q_x)}{K_y \sigma_y f_K (S + E)}$. (9d)

These results, like those of equations (9) and (9a), though different in appearance, are
the same as those reported in the mathematical treatment in McLure (1972b, p. 444).
The same approach can be employed to solve for the changes in absolute product prices,
given $dP_K = 0$.

¹⁴ One reason for doing this is that we might wish to choose an alternative numeraire,
say good y. To convert percentage changes in a given price under the numeraire
assumption $dP_K = 0$ to percentage changes under an assumption of $dP_y = 0$ is a simple matter.
One need only subtract from each percentage price change as originally calculated
(including $dP_K$) the calculated value of $dP_y$. An alternative and attractive approach is to
formulate the diagrammatic analysis presented here under the assumption of $dP_y = 0$.
Space constraints dictate that this alternative formulation be left as an exercise for the
interested reader.

¹⁵ See Musgrave (1959, pp. 348-54) and McLure (1971b, p. 29).




The supply curve developed in Section II and used in Section III to
examine expenditure incidence is based upon a no-tax situation. To be
able to allow for the influence of various kinds of taxes on the supply
curve, we must modify equations (3)-(6). Equations (3) and (4) become

$dK_x/K_x = -\sigma_x (dP^*_{Kx} - dP^*_{Lx})$ (3a)

and

$dK_y/K_y = -\sigma_y (dP^*_{Ky} - dP_{Ly})$. (4a)

In these equations (and in other equations below), asterisked terms
refer to changes in various factor prices inclusive of factor taxes. That is,
they refer to the changes in factor prices that affect entrepreneurial
decisions and costs of products. They are to be distinguished from their
unasterisked relatives, which indicate changes in factor prices net of
factor taxes, that is, what factor owners receive. The following relations
between factor prices gross and net of factor taxes will be used in this
section:¹⁶

$dP^*_{Kx} = dP_K + dT_K + dT_{Kx}$, (10a)

$dP^*_{Lx} = dP_{Lx} + dT_{Lx}$, (10b)

$dP^*_{Ky} = dP_K + dT_K$. (10c)


Note that while capitalists receive the same net price for capital services
in the two industries, entrepreneurs will pay different gross prices for
capital services if capital is taxed at different rates in the two industries.
Thus the asterisked value of the change in the price of capital carries a
sectoral subscript.

Equations (5) and (6) must be rewritten as follows to allow for the
existence of excise taxes, general consumption taxes, and taxes on factors:

$dP_x = f_L\,dP^*_{Lx} + f_K\,dP^*_{Kx} + dT_x + dT$ (5a)

and

$dP_y = g_L\,dP_{Ly} + g_K\,dP^*_{Ky} + dT_y + dT$. (6a)

These equations now state that the percentage change in the price of a
product is the weighted (by initial shares) average of the percentage
change in the price of the two factors, including any factor taxes, plus any
ad valorem tax levied on the product itself or upon consumption.

We can begin the analysis with the tax on the output of industry x.
Such a tax affects only two curves in the analysis, the curve in quadrant
d relating the price of good x to the wage rate in x and (through that) curve

¹⁶ Equations (3a), (4a), and (10a)-(10c) reflect the choice of taxes to be analyzed;
$dP_{Ly}$ appears without an asterisk because no tax on labor in y is to be examined.



Fig. 2. Analysis of incidence of excise tax on good x


$S$, the supply curve of good x. It moves both these curves vertically upward
by the amount of the tax to $P^*_x$ and $S^*$, as shown in figure 2.¹⁷ The
intersection of the shifted supply curve $S^*$ and the demand curve $D$ shows that
the tax increases the relative price of good x by $OW$ and that the output of
the taxed good falls by $OM$. Carrying the output effect to quadrant b, we
see that the tax induces a fractional capital outflow from the taxed
industry of $ON$. The reallocation of capital results in a fall in the wage
rate (given $dP_K = 0$) in the taxed industry of $OV_x$ and a rise in the wage
rate in the untaxed industry of $OV_y$, as shown in quadrant c. Finally, we
can use these effects and the curves in quadrant d, as before, to determine
the effects on the prices of the two goods. The price of good x rises by

¹⁷ It is a simple exercise in the manipulation of the diagrams to show that the upward
shift in curve $P_x$ in quadrant d to $P^*_x$ shifts the supply curve by the same amount.











$OW_x$, and that of good y rises by $OW_y$, the difference being the
change in relative product prices, which is equal to $OW$ in quadrant a.
Total payments to factors in industry x per unit of output fall in nominal
terms by $OW'_x$. By construction, the corresponding distance in quadrant d equals $OW'$ in
quadrant a, the amount by which the price of good x, net of tax, falls
relative to the price of good y.

As in the case of our examination of expenditure incidence, we can use
some simple algebra to elaborate on these results. First, the supply curve
in this case can be written as

$dQ_x/Q_x = S(dP_x - dT_x - dP_y)$, (7a)

since firms react to prices net of tax. Combining this equation with
equation (1) above, we have

$dP_x - dP_y = \dfrac{S}{E + S}\,dT_x$ (11)

and

$dQ_x/Q_x = -\dfrac{ES}{E + S}\,dT_x$. (11a)

By the same reasoning used in the previous section, we can easily derive
these results from figure 2. $OB = OM + MB$, which in turn equals
$-(E + S)\,OW$. But $OB$ also equals $-S\,dT_x$. Thus, $OW = S\,dT_x/(E + S)$,
as reported in equation (11). Finally, $OM = -ES\,dT_x/(E + S)$, as
reported in equation (11a). As partial equilibrium analysis would
suggest, the increase in the relative price of the taxed good induced by
excise taxation depends positively upon the elasticity of supply and
negatively upon the elasticity of demand.¹⁸ Changes in factor rewards
can be calculated directly from these results by the use of equations
(2)-(4), and those in absolute product prices by also using equations (5a)
and (6a). Or, in the spirit of the present analysis, they can be read from
the appropriate curves in quadrants c and d of figure 2.
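
The excise-tax solution can be checked numerically in the same way as the expenditure case. The sketch below simply evaluates equations (11) and (11a); the function name and the sample parameter values are assumptions made for illustration.

```python
def excise_incidence(S, E, dT):
    """Evaluate equations (11) and (11a) for an ad valorem excise on good x.

    S  : elasticity of supply of good x (positive)
    E  : elasticity of demand for good x (positive)
    dT : ad valorem tax rate on good x (dT_x in the text)
    Returns (p, q, net): the change in the relative price of x gross of tax,
    the fractional change in output of x, and the change in the relative
    price of x net of tax.
    """
    p = S * dT / (E + S)          # equation (11)
    q = -E * S * dT / (E + S)     # equation (11a)
    net = p - dT                  # price received by producers, relative to y
    return p, q, net


# Hypothetical values chosen only for illustration.
p, q, net = excise_incidence(S=0.2, E=1.0, dT=0.06)
print(p, q, net)
```

The result satisfies both the demand curve, $q = -Ep$, and the tax-shifted supply curve (7a), $q = S(p - dT_x)$.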

The next tax to be examined, a general consumption or income tax, is
particularly easy to analyze. Such a tax can be thought of as an ad
valorem tax levied at the same rate on both goods x and y. Thus it raises
both curve $P_x$ and curve $P_y$ in quadrant d by the amount of the tax, as
can be seen by inspection of equations (5a) and (6a). But because both
these curves rise by the same amount, the tax has no effect on the supply
curve for industry x. Thus it has no effect on relative product prices, the

¹⁸ Musgrave (1959, pp. 291-92) notes the analogous partial equilibrium result. It is
worth noting at this point that the expressions in equations (11) and (11a) are each 1 degree
higher in $E$ and $S$ than are those in equations (9) and (9a). Because of the nonuniform
wedge driven between costs and prices by excise taxes that is not present when government
procurement simply shifts the demand curve, more substitution in production and
consumption is involved in adjusting to excises than to expenditure shifts. This is noted as
well in McLure (1972a).



Fig. 3. Analysis of incidence of tax on capital in industry x


interindustry allocation of capital, or the relative rewards to factors.
It does, however, given the numeraire equation $dP_K = 0$, raise the
absolute prices of both goods x and y by the amount of the tax. And, what
is more crucial, it raises product prices relative to factor prices by the ad
valorem amount of the tax. Thus it is simply borne in proportion to initial
shares in national income (or consumption).

The analysis of a tax levied on capital services in only industry x is
more complicated than those presented thus far. First, the tax shifts
curve $L_x$ in quadrant c to the right by the amount of the tax (to $L^*_x$ in
fig. 3), as can be seen from equations (3a) and (10a). In addition, it
raises curve $P_x$ in quadrant d by $f_K$ times the amount of the tax, as shown
by equations (5a) and (10a), to $P^*_x$. To determine the effect of the tax on
the supply curve, we can choose an arbitrary point $N$ on the vertical axis
of quadrant c in figure 3. In the absence of the tax, the capital outflow
indicated by $ON$ would result in a fall in the wage rate in industry x of








$OV_x$, a fall in the price of good x of $OW_x$, and some (unspecified) rise in the
wage rate and product price in industry y. But given the tax on capital in x
and the choice of capital as numeraire, this capital outflow would result
in a fall in the wage rate in industry x of only $OV'_x$, as shown by curve $L^*_x$.
Similarly, the impact of the postulated capital flow on the price of x,
given the tax, can be seen from curve $P^*_x$ in quadrant d to be $OW'_x$.

Since the existence of the tax on capital services in industry x does not
affect the way the wage rate and product price in industry y respond to the
postulated capital inflow, the supply curve is shifted upward by the tax on
capital used in x by the amount $W_xW'_x$. But this shift can readily be seen
to be exactly equal to the ad valorem rate of tax on capital services used in
industry x, as follows. First, $W_xW'_x$ can be decomposed into two parts,
$W_xZ$ and $ZW'_x$. The first of these is simply equal to $f_L\,dT_{Kx}$. (The distance
$V_xV'_x$ equals $dT_{Kx}$, as noted above, and the slope of curve $P_x$ is $f_L$.) The
other is the distance between curves $P_x$ and $P^*_x$, noted above to be $f_K\,dT_{Kx}$.
Since $f_K + f_L = 1$, the entire distance $W_xW'_x$ equals the ad valorem rate
of taxation of capital in industry x, $dT_{Kx}$. This implies, of course, that this
tax raises the supply curve of x by exactly as much as an equal ad valorem
excise tax on good x. This, in turn, implies that with one exception the
tax on capital services in industry x affects all variables in the model
exactly as the excise tax does. The exception is the wage rate in industry x.

The effect the tax has upon the wage rate in x is given in quadrant c.
Suppose that the tax on capital services in x produces the outflow of
capital from the industry of $ON$ shown in figure 3. In such a case the
wage rate falls by $OV'_x$. But this result of a lower wage rate in industry x
cannot be generalized. Suppose that production in industry x is
characterized by a high elasticity of factor substitution, relative to the elasticity
of product demand. In that case it is possible for the wage rate in that
industry to rise relative to the net price of capital services. This case is
shown in figure 3 for the capital outflow $ON'$. The point is that the capital
outflow reduces the output of good x and raises its price as well as reducing
the marginal physical product of labor. If demand is sufficiently price
inelastic and factor substitution in industry x sufficiently elastic, the former
influence can outweigh the latter. In that case the tax on capital in x will
actually raise the wage rate in that industry, relative to the return on
capital.

The final two taxes to be considered are a tax on labor in industry x and
a tax on capital in both industries. These taxes are considered together
because both are borne entirely by the owners of the taxed factors. The tax
on labor in industry x shifts two curves, $L_x$ in quadrant c and $P_x$ in
quadrant d. But because both these curves shift to the left by exactly the

¹⁹ If the numeraire equation $dP_y = 0$ is chosen, the sign of $dP_{Lx}$ depends simply upon
$(\sigma_x - E)$ (see McLure 1971b, p. 39).




amount of the tax, the supply curve in quadrant a is unaffected, and the
tax has no effect on relative product prices or the allocation of capital
between industries. Nor does it affect the relative returns to capital and
labor in the untaxed industry. Its only effect is to reduce the net wage
rate in the taxed industry by the amount of the tax. Thus labor specific
to one industry bears the full burden of any tax levied on it. Since the
wage rate gross of tax in industry x and the wage rate in industry y are
both unchanged, the absolute price of neither good changes.

From this result it is easy to see why the tax on capital services in x
and the excise tax on x have the same effects, except on the wage rate, if
levied at the same ad valorem rate. The excise is equivalent to an ad
valorem tax on both factors used in x. But the part of the excise that is
equivalent to an ad valorem tax on labor in x affects only the net wage
rate in that industry. Thus in total the excise and the tax on capital affect
all other product and factor prices identically. In terms of the diagrams,
the part of the excise that is equivalent to a tax on capital in x shifts $L_x$
to the right, but the part that is equivalent to a tax on labor in x shifts it
to the left by an equal amount, leaving the curve unchanged on balance.
The two taxes together raise curve $P_x$ by $dT$, the amount of the ad valorem
tax. But so does the tax on capital services in industry x, which, however,
also affects $L_x$.

The previous result was the standard Ricardian result that a tax on an
immobile factor is borne by that factor. A similar, but less widely
recognized, result occurs if a tax is imposed on all uses of the mobile factor,
capital.²⁰ Such a tax shifts curves $L_x$ and $L_y$ in quadrant c to the right by
the amount of the tax, and it raises curves $P_x$ and $P_y$ by $f_K\,dT_K$ and $g_K\,dT_K$,
respectively. But it has no effect on the supply curve for good x.
This can be demonstrated using figure 4 and the reasoning employed in
the discussion of figure 3. Alternatively, we can simply use the results for a
tax levied on capital services in one industry, since a general tax on
capital services is equivalent to a tax levied on capital services at the same
rate in each industry. The part of the general capital tax that is equivalent
to a tax on capital in x shifts the supply curve upward by the ad valorem
amount of the tax; the part equivalent to a tax on capital in y offsets this
shift. Yet another way to see that the supply curve is unaffected is to recall
that a tax on capital in one industry has the same impact on the supply
curve as an equal ad valorem excise on the output of that industry.
But we have already shown that an excise tax levied at the same ad valorem
rate on both industries does not affect the supply curve. In any event,
relative product prices and the interindustry allocation of capital are
unaffected by the general tax on capital services. But the wage rates in

²⁰ There is no need to examine explicitly the incidence of a tax on labor in both
industries. Such a tax can be broken down conceptually into two taxes, one on labor in
industry x and one on labor in industry y. Since each of the parts would be borne by the
taxed labor, the entire tax is borne by labor.



Fig. 4. Analysis of incidence of tax on all capital income


the two industries and the prices of the two goods rise by the ad valorem
amount of the tax, given the choice of capital as the numeraire. In terms
of figure 4, the solution is $OV$ in quadrant c and $OW$ in quadrant d.
That $OW$ equals $dT_K$ can be shown by the same reasoning employed
above in the discussion of figure 3. In real terms this implies that owners
of capital bear the full burden of the universal tax on capital even though
capital is mobile between industries.

V. Balanced-Budget Incidence

Thus far we have considered the incidence of government expenditures
financed in a distributionally neutral way (Section III) and the incidence
of several taxes used to finance distributionally neutral expenditures
(Section IV). Now we consider briefly how the analysis of the last two
sections can be combined to allow us to investigate the incidence of
balanced-budget increases in government expenditures in the general





Fig. 5. Analysis of balanced-budget incidence


case in which neither expenditures nor their financing is distributionally
neutral.

We need to examine only the effect of financing a nonneutral increase
in public expenditures through a tax on good x. There is no need to
examine the cases of financing through taxes on either the immobile factor
or the mobile factor in all its uses. Since those taxes are simply borne by the
taxed factors, they can readily be combined with nonneutral expenditure
patterns. And since the tax on good x and the tax on capital in x have
different results only for the wage rate in x, there is no need to consider
the case of balanced-budget financing through a tax on capital in x.
Finally, we can limit ourselves to quadrant a, since all that is of real
analytical interest occurs there, though to obtain the full solution for all the
variables in the model, it would be necessary to carry the results from
quadrant a into the remaining quadrants, as we have done in earlier
sections.

The diagrammatic analysis is given in figure 5, which combines aspects
of quadrant a in figures 1 and 2. The demand curve shifts from $D$ to $D_G$
because of the public expenditure policy, and the supply curve shifts
from $S$ to $S^*$ because of the tax on good x.²¹ The new equilibrium is

²¹ Curve $P_x$ in quadrant d is the only curve in quadrants b-d that is affected by the
balanced-budget program financed by a tax on good x. It rises by the amount of the tax,
as demonstrated in Section IV. Similarly, for each of the taxes not explicitly considered in
this section, the shifts of curves in quadrants b-d are exactly those described in Section IV.
In each case the demand curve shifts as described in Section III.

²² Careful examination of fig. 5 and eqq. (12) and (12a) will reveal that strictly speaking
curves $D_G$ and $S^*$ are not consistent, in that $D_G$ should intersect the horizontal axis no
further to the right of the origin than $S^*$ intersects the vertical axis above the origin, if
$m_g \le 1$ and $m_p \ge 0$. The exaggeration in fig. 5 allows the possible outcomes described
in the text to be shown more clearly than would otherwise easily be the case.




determined by the intersection of the two shifted curves, $D_G$ and $S^*$.
In this particular case the price of good x would rise relative to that of
good y by $OW$ and output of good x would rise by the fraction $OM$.
Of course, if the shift in aggregate demand toward good x were less than
shown by curve $D_G$, the price rise would be less than shown, and if the
demand shift were small enough, output of good x would fall, rather than
rise. Such a case is shown by curve D,. Finally, if demand shifts away from
good x, rather than toward it, as it does in the case of curve £)*, the
relative price of good x would rise less than if the tax financed a neutral
expenditure program, and it might even fall.

We can use equations (1a) and (7a) to solve explicitly for the changes
in relative prices induced by balanced-budget expenditures. Alternatively
(and equivalently), we can simply combine the results given in
equations (9) and (9a) and equations (11) and (11a). In either event, we
reformulate the fractional shift in demand for good x resulting from the
increase in government expenditures as follows:

$dQ_G/Q_x = (m_g - m_p)\,dG/Q_x$, (12)

where $dG$ is the balanced-budget increase in spending, and $m_g$ and $m_p$
are the public and private marginal propensities to spend on good x.
Moreover, we note that because of the assumption of balanced-budget
fiscal operations, the following holds:

$dG = Q_x\,dT_x$. (12a)

Using these relations gives us the following solution for the balanced-budget
increase in spending under consideration:

$dP_x - dP_y = (S + m_g - m_p)\,dT_x/(S + E)$ (13)

and

$dQ_x/Q_x = -S[E - (m_g - m_p)]\,dT_x/(S + E)$. (13a)

Of course, if $m_g = m_p$, these results reduce to equations (11) and (11a),
as they must.
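
A short numerical check of equations (13) and (13a) is sketched below. It evaluates the two expressions and confirms that the result lies on both the shifted demand curve, (1a) combined with (12) and (12a), and the tax-shifted supply curve (7a). The function name and the parameter values are illustrative assumptions only.

```python
def balanced_budget_incidence(S, E, m_g, m_p, dT):
    """Evaluate equations (13) and (13a).

    S, E     : elasticities of supply and demand for good x (positive)
    m_g, m_p : public and private marginal propensities to spend on x
    dT       : ad valorem tax on good x that finances the spending (dT_x)
    Returns (p, q) = (dP_x - dP_y, dQ_x/Q_x).
    """
    m = m_g - m_p
    p = (S + m) * dT / (S + E)         # equation (13)
    q = -S * (E - m) * dT / (S + E)    # equation (13a)
    return p, q


# Hypothetical values chosen only for illustration.
S, E, dT = 0.2, 1.0, 0.06
p, q = balanced_budget_incidence(S, E, m_g=0.5, m_p=0.2, dT=dT)

# Neutral spending pattern (m_g = m_p): the excise-tax case.
p_neutral, q_neutral = balanced_budget_incidence(S, E, m_g=0.3, m_p=0.3, dT=dT)

print(p, q, p_neutral, q_neutral)
```

With $m_g = m_p$ the function returns the values given by equations (11) and (11a) for the same $S$, $E$, and $dT_x$.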

The approach developed in earlier sections can be employed to
demonstrate these results diagrammatically, though dealing with a balanced-budget
change is slightly more complicated than treating either the tax
or expenditure sides by itself. In figure 6, $MB = -S \cdot OW$ and $MF = E \cdot OW$.
Thus $BF = OW \cdot (E + S)$. But $BF$ also equals $(S + m_g - m_p)\,dT_x$.
From this we can see that $OW = (S + m_g - m_p)\,dT_x/(E + S)$, as stated
in equation (13). Finally, to solve for $OM$, the fractional change in the

²³ The results reported in this section are consistent with those in McLure (1972b, pp.
447-50).



Fig. 6 (axis label: $dP_x - dP_y$)
output of good x, we need only note that $B'W = OM/S$ and $WF' = OM/E$,
so that $B'F' = OM(E + S)/ES$ and $OM = B'F' \cdot ES/(E + S)$.
But we can rewrite $B'F'$ as $(m_g - m_p)\,dG/(Q_x E) - dT_x$. Thus $OM =
S(m_g - m_p - E)\,dT_x/(E + S)$, as noted in equation (13a).

Inspection of figure 6 reveals that the price of good x can rise by more
than the amount of tax on good x only if demand shifts toward that good
enough so that output of that good rises on balance. Equation (13a)
shows that this result can occur only if $(m_g - m_p)$ exceeds $E$. Of course,
equation (13) gives this result directly. Finally, we can see that the price
of good x can fall relative to that of good y even if a tax on good x is
employed to finance a balanced-budget increase in public spending if
supply is relatively price inelastic and demand shifts strongly toward
good y. In particular, $dP_x - dP_y$ has the sign of $S + m_g - m_p$, which is
negative for relatively small values of $S$ and $m_g$ and large values of $m_p$.
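
The three regimes just described can be illustrated by the ratio of the relative price change to the tax, $(S + m_g - m_p)/(S + E)$, which follows from equation (13). The sketch below is illustrative only; the helper name and all parameter values are assumptions.

```python
def price_change_per_unit_tax(S, E, m_g, m_p):
    """Ratio (dP_x - dP_y) / dT_x implied by equation (13)."""
    return (S + m_g - m_p) / (S + E)


# Demand shifts strongly toward x (m_g - m_p = 0.8 exceeds E = 0.5):
# the relative price of x rises by more than the tax.
over_shift = price_change_per_unit_tax(S=0.2, E=0.5, m_g=0.9, m_p=0.1)

# Demand shifts mildly toward x (m_g - m_p = 0.3 is below E = 0.5):
# the relative price rises, but by less than the tax.
partial_shift = price_change_per_unit_tax(S=0.2, E=0.5, m_g=0.4, m_p=0.1)

# Inelastic supply and demand shifting strongly toward y
# (S + m_g - m_p = 0.1 + 0.1 - 0.6 < 0): the relative price of x falls.
price_fall = price_change_per_unit_tax(S=0.1, E=0.5, m_g=0.1, m_p=0.6)

print(over_shift, partial_shift, price_fall)
```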


VI. Discrete versus Small Changes 

Strictly speaking, the Harberger model, as well as its many applications
and extensions, is applicable only to the analysis of small taxes and
expenditures, as it is written in terms of differentials. Yet it is commonly
utilized as the basis for policy decisions involving tax changes that are
far from small. The appropriateness of the model for such uses has been
the subject of increasing skepticism (see Krauss 1972, for example). In
this section we examine briefly one of the problems posed by using the
Harberger analysis of differential changes to approximate the effects of
discrete tax and expenditure changes. But the problem is only exposited.
No attempt is made either to generalize the model to deal adequately
with discrete changes or to perform a sensitivity analysis of the inaccuracy
resulting from using the Harberger model to deal with discrete changes.





Those more demanding tasks are beyond the scope of this brief discussion
(but see Shoven and Whalley 1972).

In a nutshell, the problem is that to be fully consistent with the Harberger
model as usually written, all the curves in figures 1-6 should be
straight lines.²⁴ But it is virtually certain that for discrete changes in
taxes or expenditures no more than two of the curves will in fact be straight
lines. If such is the case, the Harberger model, and its various extensions
and applications, can be expected to yield quantitatively incorrect
results.

Whether curves $D$ and $L_x$ should be straight lines does not concern us.
If the income-compensated elasticity of product demand with respect to
relative product prices is not constant, then $D$ should not have a constant
slope. And if the elasticity of factor substitution is not constant, then $L_x$
will not be a straight line. But these are empirical questions which, while
important, are not central to the discussion of this section. Rather, the
crucial questions are whether curves $Q_x$, $P_x$, $P_y$, and $S$ are straight lines.

These questions can be answered on analytical grounds.

The slope of curve Q_x in quadrant b is the inverse of the elasticity of
output of good X with respect to inputs of capital. In general, this
elasticity is not constant for increasing doses of capital applied to a
constant amount of labor. Only if the elasticity of factor substitution is
unity is the output elasticity constant. If (and only if) the elasticity of
factor substitution is less than unity, as seems likely, the output
elasticity falls as more and more capital flows into industry x. Thus curve
Q_x should increase in slope as capital flows into industry x. (Moreover,
the increase in the slope is greater, the smaller is the elasticity of factor
substitution.) In qualitative terms, the curve relating capital inflows to
changes in output should look like Q_x*, rather than Q_x (see fig. 7).
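The behavior of the output elasticity described above can be illustrated numerically. The sketch below assumes a CES production function, a functional form the text does not specify; the function names, the distribution parameter `delta`, and the input levels are hypothetical choices made only for illustration.

```python
import math


def capital_output_elasticity(capital, labor, sigma, delta=0.5):
    """Elasticity of output with respect to capital under an ASSUMED CES form.

    X = [delta*K**rho + (1 - delta)*L**rho]**(1/rho), rho = (sigma - 1)/sigma,
    where sigma is the elasticity of factor substitution.
    """
    if math.isclose(sigma, 1.0):
        # Cobb-Douglas limit: the output elasticity is the constant delta.
        return delta
    rho = (sigma - 1.0) / sigma
    capital_term = delta * capital ** rho
    return capital_term / (capital_term + (1.0 - delta) * labor ** rho)


def q_curve_slope(capital, labor, sigma, delta=0.5):
    """Slope of the output curve: the inverse of the output elasticity."""
    return 1.0 / capital_output_elasticity(capital, labor, sigma, delta)


if __name__ == "__main__":
    # Labor is held fixed at 1.0 while capital flows into the industry.
    for sigma in (0.25, 0.5, 1.0, 2.0):
        slopes = [round(q_curve_slope(k, 1.0, sigma), 3) for k in (1.0, 1.5, 2.0)]
        print(f"sigma={sigma}: slope as capital rises -> {slopes}")
```

Under these assumptions the slope is constant only at sigma = 1, rises with capital when sigma is below 1, and rises faster the smaller sigma is, which matches the qualitative argument in the text.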

Moving to quadrant c, we see at once that curve L_y cannot generally
have a constant slope. Since that slope is -σ_y K_y/K_x, it falls in absolute
value as capital flows into industry x, even if we assume that σ_y is a
constant. Of course, if K_y/K_x is initially quite large, curve L_y is virtually


* Each of equations (1)-(6) describes a straight line. One problem not considered in
this paragraph is that in the discrete case the demand curve is likely to shift. For
differentially small tax policies it is reasonable to assume for analytical convenience
that marginal propensities to consume good x are equal for all economic agents. But for
discrete tax changes this assumption becomes untenable even as an analytical device,
since in the limit marginal propensities are equal to average propensities. Thus, let us
simply note that redistribution of discrete size is likely to shift the demand curve, but
in an undetermined way. Mieszkowski (1967, pp. 261-62) has analyzed the impact of
relaxing the assumption that all expenditure patterns are identical at the margin. While
this problem, which is closely related to the question of expenditure incidence, is easily
treated algebraically, it would be difficult to incorporate it in the present diagrammatic
analysis. For a potentially more damaging criticism, in which it is claimed that excess
burdens of taxation make the assumption of constant real income untenable, see
Ballentine and Eris (1973).



Fig. 7. Illustrative analysis of expenditure incidence under linear and nonlinear
versions of the Harberger model

vertical, and its slope is not much affected by even proportionally large
inflows of capital into industry x (i.e., by large values of dK_x/K_x). But if
K_y/K_x is initially small, L_y will be relatively flat, and even small positive
values of dK_x/K_x will flatten it further by an appreciable amount.*
In qualitative terms, we can expect the appropriate curve to resemble L_y*
in figure 7, instead of the linear form L_y.

Finally, curves P_x and P_y in quadrant d have slopes equal to the initial
shares of labor in the two industries. By the same reasoning that prevailed
in quadrant b, we know that these slopes are constant only if the elasticity
of factor substitution is unity. But if factor substitution is less than unitary
elastic, labor's share in either industry will increase with its price, and in
qualitative terms the curves will look like P_x* and P_y*, rather than P_x and
P_y, in figure 7.

* For example, if K_y/K_x were initially 1/2, a capital flow of dK_x/K_x = 1/2 would
result in a value of K_y/K_x of 0 and a horizontal L_y curve.









Turning now to quadrant a we can see that the supply curve is likely 
to bend upward to the right. This can be seen from an inspection of 
either the other three quadrants of figure 7 or equation (8). If both 
elasticities of factor substitution are less than one, an increase in output of 
good X is accompanied by a fall and a rise in/j,, a rise in KJK^ and 
a fall in ^ • Only the fall in tends to swing the supply curve S (derived 
from the unasterisked or straight-line curves in quadrants b-d) downward.
All the other changes tend to swing it upward. From this it seems
reasonable to conclude that in general supply becomes less elastic as we move
from the origin to the right in quadrant a of figure 7, and more elastic as
we move to the left, though the results clearly depend upon the values of
the parameters of the model.*

The importance of this qualification can be seen by considering the 
expenditure policy indicated by the shift of the demand curve from D to 
Dg in figure 7. The linear version of the model gives the results indicated 
by the solid line connectmg points corresponding to OM and OW^ 
ON, OFj, and OVy, 0W„ and OlVy, reading around the diagram m a 
clockwise direction from quadrant a. The curvilinear model, on the other
hand, gives the results indicated in the same notation, but corresponding
to the points connected by a broken line. Comparing the two results,
we find that the linear model understates the change in relative product
and factor prices and overstates the quantity adjustment induced by the
expenditure policy. Quantitative estimates of the extent to which
linearization in the Harberger model causes results to be inaccurately
stated are beyond the scope of this paper.

In a similar vein, we could introduce an excise tax into the system to
see how the linearization might lead us astray. But again, doing so would
lead us further into the quantitative evaluation of errors resulting from
linearization than we should go after a paper has reached this length.
Suffice it to say that under certain circumstances the errors can be
substantial and that users of the linear version of the Harberger model

** If both these elasticities are greater than one, most of what is said here is revened 
If only the elastiaty of subsutution in industry jt exceeds unity, supply is certain to be las 
elastic as output mcreasa than m the hnear version If the elasuaty exceeds umty only 
in industry x, supply is likely to be more elasttc as output mcreasa 

Differentiating equation (8) totally helps one to see the roles played by the various
parameters. Unfortunately, the total derivative is not unambiguous in sign. It may be
worth noting explicitly that whereas P_x/P_y can increase without bound, it is bounded
on the bottom by P_x/P_y = 0.

* Since the primary interest here is in qualitative results, no effort has been made to
draw curve S* to correspond exactly to curves Q_x*, L_y*, L_x, P_x*, and P_y*.
Unfortunately, even fairly casual treatment of this subject tends to plunge us directly
into the sensitivity analysis that is beyond the scope of this paper.

We have a familiar index number problem here. The results reported here would
be reversed if the slopes derived from the posttax, rather than the pretax, equilibrium
were employed in the comparison. A similar point is noted in Mieszkowski (1969, p.
1109).






should keep this caveat in mind in offering policy advice and in making
empirical estimates on the basis of the linear model. 


References 

Ballentine, J. Gregory, and Eris, Ibrahim. "On the General Equilibrium Analysis
of Tax Incidence." Discussion Paper no. 38, Rice Univ., Program of
Development Studies, 1973.

Harberger, Arnold C. "The Incidence of the Corporation Income Tax." J.P.E.
70 (June 1962): 215-40.

---. "Efficiency Effects of Taxes on Income from Capital." In Effects of
Corporation Income Tax, edited by Marian Krzyzaniak. Detroit: Wayne State
Univ. Press, 1966.

Krauss, Melvyn. "Differential Tax Incidence: Large versus Small Tax Changes."
J.P.E. 80 (January/February 1972): 193-97.

McLure, Charles E., Jr. "Interregional Incidence of General Regional Taxes."
Public Finance 24 (1969): 457-83.

---. "Taxation, Substitution, and Industrial Location." J.P.E. 78
(January/February 1970): 112-32. (a)

---. "Tax Incidence, Macroeconomic Policy, and Absolute Prices." Q.J.E. 84
(May 1970): 254-67. (b)

---. "The Design of Regional Tax Incentives for Colombia." In Fiscal Reform
for Colombia, edited by Malcolm Gillis. Cambridge, Mass.: Harvard Law
School, 1971. (a)

---. "The Theory of Tax Incidence with Imperfect Factor Mobility."
Finanzarchiv 30 (1971): 27-48. (b)

---. "General Equilibrium Incidence Analysis: The Harberger Model after
Ten Years." Discussion Paper no. 37, Rice Univ., Program of Development
Studies, 1972. (a)

---. "The Theory of Expenditure Incidence." Finanzarchiv 30 (1972):
432-53. (b)

Mieszkowski, Peter M. "On the Theory of Tax Incidence." J.P.E. 75 (June
1967): 250-62.

---. "Tax Incidence Theory: The Effects of Taxes on the Distribution of
Income." J. Econ. Literature 7 (December 1969): 1103-24.

Musgrave, Richard A. The Theory of Public Finance. New York: McGraw-Hill,
1959.

Shoven, John B., and Whalley, John. "A General Equilibrium Calculation of the
Effects of Differential Taxation of Income from Capital in the U.S." J. Public
Econ. 1 (November 1972): 281-321.



An Economic Theory of Suicide 


Daniel S. Hamermesh 

Michigan State University

Neal M. Soss

Bureau of the Budget, State of New York


. . . as soon as the terrors of life reach the point at which
they outweigh the terrors of death, a man will put an end
to his life. [A. Schopenhauer, On Suicide]


Although sociologists have developed and tested numerous theories about
suicide, economists have not analyzed this phenomenon. We derive an
economic theory of suicide and test its implications using (1) data by
age in many developed countries, (2) a time series, 1947-67, by age
group in the United States, (3) a cross section by state and age group in
1960. Most of our predictions are verified. Particularly striking are the
response to unemployment, as strong in the postwar period as in the
Depression and stronger among older individuals, and the negative
effect of increased permanent incomes among all but the youngest age
group, a result found in both the time series and the cross section.


Since the appearance of Durkheim's Le suicide in 1897, sociologists have
constructed numerous theories to explain patterns in suicide rates both
within and across societies. Economists have not considered the problem
of suicide, although it is surely one involving individual decision making


The work for this paper was done while both authors were at Princeton University.
They thank Joshua Greene for his highly competent research assistance. Ronald
Ehrenberg, Ray Fair, Robert Goldfarb, Simon Rottenberg, Burton Weisbrod, and the
referees provided helpful comments on an earlier draft. Participants in seminars at
Princeton University, Michigan State University, and the National Bureau of Economic
Research also offered useful suggestions.



and often also economic factors.¹ Just as economics has served to improve
our understanding of fertility (cf. Becker 1960; Schultz 1969) and
marriage (Silver 1965), so it may provide a testable theory of suicide.
The empirical analysis should enable us to measure another effect on
society of both long-run economic growth and cyclical fluctuations in
income. In this paper we outline several sociological theories and then
develop a new, economic theory of suicide. Some implications of this
theory are then tested against various sets of data.


I. Sociological Theories and an Economic Theory 

Most subsequent work by sociologists in the study of suicide has been
motivated by Durkheim. In addition to providing a model for empirical
research by sociologists, he summarized existing knowledge about
suicide patterns by age, literacy, and other demographic and sociological
variables and constructed a typology for classifying suicides. In a departure
from the Durkheimian model, Henry and Short (1954) developed the
frustration-aggression theory. According to them, suicide results from
individuals' frustration in their attempts to achieve their social goals and
the ensuing aggressive feelings, directed either at themselves (suicide)
or at others (homicide). They postulate that economic improvement
leads to a decrease in frustration and thus aggression, and they
concentrate most of their empirical work on the relation of suicide rates to
the business cycle. They also argue that frustration and aggression occur
differentially depending upon the degree of external restraint imposed
upon the individual. Since they assume that external restraint is inversely
related to status, their theory implies a positive correlation between suicide
rates and status. Their empirical work demonstrates conclusively only
the positive correlation of suicides and unemployment in the United
States before 1942.

Undoubtedly, much of the variation in suicide rates among ethnic
and demographic groups can be explained by these and other sociological
theories. Several aspects of this problem, particularly variations in suicides
by age and income, that cannot simultaneously be reconciled by any one
of these theories may well be rationalized by an economic theory.
Admittedly, some suicidal behavior can in no way be attributed to
economic factors. We contend, though, that some is due to economic


¹ In the earliest available records in Great Britain, covering 1770 through 1830,
one-fourth of the recorded suicides were attributed to poverty (Fedden 1938, p. 203).
Lendrum (1933) analyzes 419 male suicide attempts and finds that over one-third of those
giving a motive cite economic problems. Shneidman and Farberow (1965, p. 36) analyze
a sample of suicide notes and find 8 percent of those listing a specific cause mention
poverty or business difficulties.




decision making, so that some of the variation in suicide rates should be
explicable using hypotheses derived from an economic theory.²

Let us postulate the following utility function for the average
individual in a group of people with permanent income YP:

$$U_m = U[C(m, YP) - K(m)], \qquad U' > 0, \tag{1}$$

where m is his age and K a technological relation describing the cost each
period of maintaining himself alive at some minimum level of subsistence.³
(Presumably K' > 0.) If this is the utility of the average individual age m
with permanent income YP, then the present value of his expected
lifetime utility at age a is

$$Z(a, YP) = \int_a^{\omega} e^{-r(m-a)}\, U_m\, P(m)\, dm, \tag{2}$$

where r is the private discount rate, ω is the highest attainable age, and
P(m) is the probability of survival to age m given survival to age a. The Z,
then, is a decreasing function of a and an increasing function of YP.
Let b_i ~ N(0, σ²) be the ith individual's taste for living, or conversely,
his distaste for suicide. The variable b_i is in units commensurate with the
utility function and is subject to any transformations performed on
U_m.⁴ It is assumed to be defined for the cohort at birth.
Following the spirit of the epigraph to this paper, we assume that an
individual kills himself when the total discounted lifetime utility remaining
to him reaches zero. This point must be a maximum, for the remaining
lifetime utility is decreasing with age. Our hypothesis, therefore, is that
the ith individual commits suicide if and when Z_i(a, YP) + b_i = 0.
The fraction of individuals in the cohort born at time (t - a) who commit
suicide at age a is

$$S(a) = f[-Z(a, YP)], \tag{3}$$

where f(·) is the density function for b_i.⁵ In our model, the instantaneous
suicide rate is simply the fraction of individuals in the cohort for whom


² This is not the first analysis of the economics of mortality. However, previous work
neither formulates theories nor tests hypotheses. Hansen (1957) presents a calculation of
the loss in GNP resulting from child mortality; Holtmann and Ridker (1965) compute the
welfare loss arising from the early incurrence of burial costs of individuals who die
prematurely. We calculate that the loss in human capital resulting from suicides among
males ages 25-64 in 1959 was $740 million, based on a discount rate of 6 percent (Census
of Population [1960], PC(2)-7B, and U.S. Office of Vital Statistics, Vital Statistics [1959]).

³ Grossman (1972) has argued that this function can be affected by the individual's
earlier choice of health expenditures.

⁴ Our results are valid for any distribution which is positive over the entire real number
line and has positive first and second derivatives at the left tail.

⁵ We are explicitly assuming that no members of the birth cohort die from other
causes before age a. Relaxing this assumption by dividing equation (3) by the fraction of
the cohort actually remaining at age a changes none of our theoretical results on the
relation of suicide to age and income.





Z(a, YP) reaches -b at age a. Differentiating equation (3) totally:

$$dS = -f' \left( \frac{\partial Z}{\partial a}\, da + \frac{\partial Z}{\partial YP}\, dYP \right). \tag{4}$$

Now ∂Z/∂a < 0, both because K' > 0 and because there is less gross
consumption left to enjoy as a person ages. Also, ∂Z/∂YP > 0, so we
can use equation (4) to derive the predictions that the suicide rate, S,
will rise with age and decrease with permanent income.
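The two predictions just derived can be checked in a small numerical sketch of the model. All functional forms and parameter values below are hypothetical (consumption equal to YP, a linear subsistence cost K, log utility, linear survival, and the listed constants); the text specifies only the general structure of the model, so this is an illustration rather than the authors' calculation.

```python
import math
from statistics import NormalDist

OMEGA = 100       # highest attainable age (assumed value)
R = 0.06          # private discount rate (assumed value)
TASTE_SD = 10.0   # standard deviation of the taste for living b_i (assumed)


def period_utility(age, yp):
    """U[C(m, YP) - K(m)] with ASSUMED forms C = YP and K(m) = 1 + 0.02*m."""
    net_consumption = yp - (1.0 + 0.02 * age)
    return math.log(1.0 + net_consumption)  # increasing in net consumption


def lifetime_utility(a, yp):
    """Discrete-time analogue of Z(a, YP): discounted expected utility from age a."""
    horizon = OMEGA - a
    total = 0.0
    for m in range(a, OMEGA + 1):
        survival = (OMEGA - m) / horizon  # assumed P(m), given survival to age a
        total += math.exp(-R * (m - a)) * period_utility(m, yp) * survival
    return total


def suicide_rate(a, yp):
    """S(a) = f[-Z(a, YP)], with f the N(0, TASTE_SD**2) density."""
    return NormalDist(0.0, TASTE_SD).pdf(-lifetime_utility(a, yp))


if __name__ == "__main__":
    for age in (20, 40, 60, 80):
        print(age, round(lifetime_utility(age, 8.0), 2), suicide_rate(age, 8.0))
```

Because Z stays positive here, -Z lies in the left tail of the density, so S rises as Z falls with age and falls as Z rises with permanent income.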
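Taking the total differential of equation (4), we obtain:

$$\begin{aligned}
d^2S = {} & \left[-f'\,\frac{\partial^2 Z}{\partial a^2} + f''\left(\frac{\partial Z}{\partial a}\right)^2\right] da^2 \\
& + 2\left[-f'\,\frac{\partial^2 Z}{\partial YP\,\partial a} + f''\,\frac{\partial Z}{\partial YP}\,\frac{\partial Z}{\partial a}\right] dYP\,da \\
& + \left[-f'\,\frac{\partial^2 Z}{\partial YP^2} + f''\left(\frac{\partial Z}{\partial YP}\right)^2\right] dYP^2.
\end{aligned} \tag{5}$$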


Of the three terms, only the sign of the third can be determined
unambiguously. It is positive, given the reasonable assumptions that we are
dealing with the left tail of the normal distribution (so that f' and
f'' > 0) and that the marginal utility of lifetime income decreases with
increases in YP so that ∂²Z/∂YP² < 0. The signs of the other terms are
ambiguous. The first term in ∂²S/∂a² is negative if Z is decreasing in a,
and the second is positive. Similarly, the first term in ∂²S/(∂YP ∂a) is
positive and the second negative.⁶

The age-specific suicide rates, then, will depend upon the distribution
of tastes against suicide and the distribution of permanent incomes.
Several specific hypotheses arise from the economic theory we have
outlined. We expect that the rate of suicide increases with age (∂S/∂a > 0);
that the suicide rate is inversely related to permanent income
(∂S/∂YP < 0), but that the marginal absolute effect on suicide declines as
permanent income increases (∂²S/∂YP² > 0).⁷


⁶ The analysis could be expanded to include expected utility based on the probability
of employment at each age. Unless severe restrictions are placed on differences by ages
in the elasticities of expectations about future unemployment in response to changes in
current unemployment, the only result is that suicide rates will rise as unemployment
increases. In Section IV we examine the role of unemployment and, by inference,
differences by age in these elasticities.

⁷ In addition to the economic aspects of suicide discussed here, some demographic
differences in suicide rates are well established. The rate is lower for females than males
in all countries compiling reliable data, and in the United States the nonwhite suicide
rate is slightly less than half that of whites (see U.S. Office of Vital Statistics, Vital
Statistics [1967], II.a: 1.20). These differences are subsumed under the taste variable, b,
in our theory, an assumption that allows us to concentrate on economic factors. A
sociological theory that performs well in explaining these demographic differences is that
of Gibbs and Martin (1964).





II. Empirical Evidence: Introductory Considerations

The deficiencies in the data on suicide rates are well known. (Douglas
[1967] presents a detailed discussion of this problem.) Underreporting of
suicide rates varies in different jurisdictions depending upon the
persistence of the coroner and the degree to which society stigmatizes suicide
as a form of deviant behavior. This difficulty implies that any
cross-country comparisons of suicide rates are likely to be worthless; the
underreporting problem is undoubtedly one reason for the low suicide
rates observed in predominantly Catholic countries.⁸ Within the United
States, the use of suicide rates by states in a cross-section regression model
could be dangerous because errors in the dependent variable are correlated
with the true (observed) values of the income variables. Errors may also
be introduced into any correlation of income and suicide by the
requirement that an individual be covered for 2 years before an insurance
company will pay off on a death by suicide (Gregg 1959, pp. 139-40).

While we admit that there are problems in using official statistics on
suicide rates, a consideration of them suggests that the data should be
useful if handled judiciously. Data on suicide rates by age within any
country should embody very few problems; there is little reason to expect
systematic differences in underreporting of suicides by age. The degree
of underreporting may change over time as the moral stigma attached to
suicide decreases. We can, however, account for this difficulty in Section
IV and still make use of time-series data on suicide rates. Data from a
narrow jurisdiction in which one official has charge of all decisions on
causes of death should be fairly reliable. Finally, the use of the
appropriate demographic measures should enable us in Section V to remove
most of the errors arising from comparisons across jurisdictions within a
fairly homogeneous society.


III. Suicide by Age: International Evidence

Table 1 presents the average male suicide rate by age in each of the 21
developed countries for which all the data are available from 1965
through 1967. In nine of these, the rate rises monotonically with age;
in two it reaches a peak at the oldest age group although it does not rise
steadily; and in eight others it rises monotonically to a peak in the age
group 55-64. Only in two countries, Sweden and Poland, does the peak
occur before this point, and in the former the rate is essentially constant
after the peak. While the evidence presented in this table is by no means
conclusive, it does suggest that within a country there is a general tendency

⁸ This problem is vividly illustrated by the probably apocryphal quotation, cited by
Alvarez (1972, p. 87), from an Irish coroner who certified an accident death from
gunshot wounds: "Sure, he was only cleaning the muzzle of the gun with his tongue."





TABLE 1

Male Suicide Rates by Age (per 100,000 Population), 21 Countries,
1965-67 Average

                                          Age
Country               15-24   25-34   35-44   45-54   55-64   65-74
Australia              10.1    22.4    29.7    33.9    34.9    41.8
Austria                19.7    31.6    42.3    50.0    65.0    59.5
Belgium                 7.8    13.0    17.8    29.5    42.1    54.9
Bulgaria                8.3     7.9    10.9    19.1    25.2    40.2
Canada                 10.3    15.9    19.4    25.5    30.0    25.9
Czechoslovakia         26.0    35.9    43.6    50.2    52.7    67.3
Denmark                 9.7    22.5    32.6    45.9    49.7    40.2
England and Wales       6.4    10.8    14.5    17.7    23.9    26.8
Finland                15.5    35.2    53.2    62.2    68.6    68.9
France                  7.0    17.0    26.0    40.4    54.3    54.0
W. Germany             18.3    26.9    33.0    46.1    55.9    48.4
Hungary                30.5    44.1    56.0    65.2    70.7    76.4
Israel                  4.8    10.2    11.4    16.4    18.2    23.0
Japan                  14.7    21.0    15.5    20.9    35.2    52.1
Netherlands             4.1     7.5     8.4    13.9    18.8    24.5
New Zealand             6.7    11.0    19.4    29.0    29.9    28.3
Norway                  6.2    10.5    15.6    19.5    20.6    19.8
Poland                 13.0    22.6    29.2    30.4    28.5    24.4
Sweden                 13.8    28.3    40.8    51.0    50.5    47.4
Switzerland            19.0    25.1    34.5    46.2    49.3    53.0
United States           9.9    17.3    22.4    28.4    36.0    35.7

Source: World Health Organization, World Health Statistics Annual, 1965, 1966, 1967
(Geneva: WHO, 1968-70).


for suicide rates to rise with age. Insofar as people born earlier have both
lower incomes at any age and less income remaining, this evidence is
consistent with our theory of the effect of income on suicide.
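The pattern counts reported for table 1 can be reproduced with a short classification sketch. The rates below are the table 1 values with decimal points restored on the assumption that each entry carries one decimal place; the function name and category labels are hypothetical.

```python
from collections import Counter

# Male suicide rates per 100,000 for ages 15-24, 25-34, ..., 65-74 (table 1).
# Decimal placement is an assumption made when reading the printed table.
RATES = {
    "Australia": [10.1, 22.4, 29.7, 33.9, 34.9, 41.8],
    "Austria": [19.7, 31.6, 42.3, 50.0, 65.0, 59.5],
    "Belgium": [7.8, 13.0, 17.8, 29.5, 42.1, 54.9],
    "Bulgaria": [8.3, 7.9, 10.9, 19.1, 25.2, 40.2],
    "Canada": [10.3, 15.9, 19.4, 25.5, 30.0, 25.9],
    "Czechoslovakia": [26.0, 35.9, 43.6, 50.2, 52.7, 67.3],
    "Denmark": [9.7, 22.5, 32.6, 45.9, 49.7, 40.2],
    "England and Wales": [6.4, 10.8, 14.5, 17.7, 23.9, 26.8],
    "Finland": [15.5, 35.2, 53.2, 62.2, 68.6, 68.9],
    "France": [7.0, 17.0, 26.0, 40.4, 54.3, 54.0],
    "W. Germany": [18.3, 26.9, 33.0, 46.1, 55.9, 48.4],
    "Hungary": [30.5, 44.1, 56.0, 65.2, 70.7, 76.4],
    "Israel": [4.8, 10.2, 11.4, 16.4, 18.2, 23.0],
    "Japan": [14.7, 21.0, 15.5, 20.9, 35.2, 52.1],
    "Netherlands": [4.1, 7.5, 8.4, 13.9, 18.8, 24.5],
    "New Zealand": [6.7, 11.0, 19.4, 29.0, 29.9, 28.3],
    "Norway": [6.2, 10.5, 15.6, 19.5, 20.6, 19.8],
    "Poland": [13.0, 22.6, 29.2, 30.4, 28.5, 24.4],
    "Sweden": [13.8, 28.3, 40.8, 51.0, 50.5, 47.4],
    "Switzerland": [19.0, 25.1, 34.5, 46.2, 49.3, 53.0],
    "United States": [9.9, 17.3, 22.4, 28.4, 36.0, 35.7],
}


def classify(rates):
    """Assign an age profile to one of the four patterns described in the text."""
    rising = all(later > earlier for earlier, later in zip(rates, rates[1:]))
    peak = rates.index(max(rates))
    if rising:
        return "monotonic"
    if peak == len(rates) - 1:
        return "peak at oldest, not steady"
    rising_to_peak = all(
        later > earlier for earlier, later in zip(rates[:peak], rates[1:peak + 1])
    )
    if peak == len(rates) - 2 and rising_to_peak:
        return "monotonic to peak at 55-64"
    return "earlier peak"


PATTERNS = {country: classify(rates) for country, rates in RATES.items()}
COUNTS = Counter(PATTERNS.values())

if __name__ == "__main__":
    for pattern, count in COUNTS.items():
        print(f"{pattern}: {count}")
```

With the decimals placed this way, the counts match the text: nine, two, eight, and two countries, with Sweden and Poland in the last group.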


IV. A Time Series: Males, United States, 1947-67

Data are available on the suicide rate by 5-year age groups for 1947-67,
and the data on real incomes of persons exist for some closed 5-year and
10-year age intervals through age 64 in the United States.⁹ These data
form the basis for the empirical analysis in this section. The theoretical
model we presented in Section I provides us with all but two of the


⁹ The suicide data, numbers of suicides by 5-year age groups, are published in U.S.
Office of Vital Statistics, Vital Statistics (1967) and earlier issues. These were transformed
into rates per 100,000 population through the use of population data kindly made
available to us by the National Center for Health Statistics. Data on money incomes of
male persons by age for 1947-64 are available in Bureau of the Census, Trends in the
Income of Families and Persons in the United States, 1947-64 (Washington: Government
Printing Office, 1967). For 1965-67 the data were provided by Charles Beach of Queen's
University and are based on his calculations using published data. The money income
series are deflated by the Consumer Price Index (CPI). The CPI series and data on
unemployment rates by age group are from Bureau of Labor Statistics, Handbook of Labor
Statistics (1971), Bulletin 1705, pp. 120 and 253. All the income data used are deflated to
constant 1967 dollars.






variables to be used in the regression equations. In particular, for a
cohort age A at time t, the suicide rate S(A, t) will be a function of the
discounted real permanent income stream remaining at time t and the
age of the cohort. We also include a variable to measure transitory
deviations around expected income. When unemployment rises,
individuals' expectations of future incomes (and utilities) are revised
downward. Holding real income of the employed constant, an increased number
of people will believe future prospects to have diminished and will
commit suicide.

We construct discounted permanent income remaining, YPL(A, t),
by assuming that individuals project the average growth rate of incomes
by age group that occurred during 1947-67. We divide each 10-year age
interval in two and assign each 5-year interval the value Y(A, t), the
mean real income at age A at time t, for the 10-year interval to which it
belongs. Through a regression of this variable against time we construct
Ŷ(A, t), expected real income at age A at time t, which we project out
to the end of the economic lifetime of each group.

We then compute YPL(A, t) as:

$$YPL(A, t) = \hat{Y}(A, t) + \sum_{i=1}^{L} \frac{\hat{Y}(A + 5i,\; t + 5i)}{(1 + r)^{5i}} + \frac{\hat{Y}(N,\; t + N - A)}{(1 + r)^{N - A}},$$

where N is the mean age of males in the open interval 65+, and L =
(62.5 - A)/5. Age A is taken as the midpoint of the appropriate 5-year
age interval. The series is computed for r = .02, .06, .12, .20, and ∞.
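The construction of YPL(A, t) can be sketched as follows. The function and argument names are hypothetical, the projected-income function `y_hat` must be supplied by the user, and the default of 73.6 for N is the 1970 figure cited in the footnote; whether that exact value was used in the original computation is an assumption.

```python
def discounted_permanent_income(age, year, r, y_hat, mean_age_65_plus=73.6):
    """Discounted permanent income remaining for a cohort aged `age` in `year`.

    age: midpoint of a 5-year age interval (22.5, 27.5, ..., 62.5)
    r: annual discount rate
    y_hat: callable (age, year) -> projected real income (hypothetical input)
    mean_age_65_plus: N, the mean age of males in the open interval 65+
    """
    periods = round((62.5 - age) / 5)  # L: remaining 5-year steps up to 60-64
    total = y_hat(age, year)
    for i in range(1, periods + 1):
        total += y_hat(age + 5 * i, year + 5 * i) / (1.0 + r) ** (5 * i)
    years_to_n = mean_age_65_plus - age
    total += y_hat(mean_age_65_plus, year + years_to_n) / (1.0 + r) ** years_to_n
    return total


def flat_income(age, year):
    """Toy projection: one thousand-dollar unit at every age and date."""
    return 1.0


if __name__ == "__main__":
    for rate in (0.02, 0.06, 0.12, 0.20):
        print(rate, round(discounted_permanent_income(42.5, 1960, rate, flat_income), 3))
```

With a flat projection and r = 0, a cohort at age 52.5 receives the current term, two 5-year terms, and the terminal term, so the sum is 4 units; higher discount rates shrink every future term.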

For transitory income we use the proxy UN(A, t), the fraction of males
age A who are unemployed at time t. This term is included alone and in an
interaction with the age variable. There is evidence that periods of
unemployment are longer among older workers who are laid off, and
further evidence that older workers' expectations of finding new
employment are affected by this phenomenon.¹² We thus expect unemployment
to have an increasing effect on suicide as workers age.¹³


¹⁰ The actual value of N in 1970 was 73.6. (Computed from Bureau of the Census,
Census of Population [1970], General Population Characteristics, PC(1)-B1, table 50.)

¹¹ This method of computing permanent income remaining implicitly assumes that
individuals are uncertain about their incomes in the distant future and weight them
accordingly. The implication of this uncertainty is an optimal consumption path which is
shaped like a monotonic function of Ŷ as we have computed it (see Nagatani 1972 for a
derivation of this optimization rule).

¹² Wilcock and Franke (1963, p. 61) find that only 40 percent of workers below age 45
failed to find jobs within one year after their layoff; for workers 45 and over the figure
was 60 percent. One older worker claimed, "It's little better than shooting me to throw
me out of a job at 55" (p. 85).

¹³ An additional transitory term, Y(A, t) - Ŷ(A, t), was included in preliminary
work. It was dropped because the t-statistic of its coefficient never exceeded .2 in absolute
value.





Our basic estimating equation is:

$$S(A, t) = \alpha_0 + \alpha_1 YPL(A, t) + \alpha_2 YPL^2(A, t) + \alpha_3 A + \alpha_4 A^2 + \alpha_5 A^3 + \alpha_6 UN(A, t) + \alpha_7 UN(A, t) \cdot A + v_{A,t}, \tag{6}$$

where v_{A,t} is a disturbance term, and income is measured in
thousand-dollar units. The higher-order terms on A are included as a result of some
experiments made on the international data in table 1.¹⁴ We estimate
equation (6) over a pooled cross section of the time series 1947-67 for
each of the nine 5-year age intervals, 20-64. The open interval 65+ is
excluded because most males in the group are not economically active.
A modified version of this equation, denoted (6'), is also estimated
and differs from equation (6) only by the addition of the term α_8 t. This
modification enables us to test the hypothesis that the apparent effect of
increasing income over time on suicide rates merely reflects a trend in
the latter. Finally, we search over the YPL series computed for each
of the five discount rates and report results only for the value that
maximizes the likelihood function in this parameter.

Examination of the residuals based on ordinary least-squares estimates
of equations (6) and (6') showed that E(v_{A,t} v_{A,t-1}) ≠ 0 for each A. To
remove this source of inefficiency in the estimation, we devised a
two-round procedure that transformed the original variables by an estimate
of the serial correlation of the residuals within each age group. The
results of this procedure are presented in table 2 for r = .20, the value of
the discount rate that maximized the likelihood function for both
equations (6) and (6'). This fairly high discount rate suggests that incomes in
the near future are most important in affecting suicide. The R² presented
is based on the second-round estimates.¹⁶ Undoubtedly the most striking


¹⁴ In 10 of the 20 countries in table 1 for which suicide rates generally increase with
age, the suicide-age profile has a definite logistic shape. To reflect this phenomenon in
equation (6), and thus to distinguish better between K and YP in testing our theory, we
need the higher-order terms.

¹⁵ No standard technique for removing serial correlation in pooled cross-section and
time-series data is available. To circumvent this difficulty we employed the following
method. We postulate a variance-covariance matrix of the residuals composed of nine
21 x 21 blocks on the prime diagonal and blocks consisting entirely of zeros off the prime
diagonal. The first-round estimate ρ̂ is obtained, and each variable is transformed
with it for t = 1948, ..., 1967. The second-round estimates are based on the transformed
variables for the sample 1948-67.

¹⁶ There is no problem of nondiagonality of the variance-covariance matrix of the
residuals resulting from correlation among the residuals within each age group. Only 2
percent of the total unexplained variance in equations (6) and (6') is accounted for by
this correlation.







TABLE 2

Second-Round Estimates of Equations (6) and (6'), r = .20

Variable            (6)         (6')
Constant           26.54       26.24
                   (5.29)      (3.89)
YPL                -7.73       -7.78
                  (-5.82)     (-4.10)
YPL²                 .56         .57
                   (6.20)      (5.54)
A                  -2.08       -2.09
                  (-2.86)     (-2.69)
A²                  .065        .065
                   (3.73)      (3.56)
A³                -.0005      -.0005
                  (-3.62)     (-3.47)
UN                -70.16      -69.68
                  (-3.15)     (-3.12)
UN · A              3.40        3.38
                   (5.68)      (5.64)
t                    ...        .008
                                (.06)
R²                   .93         .93
ρ̂                   .517        .526

Note: t-values in parentheses here and in table 3.


TABLE 3

Mean Values of Derivatives at Different Ages, Based on Equation (6)

                          Age Group
Derivatives       20-24       40-44       60-64
∂S/∂UN             6.23       74.14      142.04
                   (.57)      (7.79)      (7.63)
∂S/∂Y              2.17        -.29       -1.11
                 (10.62)     (-1.62)     (-4.02)
∂²S/∂Y²            3.16        3.12        1.45
                  (6.20)      (6.20)      (6.20)


feature of these results is the insignificance of the trend variable, t, in
equation (6'); apparently the effects of income on the suicide rate
depicted in equation (6) are not the result of the correlation between
income and time. In both equations all the other variables are significant,
and those for which we postulated an expected sign on the coefficient
confirm our expectation. The estimates for the age terms suggest that in
the United States there is a logistic shape curve relating suicide rates to age.

In table 3 we present various derivatives based on equation (6) and 
computed at the means for each age group listed. (We assume the means 
have no stochastic element and treat them as constant in computing 
variances of these derivatives.) Raising the fraction unemployed by .03, 
roughly equal to the average increase for males in postwar recessions,
increases the suicide rate by amounts ranging from 0.19 per 100,000 for the
youngest age interval to 4.26 per 100,000 for the 60-64 age group. In










absolute numbers, we find that total male suicides per year in the age
groups 20-64 rise by approximately 950 in an average recession, given
the population distribution of the United States in 1960. Durkheim
(1951) and others have demonstrated the relation between suicide and
unemployment in periods characterized by violent swings in business
activity. Our results indicate that this relation also holds for the relatively
mild recessions of the postwar period and that it is stronger among higher
age groups.* ®
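
The recession arithmetic in this paragraph can be checked directly against tables 2 and 3. The following is a minimal sketch, assuming the ∂S/∂UN values of table 3 and the UN and UN·A coefficients of equation (6) in table 2; the function names are illustrative, and the group mean ages are backed out from the two tables rather than reported by the authors.

```python
# Coefficients of UN and UN*A in equation (6), as reported in table 2.
B_UN = -70.16
B_UN_A = 3.40

# Mean derivatives dS/dUN by age group, as reported in table 3.
DS_DUN = {"20-24": 6.23, "40-44": 74.14, "60-64": 142.04}

# Average rise in the fraction unemployed in a postwar recession (from the text).
RECESSION_RISE = 0.03


def suicide_rate_change(ds_dun, d_un=RECESSION_RISE):
    """First-order change in suicides per 100,000 for a rise d_un in UN."""
    return ds_dun * d_un


def implied_mean_age(ds_dun):
    """Age at which B_UN + B_UN_A * A equals the tabulated derivative.

    This inversion is an illustration only, not a figure from the paper.
    """
    return (ds_dun - B_UN) / B_UN_A


for group, deriv in DS_DUN.items():
    change = round(suicide_rate_change(deriv), 2)
    age = round(implied_mean_age(deriv), 1)
    print(group, change, age)
```

Under these assumptions the sketch returns 0.19 for the 20-24 group and 4.26 for the 60-64 group, the two figures quoted in the text, and the implied mean ages fall inside their respective age intervals.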

Table 3 also presents the derivatives involving income that we discussed
in Section I. The effect of a thousand-dollar increase in all real expected
incomes on the suicide rate is negative for the median and oldest age
groups, but positive for the 20-24-year-old group. We find that ∂S/∂Y = 0
at A = 35, so that we observe the predicted sign of this derivative in all
but the three youngest age groups. The predictions of our model on
∂²S/∂Y² are confirmed by the signs of this derivative.

As incomes have risen in the postwar period, the male suicide rate
has fallen for most age groups, especially for the older groups nearing
retirement age. Only for the youngest three age groups (20-34 years)
has the suicide rate risen with rising incomes, a result that is contrary to
our predictions. There is no simple economic explanation for this
phenomenon, but the expansion in the number of people pursuing education
into their twenties and the consequent postponement of consumption
by an increased fraction of the people in these groups could be one cause.
The decline in suicide rates for older people as their incomes have increased
may be especially strong because of the decrease in variability of income
resulting from the expansion of Social Security benefits. Indeed, there
was a sharp drop in the relative suicide rates of groups over age 55 in
the late 1930s coincident with the introduction of Social Security.


V. Cross-Section Evidence 

In this section we present first a summary of studies on the relationship of 
suicide rates to occupation and income based on individual data and 
then the results of estimating the cross-section analogue to equation (6).
Official published reports contain no information relating the suicide 


“ Our results imply an average peak-to-trough change in the male suicide rate for
ages 20-64 of 2.07. Interestingly enough, the weighted suicide rate for this group rose in
the Depression by 10.08, from a low of 28.59 in 1928 to a peak of 38.67 in 1932, while the
fraction unemployed rose by .192. Our estimates of equation (6) imply a rise of 13.27 in
this suicide rate in response to this change in the fraction unemployed, a result fairly
close to that which occurred.

Equations (6) and (6') were reestimated for each age group separately, without the
three age terms and for r = .20. In all nine groups the net effect of UN was positive;
furthermore, ∂S/∂Y, computed at the mean for each age group and based on the
coefficients for that group alone, was again positive for ages 20-34 and negative thereafter.





TABLE 4

Relative Suicide Rates (100 = Professional Rate) and
Family Incomes in 1959, by Occupation*

                 Cook County           New Orleans           Tulsa
                 (1959-63)†            (1954-59)‡            (1937-56)§

                            Total      White      Whites'    White      Whites'
Occupation       Males      Y          Males      Y          Males      Y

Professionals    100         9,445     100        8,456    ⎫ 100        8,522
Managers         107        10,487     174        8,669    ⎭
Clericals         88         7,086     100        6,020    ⎫  33        6,782
Sales            134         8,843     102        6,861    ⎭
Craftsmen        141         8,142     137        6,262       41        6,201
Operatives       157         6,887     224        5,527       58        5,359
Service          314         6,045     502        4,689       68        4,108
Laborers         342         6,026     481        4,276      110        4,316

* Computed from Census of Population (1960), PC(1)-15D, 20D, 38D, table 145, for the SMSA containing
the city.
† Computed from Maris (1967, p. 249).
‡ Computed from Breed (1963, p. 181).
§ Computed from Powell (1958, p. 135).

rate to income or occupation. Several sociologists have, however,
constructed detailed data by occupation for individuals within a narrow
jurisdiction, and the results of these studies are summarized in table 4 along
with census data on family incomes by occupation of the family head. In
each study the suicide rate for professional workers is used as a base of 100,
and the index for each occupation group is related to it. In all three
studies presented, the suicide rate is generally lower among the
occupation groups in which income is greater. As table 5 shows, the rank
correlations of the suicide index with income are negative, although the
small sample size makes it difficult to claim a high degree of significance
for them individually. The correlations are especially strong when
professionals and managers are deleted from the sample. The data
presented in table 4 do not hold religious practices constant; if there is
a negative correlation between the percentage Catholic and occupational
status within each narrow jurisdiction, the results presented in table 4
will be biased against observing the expected negative relation between
suicide and occupational status. While there are relatively few Catholics
in Tulsa, the fairly high percentage in New Orleans and Cook County
suggests that the true pattern of suicide rates declines even more sharply
with income. Age differences by occupation would impart a negative
bias to the correlations. The correlations of the suicide index and median
age by occupation in each city are all found to be nearly zero, suggesting
that differences in age by occupation are not producing spurious
correlations in table 5.
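
The rank correlations discussed here can be recomputed from the table 4 figures. The sketch below assumes that the statistic is Spearman's coefficient in its simple no-ties form (the text says only "rank correlations") and uses only subsets of table 4 that contain no tied values; the function names are illustrative.

```python
def ranks(values):
    """Rank values from 1 (smallest) upward; assumes there are no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


def spearman(x, y):
    """Spearman rank correlation, 1 - 6*sum(d^2) / (n*(n^2 - 1)); no ties."""
    n = len(x)
    d2 = sum((rx - ry) ** 2 for rx, ry in zip(ranks(x), ranks(y)))
    return 1 - 6 * d2 / (n * (n * n - 1))


# (suicide index, family income) from table 4, excluding professionals and
# managers; order: clericals, sales, craftsmen, operatives, service, laborers.
COOK_EXCL = ([88, 134, 141, 157, 314, 342],
             [7086, 8843, 8142, 6887, 6045, 6026])
NEW_ORLEANS_EXCL = ([100, 102, 137, 224, 502, 481],
                    [6020, 6861, 6262, 5527, 4689, 4276])
# Tulsa reports professionals with managers, and clericals with sales,
# as combined groups; the first entry of TULSA_ALL is the former.
TULSA_ALL = ([100, 33, 41, 58, 68, 110],
             [8522, 6782, 6201, 5359, 4108, 4316])
TULSA_EXCL = ([33, 41, 58, 68, 110],
              [6782, 6201, 5359, 4108, 4316])

for name, (index, income) in [
    ("Cook County, excl. prof./managers", COOK_EXCL),
    ("New Orleans, excl. prof./managers", NEW_ORLEANS_EXCL),
    ("Tulsa, all occupations", TULSA_ALL),
    ("Tulsa, excl. prof./managers", TULSA_EXCL),
]:
    print(name, round(spearman(index, income), 2))
```

With these inputs the sketch gives -.83, -.77, -.31, and -.90, which agree with the corresponding entries of table 5. The all-occupation samples for Cook County and New Orleans are left out of the sketch; the New Orleans sample contains tied index values, which the simple formula does not handle.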

Dublin (1963), using data from England and Wales, presents results 
similar to those that Powell (1958) has produced for Tulsa. Sainsbury 








TABLE 5

Rank Correlations between Suicide and Income Based on Table 4

                                          Cook County    New Orleans    Tulsa

All occupations                              -.69*          -.63*        -.31
Excluding professionals and managers         -.83*          -.77*        -.90*

* Significantly different from zero at the 10% level, one-sided test.


(1955) finds constant suicide rates except in the highest and lowest social
strata, where the rates are greater, but he, like Dublin (1963), fails to
adjust for age differences. Summarizing these studies, we can conclude
that, with the exception of professional workers and managers, a strong
negative relation between income and suicide rates seems to be present
in the data. The more general conclusion, that the suicide rate declines
monotonically as permanent income increases, is supported somewhat
less strongly by the available studies.

We turn now to the last test of our theory of suicide. Whereas the
units of observation in Section IV are various age groups at different
points in time, the units of observation here are age groups in different
states of the United States at the same point in time, 1960. We restrict
our analysis to the white male population and to all 38 states in the
contiguous United States for which data on income by age and race are
available. The age and income variables are constructed similarly
to those used in the time-series tests in Section IV. Five age groups—
20-24 and the four 10-year groups between 25 and 64—form the basis for
analysis. To avoid heteroskedasticity in the residuals, each observation
is weighted by the square root of the white male population for that age
and state.
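
The weighting step can be illustrated with a small regression. This is a minimal sketch under two assumptions that are not stated in the text: that the variance of a cell's suicide rate is inversely proportional to its population, and that a single regressor suffices for illustration. The data and the function name `weighted_fit` are hypothetical.

```python
import math


def weighted_fit(x, y, population):
    """Regress y on a constant and x after scaling each observation
    by sqrt(population). Returns (intercept, slope).

    Solves the 2x2 normal equations of the scaled problem directly.
    """
    w = [math.sqrt(p) for p in population]
    c = w                                    # scaled constant term
    xs = [wi * xi for wi, xi in zip(w, x)]   # scaled regressor
    ys = [wi * yi for wi, yi in zip(w, y)]   # scaled dependent variable

    s_cc = sum(a * a for a in c)
    s_cx = sum(a * b for a, b in zip(c, xs))
    s_xx = sum(b * b for b in xs)
    s_cy = sum(a * d for a, d in zip(c, ys))
    s_xy = sum(b * d for b, d in zip(xs, ys))

    det = s_cc * s_xx - s_cx * s_cx
    intercept = (s_xx * s_cy - s_cx * s_xy) / det
    slope = (s_cc * s_xy - s_cx * s_cy) / det
    return intercept, slope


# Hypothetical state-by-age cells (not data from the paper): income in
# thousands of dollars, suicide rate per 100,000, white male population.
income = [3.1, 4.0, 4.6, 5.2, 5.9]
rate = [27.0, 24.5, 23.8, 21.0, 20.4]
population = [40_000, 250_000, 90_000, 600_000, 150_000]

intercept, slope = weighted_fit(income, rate, population)
print(round(intercept, 2), round(slope, 2))
```

Scaling every variable, including the constant, by the square root of population gives large cells more influence on the fit; with equal populations the result reduces to ordinary least squares.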

Special problems in the cross-section test require the introduction of
two new variables.‘ ® The first difficulty is that, ceteris paribus, suicide rates
tend to be substantially higher in the Far West. The restless life often
associated with this area may be a cause of this phenomenon. This
consideration suggests the use of a dummy variable, assuming the value
of unity for California, Oregon, and Washington. The second is that the
distribution of Roman Catholics in the population is not random across
states. Rather, Catholics tend to be concentrated in the heavily urban,
industrial states where incomes are relatively high. Given the lower
incidence of suicide among Catholics, a failure to control for their
proportion in a state’s population would bias the coefficient of income
negatively. We might then spuriously confirm the implications of our

'• The states excluded are Idaho, Maine, Montana, Nevada, New Hampshire, North
Dakota, Rhode Island, Utah, Vermont, and Wyoming.

‘ * These problems can be interpreted in the context of our theory as resulting from
differences in the distributions of b across different cultures or societies.





theory for the effects of income on the suicide rate. To correct for this
possibility, we experimented with several possible indices of the
proportion of a state’s population who are Catholics, including the fraction
of foreign stock and the fraction with Spanish surname. The variable best
justified theoretically and used here is CATH, the percentage of total
elementary and secondary enrollment in Catholic parochial schools in
each state. This proxy may be better than a measure of the percentage
Catholic population, if, as in Ryder and Westoff’s study of fertility
(1971), Catholic education is the best indicator of the degree to which
behavior conforms to Church dogma.
Table 6 presents the estimates of

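\[
S(A,j) = \beta_0 + \beta_1\,YPL(A,j) + \beta_2\,[YPL(A,j)]^2 + \beta_3 A + \beta_4 A^2 + \beta_5 A^3 + \beta_6\,CATH + \beta_7\,WEST + \varepsilon, \tag{7}
\]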

where S(A, j) is the suicide rate by age and state, and the YPL and A
terms are as used in equations (6) and (6'). Equation (7') differs only
in that the observations for the far western states and the dummy variable
WEST have been deleted. Table 7 presents estimates of the derivatives
∂S/∂Y, based on equation (7), computed as in table 3.

The first thing to note about these tables is that, as in the time-series
tests, the predictions of our theory on the age profile of suicide rates and
the effect of income on suicide rates are borne out by the data. The sign
patterns of ∂S/∂Y by age in the cross-section and time-series tests are
remarkably alike—negative for all but the youngest age groups. The
only difference is for the 25-34 group, where the derivative is positive
in the time series but negative here. The positive value of ∂²S/∂Y² here is
contrary to our prediction in Section I. Given that the search procedure
maximizes at r = ∞ and given the age-YPL profile resulting from this,
the signs on YPL and YPL² are the only ones consistent with the pattern
Numbers of white male suicides by state for 1960 are published in U.S. Office of
Vital Statistics, Vital Statistics (1960), table 9-6. These were transformed into rates per
100,000 population using Census of Population (1960), data published in PC(1)-1B, table
59. Income data by age group for each state are published in PC(1)-D, table 134. Data
on Catholic elementary enrollment in 1961-62 are from Office of Education, Circular no.
753, p. 34; those on secondary enrollment, available only for 1960-61, are from Circular
no. 707, p. 27; and public school enrollment for 1961-62 is from Circular no. 757, p. 47.

Initial estimates, using the same five values of r as in Section IV, showed that the
likelihood function reaches a maximum at r = ∞. We then searched over a finer grid of
r, i.e., .35, .50, .75, and 1.00, and found r = ∞ still to be the maximizing value, although
the likelihood function is very flat for r ≥ .20. No unemployment term is included in the
estimates of equation (7) that we present because of our lack of confidence in the state
unemployment data. Moreover, it is not clear that differences in unemployment rates
across states at a point in time represent transitory components of income. When the
measured unemployed rate, prorated by the national unemployment rate for the age
group relative to the total national rate, was added to equation (7), its coefficient had a
t-value of -.6 in equation (7); its addition produces only minor changes in the other
coefficients.





TABLE 6

Estimates of Equations (7) and (7'), r = ∞

Variable        (7): 38 States       (7'): 35 States

Constant        46.45   (2.01)       43.98   (1.88)
YPL             10.97   (3.35)       12.84   (3.69)
YPL²            -1.33   (-3.86)      -1.61   (-4.31)
A               -4.80   (-2.42)      -4.88   (-2.43)
A²                .14   (3.02)         .14   (3.06)
A³              -.001   (-3.16)      -.001   (-3.26)
CATH             -.43   (-5.75)       -.39   (-5.24)
WEST             7.96   (5.18)         ...
R²               .893                 .867

Note.—t-values in parentheses here and in table 7.




TABLE 7

Mean Values of ∂S/∂Y in the Five Age Groups

Group     20-24      25-34      35-44      45-54      55-64

(7)        3.89      -2.02      -3.61      -2.51       -.33
          (2.50)    (-2.47)    (-3.64)    (-2.95)     (-.40)
(7')       4.31      -2.68      -4.53      -3.21       -.57
          (2.65)    (-3.15)    (-4.33)    (-3.61)     (-.66)


of ∂S/∂Y by age observed both here and in the time series. That the
likelihood function is maximized at r = ∞ in the cross section is consistent
with the observation that income differences across states are permanent.
Therefore, current incomes are a good proxy for lifetime incomes here
and may well perform better.

The WEST dummy is positive and significant; its coefficient indicates
that, once we adjust for income, age, and Catholic school enrollment, the
suicide rate for males 20-64 averages eight more per 100,000 in the Far
West. The proxy variable for the extent of Catholicism, CATH, has the
expected negative coefficient. The percentage Catholic enrollment varies
from 1 in North Carolina to 26 in Wisconsin; it implies a difference in
the adult male suicide rate of 10 to 11 per 100,000 population, if other
factors are held constant. Since the average rate in our sample is only
25 per 100,000, the effect of Catholicism is quite substantial.
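
The size of the Catholicism effect follows from the CATH coefficient and the enrollment range quoted above. A minimal sketch of that arithmetic, assuming the CATH coefficients reported in table 6 for equations (7) and (7'); the names used are illustrative.

```python
# CATH coefficients from table 6: suicides per 100,000 per percentage point
# of enrollment in Catholic parochial schools.
CATH_COEF = {"(7)": -0.43, "(7')": -0.39}

# Range of Catholic school enrollment quoted in the text, in percent.
CATH_LOW, CATH_HIGH = 1.0, 26.0

# Average suicide rate in the sample, per 100,000 (from the text).
MEAN_RATE = 25.0


def implied_gap(coef, low=CATH_LOW, high=CATH_HIGH):
    """Suicide-rate difference between the highest- and lowest-CATH states."""
    return coef * (high - low)


for equation, coef in CATH_COEF.items():
    gap = implied_gap(coef)
    print(equation, round(gap, 2), round(abs(gap) / MEAN_RATE, 2))
```

For equation (7) the gap is about 10.75 per 100,000, inside the 10-to-11 range given in the text and roughly 43 percent of the sample mean rate of 25.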

** We are indebted to Jacob Mincer for this observation. The flatness of the likelihood
function in the cross section for r ≥ .20 suggests that there is little difference across states
among these measures.








VI. Conclusion

In no sense do we claim that the individual agony resulting in suicide
stems from what is solely an economic calculation; the majority of
suicides can perhaps be explained on noneconomic grounds. Nonetheless,
as Fedden (1938, p. 54) pointed out, poverty can be an important cause
of suicide in developed societies where values are to a large extent based
on the possession of material goods. Given the materialistic basis of
modern society, it is reasonable to expect that variations in the suicide
rate will be related to economic variables in ways predictable by economic
theory.

Although our main purpose has been to demonstrate the value of
applying economic theory to the problem of suicide, we have also
discovered some interesting and apparently novel empirical results. Within
the United States we have found that the mild cyclical decreases in
economic activity since 1945 have produced the same proportionate
increases in suicide rates that resulted from the stronger cyclical
fluctuations before 1945. The suicide behavior of older people is significantly
more sensitive to variations in unemployment than is that of younger
people. Suicide rates decrease with increased incomes, both in the cross
section and the time series, for all but the youngest age groups. Contrary
to widely held beliefs (Shneidman and Farberow 1957, pp. 60-67)
based on studies using inappropriate data or inadequate analysis and on
what we call the “Richard Cory myth,” our results imply that suicide
rates are generally lower among higher-income groups.

Our results suggest the value of using economic theory to analyze
social phenomena such as suicide. Like other theories, ours has successfully
predicted the relation of suicide to age; however, ours also entails
predictions about both the response of suicide to increased income and
the change in that response as income increases that seem to be
confirmed, with some exceptions, by the data. It is more convincing when
one remembers the trend toward decreased underreporting of suicides
that should bias our time-series results against confirmation of our theory.
We believe the study demonstrates the need for work using data on
individuals to separate out economic influences on patterns of suicide
from noneconomic ones.


References

Alvarez, A. The Savage God. New York: Random House, 1972.

Becker, G. “An Economic Analysis of Fertility.” In National Bureau of Economic
Research, Demographic and Economic Change in Developed Countries. Princeton,
N.J.: Princeton Univ. Press, 1960.

Breed, W. “Occupational Mobility and Suicide among White Males.” American
Soc. Rev. 28, no. 2 (April 1963): 179-88.




Douglas, J. The Social Meanings of Suicide. Princeton, N.J.: Princeton Univ. Press,
1967.

Dublin, L. Suicide: A Sociological and Statistical Study. New York: Ronald, 1963.

Durkheim, E. Suicide. Translated by J. Spaulding and G. Simpson. 1897.
Reprinted Glencoe, Ill.: Free Press, 1951.

Fedden, H. R. Suicide: A Social and Historical Study. London: Peter Davies, 1938.

Gibbs, J. P., and Martin, W. T. Status Integration and Suicide. Eugene: Univ.
Oregon Press, 1964.

Gregg, D. W. Life and Health Insurance Handbook. Homewood, Ill.: Irwin, 1959.

Grossman, M. “On the Concept of Health Capital and the Demand for Health.”
J.P.E. 80, no. 2 (March-April 1972): 223-55.

Hansen, W. L. “A Note on the Cost of Children's Mortality.” J.P.E. 65, no. 3
(June 1957): 257-62.

Henry, A. F., and Short, J. F. Suicide and Homicide. Glencoe, Ill.: Free Press, 1954.

Holtmann, A., and Ridker, R. “Burial Costs and Premature Death.” J.P.E. 73,
no. 3 (June 1965): 284-86.

Lendrum, F. C. “A Thousand Cases of Attempted Suicide.” American J. Psychiatry
13, no. 3 (November 1933): 479-500.

Maris, R. “Suicide, Status and Mobility in Chicago.” Soc. Forces 46, no. 2
(December 1967): 247-56.

Nagatani, K. “Life Cycle Saving: Theory and Fact.” A.E.R. 62, no. 3 (June
1972): 344-53.

Powell, E. “Occupation, Status and Suicide.” American Soc. Rev. 23, no. 2 (April
1958): 131-39.

Ryder, N., and Westoff, C. Reproduction in the United States, 1965. Princeton, N.J.:
Princeton Univ. Press, 1971.

Sainsbury, P. Suicide in London. London: Chapman & Hall, 1955.

Schultz, T. P. “An Economic Model of Family Planning and Fertility.” J.P.E. 77,
no. 2 (March-April 1969): 153-80.

Shneidman, E. S., and Farberow, N. Clues to Suicide. New York: McGraw-Hill,
1957.

---. The Cry for Help. New York: McGraw-Hill, 1965.

Silver, M. “Births, Marriages, and Business Cycles in the United States.” J.P.E.
73, no. 3 (June 1965): 237-55.

Wilcock, R., and Franke, W. Unwanted Workers. Glencoe, Ill.: Free Press, 1963.



John Maynard Keynes: Scientist or 
Politician? 


Elizabeth Johnson 

Royal Economic Society


Keynes thought of himself as a social scientist, an expert adviser to
government on economic problems, yet he possessed—and used—all the
talents of a natural born politician. His upbringing and position in
society somewhat circumscribed his thinking, but when he was unable
to solve the problem of mass unemployment with the orthodox tools at
hand, his scientific integrity and his flexibility of mind impelled him to
make the imaginative leap forward of the General Theory. Despite the
limitations of his time and place, he was truly a scientific political
economist.


John Maynard Keynes—scientist or politician? The reader of the popular
press of a generation ago would have had no doubt of the answer. Keynes,
a swinging weather vane of a man, was the most unscientific of individuals
—a cartoonist’s dream. He was Keynes the india-rubber man: the Daily
News and Chronicle of March 16, 1931, carried an article headed “Economic
Acrobatics of Mr. Keynes” and illustrated it by a sketch of “A Remarkable
Performance. Mr. John Maynard Keynes as the ‘boneless man’ turns his
back on himself and swallows a draught”—the draught, a glass marked
“15 per cent Protection.” After years of preaching the virtues of free trade,
he had first announced the end of laissez faire and now urged a revenue
tariff on the country. As an exasperated political opponent remarked on
another occasion—complaining of the man who in 1925 said wage costs
were too high and in 1929 wanted higher prices: “It is difficult to reconcile

This paper was read at the annual meetings of the British Association for the
Advancement of Science, Section F, at Leicester, September 5, 1972.



Mr. Keynes the politician with Professor Keynes the economist. He seems
to be both right and wrong!”*

Keynes himself had no such difficulty. In his own opinion, he was
always right. He had a clear idea of his own role in the world; he was the
economist—at first Cassandra, croaking prophecies of doom about the
economic consequences of reparations and the gold standard, prophecies
which came all too true—and then as he gathered stature, the chief
economic adviser to the world, to the chancellor of the exchequer of the
day, to the French minister of finance whoever he was, to the president
of the United States. To elaborate, Keynes the economist initially thought
of himself as the educator, the persuader, the man who would assemble
all the relevant information and thereby start the reverberation of public
opinion that would echo back to the politicians who, he said, “have ears
but no eyes” (Keynes 1922^. Hereafter all references to Keynes’s writings
are by earliest publication date). Then as he became established as an
expert, he came to think of himself more as the economic scientist, the
technician, the mechanic who is called in to fix the machine when the
self-starter is broken. He looked forward to the time when economists
would be consulted like dentists and hailed President Roosevelt as the
first head of state to take theoretical advice as the basis for large-scale
action.

“For the next 25 years, in my belief,” he wrote in 1932, “economists, at
present the most incompetent, will be nevertheless the most important,
group of scientists in the world. And it is to be hoped—if they are
successful—that after that they will never be important again” (1932a).

Keynes had a generally low opinion of politicians as charlatans who
manipulated the public with their propaganda and obstinately clung to
the accepted shibboleths until the winds of change forced them to tack.
He knew himself to be an intellectual and a scientist, but he was a very
political economist, addressing himself to the big problems of his time. As
the high tide of the nineteenth century ebbed away and the waves of the
twentieth came rolling in, as the world struggled out from the aftermath
of the 1914-18 war with its old antagonisms and old sovereignties, the
new hopes of peace and progress were bogged down by old debts and old
habits. Poverty stood in the midst of what should have been plenty, and in
Britain, less hard hit than Germany or the United States, one quarter
of the working population was unemployed. How was the world to get
out of this mess? In Britain, what could be left to the individual and what

^ The remark was made by Sir Laming Worthington-Evans, Conservative secretary of
state for war, in a letter to the Evening Standard, May 6, 1929, part of a published exchange
with Keynes at the time of the 1929 election campaign. During this election Keynes not
only supported but took the lead in originating the Lloyd George program to remedy




must be done by the state? No longer would “private ambition and
compound interest . . . between them carry us to paradise” (1927);
the system could no longer be trusted to correct itself. The problems were
political problems and for Keynes they were intellectual ones.

For all that, he was a natural politician. He inherited the Liberal
politics of his parents and at Cambridge was a Liberal president of the
union. During the early part of his life, he often appeared on the platform
to speak for Liberal candidates. He was asked to stand for election himself
many times—by all three parties—but he preferred the more powerful
background role of expert counselor and adviser. Yet he had, in fact, so
many of the traits of the politician and they seemed to come so naturally
and were so much a part of his personality that it is hard to think that
they could not have influenced his advice.

He was an opportunist who reacted to events immediately and directly,
and his reaction was to produce an answer, to write a memorandum,
and to publish at once. He was into everything, be it the German mark or
the French franc, birth control, the Lancashire cotton trade, buying
British, economic sanctions, compulsory savings for a joyful hereafter.
In the World War II treasury, he nearly drove some of his colleagues
crazy with his propensity to keep a finger in every pie. “Don’t just stand
there, do something,” would have been his present-day motto. Discussing
unemployment in 1930, he said: “If we just sit tight there will be still more
than a million men unemployed six months or a year hence. That is why I
feel that a radical policy of some kind is worth trying, even if there are
risks about it” (1930/"). Both Conservative and Labour governments—in
the “fatalistic belief that there never can be more employment than there
is,” as Keynes said (1929a)—sat tight over the twenties and thirties,
instructed by the civil servants of the treasury school whom he once
characterized as “trained by tradition and experience and native skill
to every form of intelligent obstruction” (1939).

Keynes instead was ready with ideas—ideas of his own, and the current
ideas of others that he made his own—and ready with recipes for trying
them. They were practical, inventive solutions—such as his proposal to
use legislation, originally framed to permit the government to stockpile
war materials, for the additional purpose of minimizing price fluctuations.
Characteristically, the proposal might be presented as a three-, four-,
five-, or more point agenda. He explored alternatives, giving what he
considered the preferred order of adoption. Often he tied up a package
deal; he was always happy if he could pick off two birds with one stone
and get in a little social benefit while solving an economic problem. He was
always ready to commit himself to definite figures—unkind persons said
that he made them up. Other people say that he was very good at making
them up, and that he had a sense, a feel, for what the right figure ought to





be. In any case, he cannot be blamed entirely for making up figures, for
well into the 1920s there was a great scarcity of economically relevant 
statistics; it was Keynes who laid the foundations for providing this 
branch of economic information in Britain.^ 

In doling out his economic prescriptions Keynes’s style was to make a
direct appeal to action—to governments, to heads of state (as in his
“Agenda for President Roosevelt”) or to individual citizens. Often it was
an appeal for internationally concerted action. At the planning stage of
the World Economic Conference of 1933, he urged all governments
simultaneously to adopt programs of public spending, supported by gold
certificates issued by the Bank for International Settlements, to restore
world prosperity. “What is the charm to awaken the Sleeping Beauty, to
scale the mountain of glass without slipping? If every Treasury were to
discover in its vaults a large cache of gold . . . would that not work the
charm? Why should not that cache be devised? We have long printed
gold nationally. Why should we not print it internationally? No reason
in the world, unless our hands are palsied and our wits dull” (1932c).

This call to action is typically phrased. As Keynes himself said: “Words
ought to be a little wild—for they are the assault of thoughts upon the
unthinking” (19331i). Talking on the radio about unemployment in 1931,
he hazarded the guess that whenever you saved five shillings you put a
man out of work for a day; on the other hand, whenever you bought
goods you increased employment. “Therefore, Oh patriotic housewives,”
he paeoned, “sally out tomorrow early into the streets and go to the
wonderful sales which are everywhere advertised. You will do yourselves
good—for never were things so cheap, cheap beyond your dreams.
Lay in a stock of household linen, of sheets and blankets to satisfy your
needs. And have the added joy that you are increasing employment, adding
to the wealth of the country because you are setting on foot useful
activities, bringing a chance and a hope to Lancashire, Yorkshire, and
Belfast” (1931c).

I have not consulted the contemporary newspapers to learn whether
this appeal showed up in the department stores’ takings in the January
sales, but one result is recorded—the cartoonists had a field day depicting
the middle-class “little woman” sallying forth on a spending spree,
loading up hubby with parcels and saddling it all on Mr. Keynes. Were
these words a little too wild? In a radio discussion two years later, Keynes
was careful to explain the difference between hoarding and useful saving
(1933a).

*For example, in editing the Manchester Guardian Commercial “Reconstruction in
Europe” supplements in 1922, Keynes devoted several pages in each issue to following
current trade fluctuations, making use of the London School of Economics and Harvard
“Business Barometers.” In the 1940s the establishment of the Department of Applied
Economics at Cambridge owed much to his influence.




But I should give you the rest of the quotation about words. Keynes 
finished it by adding, “When the seats of power and authority have been
attained, there should be no more poetic license.” 

He liked to call for timely action—now is the time to buy sheets, to appoint
a board, to settle the world’s currency system—and he had his favorite
words. He was fond of using the phrases “the prospects for” and “the
progress of” and the word “consequences” in his titles. He was always
ready to present his “drastic remedy” or his “radical plan” or to approve a
“bold measure”—he fancied himself as the enfant terrible. He was
optimistic—where others saw the beginning of a long industrial decline,
he felt the country was “in the middle of a painful adjustment” (1930i).
Addressing international delegates to a meeting of the National Council
of Women (his mother was president) in June 1930, he explained that
England was suffering “from a sort of malaise of wealth”—saving money
faster than she was spending it, economizing on the use of labor faster than
finding outlets for it, raising the standard of living a little too fast—and he
described these phenomena as “the growing pains of progress, not the
rheumatism of old age” (1930r).

On this occasion, he talked of England’s social achievements, in which
he took pride. He had a strong vein of patriotism, in spite of his
internationalism during the twenties. He was perpetually fussing about the
possibility that British lending abroad was diverting funds from
investment in home industry. (Was he, subconsciously, like the colonel in
“Dr. Strangelove,” worrying about the sapping of “precious bodily
fluids”?) He did not want to see his country become a rentier nation and
miss out on the action; he had his own ideas about how Britain should
honor her debts before devaluation, and he looked to her to take the
initiative and leadership in all his international proposals.

He enthusiastically supported any leader whom he thought could make
the running. After the terrible things he had said about Lloyd George,
that goat-footed Welsh Witch, in connection with the Treaty of Versailles,
he supported him vigorously in the election of 1929 and wrote the
pamphlet that became the textbook of the Liberal campaign (Keynes and
Henderson 1929). When taxed with what he had said about Lloyd
George’s conduct of the coupon election, he replied: “I oppose Mr.
Lloyd George when he is wrong, I support him when he is right.” It was
the same with Winston Churchill. And in December 1930 Keynes
commended the enterprising spirit of Sir Oswald Mosley in putting forward
a national economic plan (1930e), but he did not write about Mosley
after that.

He was flexible. Having, after much thought, deserted free trade to
recommend a revenue tariff with all his might, the moment that England
left the gold standard and made such a tariff economically unnecessary,
he dashed off a letter to the London Times calling attention to the now
primary importance of devising a sound international currency system;
the discussion of domestic protection should wait until later, he said
(1931a). And, as the rest of the country breathed a half-sigh of relief after
painfully climbing out of the slump, he looked ahead in two Times articles
to ask how to avoid a future occurrence of such a situation (1937).

Flexibility—or inconsistency? “Inconsistency” was the word that he
was branded with. When he was attacked for coming over from free trade
to a revenue tariff, he lampooned his critics:

“I seem to see the elder parrots sitting round and saying, ‘You can rely
upon us.’ Every day for 30 years, regardless of the weather, we have said,
‘What a lovely morning!’ But this is a bad bird. He says one thing one
day and something else the next” (1931b).

Even Keynes’s critics had to admit that he never tried to pretend when
he changed his mind. And he usually explained just why he had done so.
But he was not always so flexible. When he was still in his thirties, he was
youthfully uncompromising, insisting that the Treaty of Versailles must
be revised, unwilling to live with it and accept it as a political necessity.
As time went on, he became more amenable to compromise. “My own
view is that I want as much as I can get,” he wrote of a League of Nations
proposal in 1930, “but I do not want to wreck the whole project by asking
for more than I can get” (1930a). In the course of time, he even came to
tone down—publicly, that is—some of his caustic language. Writing to
the literary editor of the Daily Mail about some small alterations that he
wished to make in an article on England’s war debts to America, “chiefly
with the object of avoiding strong language,” he replaced the phrase,
“when one reads the rubbish reported from Congressmen, much of it
altogether beneath the intelligence and dignity of human nature,” by the
much milder observation, “when one reads what Congressmen say to
reporters” (1932b).

Between the two wars he was very visible, energetic, and vigorous on
the surface of English political life. Articles, pamphlets, and books
appeared perfectly timed for the opening of a conference, the preparation
of the Budget—or perhaps the preparation of the public for the inevitable
economic consequences of some past decision or event. The political
pamphlets were priced at 6d and 1/-; The General Theory sold for 3/-.
Margot Asquith objected to the publication of the article on the American
debts in the Daily Mail: “You sd have sent it to the Times, as those who
read the Daily Mail are mostly in the Servants Hall (I never take it in)”
(Asquith 1933). He was careful to send his articles, punctiliously presented,
to the London Times, the Manchester Guardian, the Nation, and the New
Statesman; at the same time, he never missed an opportunity to publish
the same material in more swashbuckling form in the Daily Mail, the
Daily Express, and the Evening Standard.

But as with every real politician, nine-tenths of the iceberg was invisible.
Only the small group of those in the know could have been aware of his
leadership in 1920 in organizing an appeal to the League of Nations for an
international loan and also the first international conference of economic
experts in Brussels—civil servants, not politicians; his part in the thinking
and research which went into the Yellow Book that reviewed and restated
the objects and aspirations of the ailing Liberal party; the questioning
that led to his adoption of the revenue tariff policy in the behind-doors
Economic Advisory Committee, months before he expressed this change
of heart in public; his crucial dominance of the Macmillan Committee
of Inquiry into Finance and Industry. And just like every practicing
politician, he maintained contact with an extensive network of influential
friends and acquaintances to whom he could go for help if necessary,
because they would do the same with him. His correspondence files are
almost a Who's Who for an era—not only in the fields of economics and
politics, but also in society, literature, and art. The day that he surfaced
and was observed lunching alone with Ramsay MacDonald, the stock
market rose.

All these characteristics show Keynes as a natural born and highly
efficient, ever alert politician. With his Eton Pop, Cambridge Union
background, he was born to the purple, if this was what he wanted.
Yet he insisted that politics was not his role, that he was more valuable
in his chosen capacity as adviser. It was a case of emotion versus intellect;
some of his contemporaries thought of him as an emotionless, coldly
logical machine. Writing of Lloyd George’s political craft with
introspection into his own personality clearly in mind, he remarked: “A
preference for truth or for sincerity as a method may be a prejudice based on
some aesthetic or personal standard, inconsistent, in politics, with
practical good” (1922c). In middle age, he was able to write, “I still
suffer incurably from attributing an unreal rationality to other people’s
feelings and behaviour (and doubtless to my own, too)” (1949). His
radical approach—“The Republic of my imagination lies on the extreme
left of celestial space” (1926)—seemed to destine him for the Labour
rather than the Liberal camp, but he shied away from Labour dogmatism
and anti-intellectualism. To proclaim any dogma as infallible and
applicable in all cases, he said, debating the merits of the two parties, was
“voluntarily to shut oneself out from any scientific approach to economic
problems by means of experiment and investigation” (1928). Also, the
Labour party was a class party, Keynes said, “and the class is not my
class . . . the class war will find me on the side of the educated bourgeoisie”
(1925).

In considering Keynes, one can never forget his social background. The
son of cultivated, thrifty, donnish parents—nonconformist in outlook
though apparently not overtly religious—brought up at the close of the
nineteenth century in comfortable middle-class Cambridge, polished and
finished at Eton and King’s, and sophisticated by Bloomsbury and the
high-ranking civil service, making and losing a fortune and reestablishing
himself again before he was 40—these were personal experiences that made
him what he was. The man had a confidence, springing partly from his
parents’ established place in Cambridge society but more powerfully
from the consciousness of his own intellect which won him an even more
exalted place in the outside world before he was 35.

As the son of his philosopher father and socially conscious mother, he
sought in politics a party that in the changing conditions of the time would
create a society both economically just and economically efficient, while
still preserving individual liberty (1927). He stayed with the Liberals, the
party of his parents, by standing intellectually outside of it, despite his
campaigning and committee work and participation in summer schools.
From his parents he inherited the late Victorian nonconformist belief
in the necessity and possibility of the improvement of society by the
application of reason and the sense of obligation to one’s social inferiors
that went with it. (The same attitudes appear in the Fabianism of the
Webbs, but Keynes seems to have regarded the Webbs as a little naive.)

Large-scale unemployment was the basic problem that Keynes came to
focus on. His received view of the world was of a society in which each
man had his appointed place, and it was an injustice for him not to be
allowed to feed his family and retain his self-respect in fulfilling the task
ordained for him. Economic theory then taught the belief that if there were
not enough jobs, the laboring force would divide up the supply of work,
driving down wages in the process—a hardship to the worker and no
challenge to the employer to make himself more efficient, Keynes said.
His solution, when he wrote “The Question of High Wages” in 1930
(1930d), was to increase the workers’ real wages by providing social
services. He told a story about the little girl who, asked if the poor should
be made like the rich, replied, “No, it would spoil their characters”
(1929b). The story was offered tongue-in-cheek, but does one detect the
social worker’s instincts of his mother?

An achievement that gave Keynes great satisfaction was his wartime
scheme for compulsory savings or, as he later chose to call it, “deferred
pay.” He considered that he had made it “outrageously attractive” to
the working class (1940c), although the working class seems to have
been singularly unenthusiastic about it. He regarded it, characteristically,
not just as an expedient for financing the war, but as an opportunity to
demonstrate the difference between the totalitarian and the free economy.
“For if the community’s aggregate rate of spending can be regulated,
the way in which personal incomes are spent and the means by which
demand is satisfied can be safely left free and individual . . . the only
way to avoid the destruction of choice and initiative, whether by
consumers or by producers, through the complex tyranny of all-round
rationing. This is the one kind of compulsion of which the effect is to
enlarge liberty. Those who, entangled in the old unserviceable maxims,
fail to see this further-reaching objective have not grasped, to speak
American, the big idea” (1940b).

Toward the end of How to Pay for the War, he sums up the scheme as
“the perfect opportunity for social action where everyone can be protected
by making a certain rule of behavior universal” (1940a). It is economically
just, it is economically efficient—and the smell is undeniably paternalistic.
Keynes grappled with the problems of the twentieth century, but he was
born in a big house in Harvey Road, Cambridge, in 1883, and he never
really extricated himself from the view of society, and his own position
therein, in which his parents had reared him.

Unemployment was a problem that according to orthodox theory
should not exist, but it was the problem that would not go away. He
grappled with it with a moral indignation and persistency that conjures
up the spirit of another great grappler, John Bunyan, the subject of a
biography by Keynes’s maternal grandfather, the nonconformist minister,
John Brown. “Is not the mere existence of general unemployment for
any length of time an absurdity, a confession of failure, and a hopeless
and inexcusable breakdown of the economic machine?” Keynes demanded
in a radio dialogue with Sir Josiah Stamp. Stamp, so addressed, observed:
“Your language is rather violent. You would not expect to put an
earthquake tidy in a few minutes, would you?” (1930f).

It took the years from 1923 to 1936 for Keynes to tidy up the theory of
the earthquake. In an address on the occasion of the centenary of the
death of Malthus, Keynes quoted Malthus himself on the relation of
experience to theory, distinguishing between that partial or confined
experience that a man gains “from the management of his own little
farm, or the details of the workhouse in his neighbourhood”—which is
“no foundation whatever for a just theory”—and “that general experience,
on which alone a just theory can be founded.” Keynes claimed for Malthus
“an unusual combination of keeping an open mind to the shifting picture
of experience and of constantly applying to its interpretation the principles
of formal thought” (1935). Here, in his emphasis on constant referral back
to the facts, he stated his own ideal of how a social scientist ought to work.

As I have documented earlier in this paper, Keynes behaved from day
to day in public like a working politician. In private, he was deeply and
seriously concerned with the science of economics. He kept coming back
to the central problem that the existing theory would not explain. At
first, he was content to dazzle by demonstrating that a fuller knowledge
of economic theory and statistics than his professional colleagues possessed
would support different conclusions than their understanding of the
economic orthodoxy had led them to. But eventually, his intellectual
honesty and his concern for economic science brought him to believe that
it was not the incompetence of the orthodox economists that was at fault,
but the received theory itself. Modifications within the framework of
orthodox economics were not enough; a frontal attack on the framework
itself was required.

So Keynes produced the General Theory as a proof—by the standards
of the prevailing economic orthodoxy itself—that, contrary to orthodoxy,
the normal state of economic society was not full employment, but general
unemployment. As a corollary, government policies to raise the level of
employment were not mistaken and arbitrary interferences with a
well-functioning and efficient economic machine; instead, government
interference was absolutely necessary in order for the machine to work at all.
Thus, Keynes behaved as a scientist in the crucial sense: having found the
existing body of scientific knowledge in economics increasingly
unsatisfactory as a tool for solving the problems that he considered important, he
produced a new and rival theory that would explain the discrepancies
between the orthodoxy and the facts of observation—discrepancies which
formerly had to be explained as special cases.

Recalling Malthus on partial and general experience, how much did
Keynes’s enthusiasms, his inventiveness, blind his science? One of his
early reactions to unemployment was to put it down to overpopulation
(1922a; 1923); a critic accused him of letting his advocacy of birth control
affect his conclusions (New Statesman 1923). If he was less of a patriot
would he perhaps have been so troubled by the idea of foreign lending?
If his upbringing had not inclined him to think of foreign food as less
wholesome than home grown, or if he had not bought a Sussex farm for
himself and invested a lot of his college’s money in a large farm estate in
Lincolnshire, would he, after he reverted from his 1931 recommendation
of a revenue tariff to his lifelong belief in free trade, have made an
exception for tariff protection to agriculture? If his father had been a docker,
and not a well-to-do academic philosopher, how would he have dealt
with unemployment?

Keynes’s origins and place in society strongly influenced and limited
his scientific thought. There is a passage in The Economic Consequences of
the Peace where he describes the expectations of a man of his station in
pre-1914 society: while he worked on his papers, sipping his morning
tea in bed, other people cooked and washed and cleaned, providing all
his wants—even if he needed money to travel, he could send his servant
to the bank (1919). Keynes assumed, extrapolating from his own
experience, that “any man of capacity or character exceeding the average”
could attain this position.

In the world as it was then, security and independence were the lot
of a few; the housewives whom Keynes urged to go out and buy sheets
were middle-class housewives with money in the bank to provide for the
future. Yet, in his ideal society, he desired security and independence
for everybody. However, he could conceive of it only in terms of his own
experience: social happiness was employment for every one, each in his
appointed place, his own niche.




Keynes, armed with all the advantages of his upbringing, believed
unquestioningly that anyone in England with enough ambition could
rise to his proper position in society. Nor did he have any question as to
whether it was just for a man who had risen as rapidly as he had to be
able to rely so implicitly on the full-time dutiful service of others who did
not have those advantages. (What were the thoughts of the servant, who
earned perhaps £1 a week, when he was sent to the bank to fetch £10 or
£50? Keynes was fortunate if the servant, like himself, did not bother his
head about it.) Social injustice existed only in there not being enough
jobs to go around. If there were servants with talents or character above
their appointed stations, they deserved help, financial or advisory, from
their betters; it was the obligation of the employer and ultimately the
state to make it up to them in welfare benefits and other social transfers.

So that, although Keynes thought of himself as a radical, one can see
that he took a conservative, even an archaic view of society. His “radical”
solution of government maternalism has now become the received
orthodoxy. Even so, his social philosophy made a great leap forward: he
said that it is wrong for a government to expect people to study, work
hard, be honest and responsible, in order to fit themselves for a place in a
society presumed to be anxious to employ and make good use of them,
when in fact the economic policy of that government precludes a large
number of its citizens from having an opportunity for employment and a
decent career; government has a responsibility to society to follow an
economic policy that will satisfy these expectations.

We are left with two problems that did not trouble Keynes. We
still have not solved the problem of equal opportunity, which Keynes
took as a matter of course. We do not yet know how to reconcile the
boring nature of many jobs with freedom for the human spirit; Keynes,
who thought about it in terms that reflected his own social background,
looked forward to more automation, less work, and the enrichment of
leisure time by cultural activities provided by the state.

In summary, in my judgment, Keynes was a politician, but a politician
whose constituency was not electoral but intellectual—he had to be a
scientist to be a politician. And he was a good enough scientist, with a
strong enough sense of scientific integrity, and a strong enough aesthetic
preference for truth, to recognize eventually that the social science he
knew was not good enough to solve the problems he recognized as
politically important and that he had to reform the science to make it politically
relevant and useful. He was a scientific political economist. One
can emphasize either the “scientific” or the “political”—and which
adjective one emphasizes depends on whether one is writing a political
biography of the man himself or a history of economic thought—but
both adjectives are appropriate and both are necessary to characterize
what the man was and what he contributed to British society and British
social history.





References 

Asquith, Margot. Unpublished letter. Keynes Papers, Marshall Library,
Cambridge University, January 2, 1933.

Keynes, J. M. The Economic Consequences of the Peace (1919). In The Collected
Writings of John Maynard Keynes. Vol. 2. London: Macmillan St. Martin’s (for
Royal Econ. Soc.), 1971.

---. “An Economist’s View of Population.” In “Reconstruction in Europe,”
Suppl. 6, Manchester Guardian Commercial, August 19, 1922, pp. 340-41. (a)

---. “Reconstruction in Europe: An Introduction.” In “Reconstruction in
Europe,” Suppl. 2, Manchester Guardian Commercial, May 18, 1922, p. 66. (b)

---. A Revision of the Treaty (1922). In The Collected Writings. Vol. 3. London:
Macmillan St. Martin’s (for Royal Econ. Soc.), 1972; also in Essays in
Persuasion, in The Collected Writings. Vol. 9. London: Macmillan St. Martin’s
(for Royal Econ. Soc.), 1972. (c)

---. “Population and Unemployment.” Nation and Athenaeum, October 6,
1923, pp. 9-11.

---. “Am I a Liberal?” (1925). In Essays in Persuasion, in The Collected Writings.
Vol. 9. London: Macmillan St. Martin’s (for Royal Econ. Soc.), 1972.

---. “Liberalism and Labour” (1926). In Essays in Persuasion, in The Collected
Writings. Vol. 9. London: Macmillan St. Martin’s (for Royal Econ. Soc.),
1972.

---. “Liberalism and Industry.” In Liberal Points of View, edited by H. L.
Nathan and H. Heathcote Williams. London: Benn, 1927.

---. Report of Liberal summer school debate with Tom Johnston, M.P.
Manchester Guardian, August 4, 1928.

---. “Mr. J. M. Keynes Examines Mr. Lloyd George’s Pledge.” Evening
Standard, March 19, 1929. (a)

---. “Mr. Keynes’s Lecture” (letter). Manchester Guardian, November 19,
1929. (b)

---. “The Draft Convention for Financial Assistance by the League of
Nations” (letter). Nation and Athenaeum, April 5, 1930, p. 11. (a)

---. “Investments Abroad” (letter). Times (London), March 13, 1930. (b)

---. “The Prosperity of England: Mr. Keynes’s Reassuring Picture” (report).
Manchester Guardian, June 28, 1930. (c)

---. “The Question of High Wages.” Polit. Q. 1 (January 1930): 110-24. (d)

---. “Sir Oswald Mosley’s Manifesto.” Nation and Athenaeum, December 13,
1930, p. 367. (e)

---. “Unemployment” (broadcast discussion with Sir Josiah Stamp). Listener,
February 26, 1930, p. 383. (f)

---. “After the Suspension of Gold” (letter). Times (London), September 29,
1931. (a)

---. “Economic Notes on Free Trade. II. A Revenue Tariff and the Cost of
Living.” New Statesman and Nation, April 4, 1931. (b)

---. “The Problem of Unemployment” (broadcast talk). Listener, January 14,
1931, p. 46. (c)

---. “The Dilemma of Modern Socialism.” Polit. Q. 3 (April-June 1932):
155-61. (a)

---. Unpublished letter to R. J. Frew. Keynes Papers, Marshall Library,
Cambridge University, December 10, 1932. (b)

---. “The World Economic Conference, 1933.” New Statesman and Nation,
December 24, 1932, p. 826. (c)

---. “Saving and Spending” (broadcast discussion with Sir Josiah Stamp).
Listener, January 11, 1933, pp. 41-42. (a)




---. “National Self-Sufficiency,” pt. 5. New Statesman and Nation, July 15,
1933. (b)

---. “The Commemoration of Thomas Robert Malthus. The Allocutions. III.
Mr. Keynes.” Econ. J. 45 (June 1935): 234. Reprinted as “Robert Malthus:
Centenary Allocution.” In Essays in Biography, in The Collected Writings. Vol. 10.
London: Macmillan St. Martin’s (for Royal Econ. Soc.), 1972.

---. “How to Avoid a Slump.” Times (London), January 12 and 13, 1937.

---. “Democracy and Efficiency” (interview with Kingsley Martin). New
Statesman and Nation, January 28, 1939, p. 122.

---. How to Pay for the War: A Radical Plan for the Chancellor of the Exchequer.
London: Macmillan, 1940. Reprinted in Essays in Persuasion, in The Collected
Writings. Vol. 9. London: Macmillan St. Martin’s (for Royal Econ. Soc.),
1972. (a)

---. “Mr. Keynes’s Plan: Control of Boom and Slump” (letter). Times
(London), April 10, 1940. (b)

---. Report of talk to the Fabian Society. Keynes Papers, Marshall Library,
Cambridge University, February 21, 1940. (c)

---. “My Early Beliefs.” In Two Memoirs, with introduction by David
Garnett. London: Hart-Davis, 1949 (posthumous). Reprinted in Essays in
Biography, in The Collected Writings. Vol. 10. London: Macmillan St. Martin’s
(for Royal Econ. Soc.), 1972.

Keynes, J. M., and Henderson, H. D. Can Lloyd George Do It? An Examination of the
Liberal Pledge. London: Nation and Athenaeum, 1929. Reprinted in Essays in
Persuasion, in The Collected Writings. Vol. 9. London: Macmillan St. Martin’s
(for Royal Econ. Soc.), 1972.

New Statesman. “Malthusian Moonshine” (unsigned article). September 22, 1923,
pp. 664-65.





Group Segregation and Optimal 
Jurisdictions 


Martin McGuire 

University of Maryland


Even if people are entirely devoid of any feelings toward each other
(sympathetic or antipathetic) at a personal level, they may find it in
their interests to set up and join associations which are segregated on the
basis of income and/or tastes. This paper constructs a model to show how
wealth, technology, and preferences can interact to provide common
incentives for people to form segregated groups for the provision of local
collective goods—that is, goods which cannot readily be supplied and
priced on a variable unit-of-services basis, but which are best provided
communally at the same level to all members of the association, in such
a way that nonmembers are excluded from enjoying them altogether.


I. Introduction

The study of groups is of particular interest to economists because
economics has something special to add to our understanding of group
formation. Our population divides itself into groups in a variety of ways,
according to color, language, ethnic origin, age, marital status, etc.; in
many cases such stratification may involve no individual choice. When
groups arise as voluntary associations, however, for the mutual benefit of
the membership, analysis of the basis for and the structure of such mutual
cooperation is essentially economic. Economic reasoning about groups will
suggest two alternative “pure” paradigms to explain segregation. In
both these pure cases, people cooperate for their mutual benefit; the
alternatives differ with respect to the source of those mutual benefits.

Earlier versions of this paper were read to the Committee on Urban Public Economics,
where I benefited from members’ criticisms, and to the Barcelona meetings of the
Econometric Society in August 1971. I also want to thank Thomas Schelling, Henry Aaron,
Hirofumi Shibata, and the referees for comments. Assistance from the Bureau of Business
and Economic Research, University of Maryland; from the Institute of Social and
Economic Research, University of York, England; and from the National Science
Foundation is gratefully acknowledged.





On the one hand, reciprocal benefits from association may stem solely
from the gratification of personal tastes for association—the pure pleasure
of being near others one wants to be near.¹ A second source of mutual
benefit from voluntary association, and, therefore, a second pure case or
paradigm for group formation, is the common advantage people may find
in producing, exchanging, or consuming some good which they all value.
In this second pure case, personality is nothing; no benefits derive from
association as such; people confer no externalities upon one another
solely by virtue of their existence. Doubtless both kinds of segregation often
reinforce each other in the real world, but it is useful to isolate the
nonpersonal mechanisms of group segregation for closer analysis, so as to
appreciate the power of people’s concern with the utility of things to
isolate them from other people.

The purpose of this paper is to build such an apersonal economic model
of group formation based on tastes, wealth, and technology. More
specifically, we explore the logic of group segregation and the interaction
among these determinants in the provision of local collective goods or
services.


II. Optimum Jurisdictions, Public Good Supply, and Tastes for a
Homogeneous Population

Historically, the greater part of economic analysis has approached the
theory of local collectives from two perspectives. First, from the viewpoint
of pure public good theory, questions of optimal supply and tax
distribution have been analyzed (McGuire and Aaron 1969; Samuelson 1969).
Second, the effects of interjurisdictional externalities upon public good
supply and burden-sharing optima have received considerable attention
(Pauly 1970a; Williams 1966). Less attention, however, has been given to
the question of the size and composition of each consuming group and to
the interaction of size and composition with optimal supply.

One reason for the relative neglect of this question in the literature is
that it is harder to analyze. As an example of this, take the problem of
defining an economic optimum. In the case of pure public goods, the size
and nature of the consuming group is given in advance, and the problem
is to determine how much to supply, how to distribute cost burdens, and
how, if at all, to use prices and quasi markets to attain desired outcomes.
On the other hand, to define an economic optimum in the provision of a
local public good, several problems must be resolved simultaneously:
(1) the optimal number and shape of jurisdictions; (2) the best way of
distributing households among jurisdictions; and (3) the resulting

¹ Thomas C. Schelling (1969) has explored this basis for group formation with a set
of pure personal-taste models, models in which the aggregate outcome of individualistic
behavior follows from the nature, the diversity, and the distribution or structure of personal
tastes for association.




patterns among local collectives: to include size, tax rate, and public
good supply distributions, and homogeneity or heterogeneity of local
population groups. This paper provides a unified theory of such
simultaneous optimization, thereby consolidating extant models by Tiebout
(1956, 1961), Buchanan (1965), and Rothenberg (1970).²
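
The idea of determining group size and public good supply together can be illustrated with a small numerical sketch. It is not the model of this paper: the cost function (a fixed cost shared among members plus a congestion term), the logarithmic utility, and every parameter value are hypothetical assumptions chosen only so that the optimum is easy to check by hand.

```python
import math

# Hypothetical toy model; none of these functional forms or numbers come
# from the paper. They are assumptions made purely for illustration.


def per_capita_cost(n, fixed=100.0, crowd=0.01):
    """Cost to each member of one unit of the local good in a group of size n.

    fixed / n is the gain from sharing a fixed cost; crowd * n is congestion.
    """
    return fixed / n + crowd * n


def best_supply(n, income=50.0, taste=0.5):
    """Supply level y that maximizes log(income - p*y) + taste*log(y).

    The first-order condition gives y = taste*income / ((1 + taste)*p),
    where p is the per capita cost at group size n.
    """
    p = per_capita_cost(n)
    return taste * income / ((1.0 + taste) * p)


def utility(n, y, income=50.0, taste=0.5):
    """Utility of a representative member of a homogeneous group."""
    p = per_capita_cost(n)
    return math.log(income - p * y) + taste * math.log(y)


def optimal_jurisdiction(max_size=1000):
    """Search over group sizes, using the best supply level at each size."""
    best = max(range(1, max_size + 1), key=lambda n: utility(n, best_supply(n)))
    return best, best_supply(best)


size, supply = optimal_jurisdiction()
print(size, round(supply, 2))
```

Under these assumed forms the best group size is the one that minimizes per capita cost, and the supply level then follows from the member's budget. That separation is an artifact of the cost being linear in the supply level; with other cost or congestion assumptions the two choices would have to be solved jointly.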

For our purposes, a local public good is one which can be produced in
various quality or quantity levels,³ but must be provided uniformly⁴ to
all individuals consuming at the particular level in question. An
individual is assumed to choose only one quality or quantity level for each
type of public good and thereby to identify himself with the particular
level chosen. Examples of such situations are very common: police
protection is supplied to all citizens within a county (town, precinct)

² Writing in 1956 and again in 1961, Tiebout proceeded on the assumption of a mobile
“consumer” and fluid, redefinable local jurisdictions which effect no externalities upon
each other. His conclusions were:

1. For any configuration of jurisdictions (group size, collective output, tax rates, and
expenditures) each consumer-voter “moves to that community whose local government
best satisfies his set of preferences. The greater the number of communities and the
greater the variance among them, the closer the consumer will come to fully
realizing his preference position” (1956, p. 418).

2. For every pattern of community services, there is an optimal (that is, average cost
minimizing) community size. Optimal sizing of communities will result from the
combined motives (a) of individuals to leave higher than minimum average-cost
communities, (b) of city managers to restrict entry through zoning, collusion with realtors,
etc.

3. People with similar fiscal tastes given the costs to each of them should move
together.

More recently, James Buchanan (1965) analyzed a similar if not identical set of
problems in “An Economic Theory of Clubs” (the formal similarity between this and
Tiebout’s model has not been widely recognized in the literature—possibly due to the
fact that Buchanan’s analysis was not developed exclusively in the context of local
government organization). Although Buchanan explicitly assumed that the various
members of a collective impose external economies or diseconomies upon each others’
consumption, he might alternatively have followed Tiebout and have treated externalities
as arising in the production of the collective good. In any case, Buchanan’s central
conclusion closely parallels Tiebout’s, namely, that for a homogeneous group of individuals
the optimum size of jurisdiction (or club) and level of collective good must be
simultaneously determined, and that freedom of association—freedom both to create a club and
to join one already there—will lead utility maximizers to just such an optimum. More
recently still, Rothenberg (1970) discusses the local collective problem as one of reaching
a balance between consensus among members of particular jurisdictions and scale
economies in production. Jurisdictional consensus or homogeneity requires small,
inefficient groups, he conjectures, while “efficiency” entails large, heterogeneous,
conflict-ridden units.

‘ The “amount” of the good will be taken as a contmuous variable in this study 
represenbng either quantify or qualify of the good or service 

* Or m bed proportions! Our analysis will assume equal provision of the good to all 
members m the “lo^” group, although the analysis could easily be extended to include 
diversity m the fixed proportions m which the good is dutnbuted. So long as the distri¬ 
bution of a good can, m fact, be regulated according to a fixed proportion rule, the good 
can be treated as “pubhc” (tee Buchanan 1968, esp pp 58-56, entitled "Private Cteods 
as Pubhc Goods”) 




approximately uniformly; so are fire protection within a fire district and
water and sewer service within a sanitary district; in most places equal
access to one level of primary and secondary education is available to all
within a particular school district.⁵

It will prove useful to begin our analysis with a case in which segregation
is impossible or meaningless, that is, the case of a homogeneous national
population with identical tastes, incomes, and, therefore, demand
curves for local public goods. Consider a world of two goods: $y$, the
private-good numeraire, and $x$, the local public good. Everyone’s utility function
can be written as $U^i(x^i, y^i)$, $i = 1, \ldots, P$, where $P$ = population size.

Next we assume that a certain cost function for a given jurisdiction
applies universally to all possible jurisdictions whatever their composition:⁶

$$C = C(x, n),$$

where $C$ = total cost measured in units of the numeraire, $x$ = level of
public good supplied to each member of the jurisdiction, and $n$ =
population or size of jurisdiction. Writing the total cost function in this
way suggests that our concept of “jurisdiction” is closely analogous to the
idea of a firm in the conventional analysis of private goods. In our case,
however, individuals are identified with their particular “firm.” They
cannot (or merely do not because of high cost) divide their consumption
between jurisdictions. Let us assume that every member of this collective
has the same opportunity to enjoy the public good $x$. People at the
fringes of the jurisdiction do not benefit less in any way than do individuals
at its center; people outside the jurisdiction receive no benefit from $x$.⁷
Thus, in our model the essence of the “localness” of the “public” good is
captured by making total cost a function of $n$, the subpopulation served
by (or size of) the jurisdiction.⁸ The influence of $n$ upon $C$ or upon $x$
may be regarded either as purely a technical production effect or
alternatively as a congestion-consumption effect.

Clearly, we must concern ourselves with the case where the total cost of
public-good production increases with size of the jurisdiction (as well as
with the level of public good), or, equivalently, the quantity or quality of


⁵ These examples are all of publicly provided goods. There is a close analogy between
these public good “jurisdictions” and a private good market area. The analogy is
discussed in more detail in McGuire (1972).

⁶ We are measuring the local collective good, therefore, in “consumption units,” each
unit of which will figure identically in the utility function of every member of the local
collective. This is one particular way of representing a general implicit relation $F(C, x, n)$
among the variables cost, output, and group size. We might equally have written
$x = x(C, n)$. It will become clear that this formulation would produce identical formal results
but with a slightly modified interpretation (see n. 12).

⁷ We rule out interjurisdictional spillovers in this analysis.

⁸ If one allowed for the fact that costs may also depend on the type of user, the
conclusions of our analysis might change.




public good supplied is degraded as numbers served increase. That is,

$$\left.\frac{\partial x}{\partial n}\right|_{C = \text{constant}} < 0; \qquad C_n \equiv \frac{\partial C}{\partial n} > 0.$$


Without this stipulation our problem would disappear. If numbers
served could be increased at zero cost with no quantity-quality loss, one
national jurisdiction would be optimal, and the questions of supply and
cost shares have been thoroughly treated already for this case of a national
public good. As we shall see presently, even with $C_n > 0$, a single national
jurisdiction may still be the best way of organizing for collective supply.
To proceed with the analysis, let us define $c^i$ as the $i$th person’s tax or
contribution to defraying the costs in “his” jurisdiction so that a person’s
final consumption of the private good is his initial endowment (or
earnings) $\bar{y}^i$ less this tax, that is, $y^i = \bar{y}^i - c^i$. We will let $\bar{y}^i$ be the same for
everyone. Clearly, optimization of local public-good supply is a meaningful
question only with reference to some social welfare objective. Therefore,
we assume that each of the identical individuals’ utilities is equally
weighted in a social welfare function. This formulation implies a symmetry
in the solution to the welfare maximization problem, namely, equal
utility levels and public-good consumption for each individual. We can
exploit this symmetry and simplify the derivation of necessary conditions
for a welfare maximum by observing first, that cost burdens must be equal,
$c^i = C(x, n)/n$, and second, that maximization of social welfare becomes
equivalent to maximizing the utility of the representative individual.⁹

Maximize:

$$U\left[x,\ \bar{y} - \frac{C(x, n)}{n}\right].$$

Necessary conditions for a maximum are:

$$nC_n = C \qquad \text{and} \qquad n\,\frac{U_x}{U_y} = C_x.$$
From these two equations in two unknowns we solve for $x$ and $n$. If the
total population is of size $P$, there should be $P/n$ identical jurisdictions.
The first of these conditions requires that each of the $P/n$ groups be
sized such that average cost per person $C/n$ be at a minimum; this occurs
when average and marginal costs (of adding one more person) per person


⁹ This illustrates that people simultaneously create external economies and diseconomies
for each other in local groups. An increase in $n$ lowers $U$ (external diseconomy),
since $C_n > 0$. An increase in $n$ raises $U$ (external economy) since it spreads the cost. In
our model people do not have a pure distaste for association, an interpretation which the
less specific formulation such as $U = U(x, n, y)$ would allow. Our conclusions, however,
apply equally to the less specific formulation.




are equal.¹⁰ Hence, each individual’s share of the cost burden, $c^i$, equals
the marginal cost he imposes on his local system, $C_n$.

The second of the above conditions is the familiar one that

$$\sum MRS_x = MC_x$$

for each and every of the $P/n$ groups (where $MRS_x$ indicates an individual’s
marginal rate of substitution between goods $x$ and $y$, that is, $U_x/U_y$, and
$MC_x$ indicates the marginal cost of supplying everyone in the group with
one more unit of $x$).
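As a purely illustrative check on these two conditions, the sketch below assumes a hypothetical cost function $C(x, n) = x(F + An^2)$ and a quasi-linear utility $U = B \ln x + y$. Neither functional form nor any parameter value comes from the text; they are chosen only because per-person cost is then U-shaped in $n$ and the conditions can be solved by hand. The code compares that hand solution of $nC_n = C$ and $nU_x/U_y = C_x$ with a brute-force search over a grid of $(x, n)$ pairs.

```python
import math

# Hypothetical parameters (not from the paper): fixed cost per unit of x,
# congestion coefficient, taste for the public good, common endowment.
F, A, B, Y_BAR = 100.0, 1.0, 40.0, 100.0
STEP = 0.05  # grid spacing used by the brute-force check


def cost(x, n):
    """Assumed total cost C(x, n) = x * (F + A * n**2); C/n is U-shaped in n."""
    return x * (F + A * n ** 2)


def utility(x, n):
    """Representative utility B*ln(x) + y, with y = Y_BAR - C(x, n)/n."""
    return B * math.log(x) + Y_BAR - cost(x, n) / n


def closed_form_optimum():
    """Solve n*C_n = C and n*U_x/U_y = C_x for the assumed functional forms."""
    n_star = math.sqrt(F / A)                    # n * (2*A*n*x) = x * (F + A*n**2)
    x_star = n_star * B / (F + A * n_star ** 2)  # n * (B / x) = F + A*n**2
    return x_star, n_star


def grid_optimum():
    """Search x in (0, 6] and n in [1, 30] on a grid as an independent check."""
    best_u, best_x, best_n = -math.inf, None, None
    for i in range(1, 121):
        for j in range(20, 601):
            x, n = i * STEP, j * STEP
            u = utility(x, n)
            if u > best_u:
                best_u, best_x, best_n = u, x, n
    return best_x, best_n


x_star, n_star = closed_form_optimum()
x_grid, n_grid = grid_optimum()
share = cost(x_star, n_star) / n_star
print(f"closed form: x = {x_star:.2f}, n = {n_star:.2f}, cost share = {share:.2f}")
print(f"grid search: x = {x_grid:.2f}, n = {n_grid:.2f}")
```

With these assumed forms the cost-minimizing size does not depend on $x$, which is a special case; the text allows the two to interact.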

These equations specify the relationships which must obtain among the
variables (1) size of each jurisdiction, (2) level of public good supply,
and (3) tax burden. The equation

$$C(x, n) - nC_n(x, n) = 0$$

may be thought of as a normative supply function, relating size of jurisdiction
to level of public-good supply. Similarly, let us tentatively interpret
equation

$$n\,\frac{U_x\left[x,\ \bar{y} - C(x, n)/n\right]}{U_y\left[x,\ \bar{y} - C(x, n)/n\right]} - C_x(x, n) = 0$$

(with $\bar{y}$ the common endowment) as a normative demand function. This equation indicates how level of
public goods ought to vary as the size of jurisdiction increases, if the
efficiency condition $MC_x = \sum MRS_x$ is to be maintained. The first of
the above set of equations is called a “supply function” on the grounds
that it is purely technological, representing the values of jurisdiction size
which minimize costs of supplying various levels of public good uniformly
to the homogeneous population. The second equation, on the other hand,
has been labeled a “demand” equation, since the utility functions of
individuals in the homogeneous population figure directly in their
derivation.

With this information at hand, let us portray the solution to this problem
graphically. Figure 1a gives an illustration of the basic, underlying
cost curves $C(x, n)$, and the various points on the supply curve where
$C/n = C_n$. In the case illustrated, along that supply function $dx/dn < 0$;
hence, as shown in figure 1a, $x^0 < x^1 < x^2$, $n^0 > n^1 > n^2$. This
progression of per person cost minima means that big jurisdictions are

¹⁰ If the population (of size $P$) cannot be divided into an even number of groups of
size $n$ (as seems likely), the optimum-size group will either be the next lowest value of $n$
(to round up $P/n$ to an integer) or the next highest value of $n$ (to round $P/n$ down to an
integer). Continuous method of analysis will not solve this problem (see Pauly 1970).




[Figure 2; horizontal axis: x]


assumed to supply lower levels of public good than do small jurisdictions.¹¹
Figures 1b and 1c now show per person costs as a function both of
jurisdiction size and level of output, respectively. In figure 1b we show
U-shaped costs per person. If the function $C/n$ rose monotonically, the
good $x$ should be supplied as a private good; if $C/n$ steadily declined, there
should be only one “national” jurisdiction. Figure 1c shows the set of
efficient supply points along an envelope $E(x)$. Each efficient combination,
$(x^0, n^0)$, $(x^1, n^1)$, etc., satisfying $C_n = C/n$, contributes one point to
this technically efficient supply schedule.
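The three cases just described (rising, falling, and U-shaped per-person cost) can be sketched as a small classifier. The per-person cost curves and the population of 1,000 below are hypothetical, and a minimum at either boundary is taken here as a stand-in for a monotonic curve, which is a simplification.

```python
def cost_minimizing_size(avg_cost, population):
    """Return the group size n in 1..population with the lowest per-person cost."""
    return min(range(1, population + 1), key=avg_cost)


def classify(avg_cost, population):
    """Label the good by where per-person cost C/n reaches its minimum."""
    n = cost_minimizing_size(avg_cost, population)
    if n == 1:
        return "private good", n                  # C/n rises with n throughout
    if n == population:
        return "single national jurisdiction", n  # C/n falls with n throughout
    return "local public good", n                 # interior minimum of C/n


POPULATION = 1000  # hypothetical
examples = {
    "rising": lambda n: 5.0 + 0.1 * n,
    "falling": lambda n: 100.0 / n,
    "u_shaped": lambda n: 100.0 / n + n,
}
for name, curve in examples.items():
    label, size = classify(curve, POPULATION)
    print(f"{name}: {label} (cost-minimizing n = {size})")
```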
We can now employ figure 2 to give a complete description of the
determination of the optimum jurisdiction size (number of people),
optimum level of local public good, and corresponding payment structure
for a perfectly homogeneous population. Several utility contours between income
(i.e., the negative of $C/n$) and $x$ are shown for one representative individual
in the homogeneous population. This individual, and therefore the entire
population, maximizes utility by (1) organizing itself into groups of
size $n = n^1$, (2) supplying each group and therefore each member with
$x = x^1$ of local public good, and (3) charging each member
$c^1 = C(x^1, n^1)/n^1$.

¹¹ This is an arbitrary assumption. Total differentiation of the supply function,
$C - nC_n = 0$, leads to an expression for the slope of that function, namely,

$$\left.\frac{dn}{dx}\right|_{\text{supply}} = \frac{C_x - nC_{xn}}{nC_{nn}},$$

which may be positive or negative. Our assumption that this slope is negative is made
solely for illustration.





Another useful way of illustrating this optimum is shown in figure 3.
Consider one (nonmaximum) indifference curve in figure 2. Let us hold
the utility level constant at, say, $U^0$, and plot in figure 3 the combinations
of $x$ and $n$ which just allow $U^0$ to be maintained. Clearly, two points on
the supply-efficiency envelope allow $U^0$ to be maintained, namely,
and b*. Similarly, there are two points which allow $U^0$ to coexist with
efficiency on the demand side, namely, 6^ and b^. To see this, suppose
the size of jurisdiction is arbitrarily set at $n = n^0$; given that size and
assuming $n^0$ identical individuals actually “inhabit” the jurisdiction, the
utility-maximizing level of collective output must satisfy

$$MC_x = \sum MRS_x.$$

This output level is identified by the tangency between $C(x, n^0)/n^0$ and
the representative individual’s utility curve at point b^.

All combinations of $x$ and $n$ which just allow $U^0$ to be maintained form a
closed loop in figure 3.¹² At points and b* this loop has a horizontal
slope, while at points b^ and b^ its slope is vertical. This follows from the

¹² For every combination of $x$ and $C/n$ two values of $n$ exist except on the efficiency
envelope itself, where one value of $n$ obtains. This construction effectively shows the unity
between Tiebout’s and Buchanan’s analysis (see n. 2), since we have taken Tiebout’s
assumptions and, in figure 3, produced Buchanan’s geometry. More generally, this result
shows that our analysis applies to any “optimum club-size” problem where the size of
membership simultaneously exerts an external economy and diseconomy on each member.
The fact that we have formally ascribed these simultaneous externalities to the production
process is more or less incidental. Since we end up with a general, representative utility
function $U(x, n, y)$ in which the precise mechanisms of externality are not explicit, we
could have generated the same general model with some other externality mechanism.
For example, we could have started off the analysis by defining the local public good as
$x = x(C, n)$, thereby implying that the external diseconomy occurs in a “consumption
process.”





fact that at and b* utility is maximized with respect to jurisdiction
size, while at b^ and b^ utility is maximized with respect to level of
collective good supply. Figure 3 shows several contours of constant
utility constructed in a similar way. Now, consider all points along the
line 4*4®. These indicate the utility-maximizing level of public-good
production for various values of group size. Clearly, the line 4*4®
represents the efficient demand equation, $MC_x = \sum MRS_x$. Note that this
“demand” equation depends on income, on tastes, and on technology.¹³
Next consider all points along the line 4*4*: these points indicate the
utility-maximizing value of jurisdiction size for various values of public
good output. Thus, the line 4*4* represents the efficient supply equation
$nC_n = C$. Note that this supply schedule is purely technical. Finally,
figure 3 shows the simultaneous optimization of jurisdiction size and level
of public good supply at the intersection of the supply and demand curves,
and utility index U*.


III. Efficient Jurisdictions for Heterogeneous Populations

We can now turn to mixed populations and to the normative question of
whether integration is more or less efficient than segregation when grouping
people for the provision of local public goods.¹⁴ As an initial simplification
of the problem of finding efficient jurisdictions for a heterogeneous
population, let us suppose:

1. Population heterogeneity takes the form of multiple subpopulations,
each homogeneous unto itself and discretely
separated¹⁵ from every other homogeneous subpopulation.
Every member of a homogeneous subpopulation has the
same demand curve $D(x, n)$.

¹³ Given the technology, this demand curve as drawn in figure 3 will have a shallow
(steep) slope if the public good ($x$) and private good ($y$) are complements (substitutes).
And for given underlying preferences, the “demand” curve will be steep or shallow
according as marginal costs $C_x$ increase rapidly or gently (with respect to $x$).

¹⁴ The foregoing analytics suggest an operational definition of “diversity” or
“heterogeneity.” Consider the demand curve of figure 3 again. One demand curve can be
taken to represent the combined income and preferences of one individual, without regard
to the existence of other identical individuals. Clearly, different individuals with different
tastes and incomes can have identical demand functions, provided the differences in
tastes and incomes cancel. Thus, one definition of “economic homogeneity” may be
“identical demand curves.” But even if Mr. i and Mr. j have different demand curves,
the two could be considered “economically homogeneous” if each’s demand curve
intersects the supply curve at the same point. Since at that point both parties agree on (1) size
of jurisdiction, (2) equal cost shares, and (3) level of public good supply, it is plausible to
define the two as homogeneous. [Whether this definition is accepted or not, the underlying
argument remains valid that individuals of highly diverse needs, tastes, and incomes
may find themselves in unanimous agreement on precisely the three variables ($C/n$, $n$, $x$)
which must be simultaneously optimized for jurisdictional efficiency.]

¹⁵ This discrete separation is not necessarily spatial; it is mathematical.




2. Each homogeneous subpopulation is large enough to form
many efficient jurisdictions. Hence, we can ignore the
problem of where to put “leftover” individuals as being of
second-order importance.

3. A separate jurisdiction can be formed for each local public
good (separate jurisdictions need not be formed if cost and
demand conditions are the same for different public goods).

Given these assumptions, the foregoing analysis of homogeneous populations
implies that integration, in the sense of forming mixed tax-expenditure
jurisdictions of people of diverse demands, is economically
inefficient. Given the foregoing assumptions, whatever the source of
differences in demand by different representative individuals (whether
due to diverse fundamental preference structures or to differences in
income levels), it is Pareto-optimal to allow homogeneous (with respect to
marginal rates of substitution) groups to form their own collectives; it is nonoptimal
to effect income transfers by integrating diverse groups and then varying cost burdens
according to equity judgments.¹⁶

To demonstrate this conclusion, let us first establish the normative
proposition that if we integrate diverse classes and simultaneously require
that all members of a jurisdiction pay equal cost shares, everyone is worse
off than he would be if allowed to join segregated groups. We can
demonstrate this with figure 4. Suppose the population is made up of two
distinct classes, class 1 and class 2, of equal size.¹⁷ If allowed to form
separate, segregated collectives, the first class should organize itself into
jurisdictions of size $n^1$ and supply each member with $x^1$ in public goods
at a cost of $C^1/n^1$ to each member. Class 2 should¹⁸ choose $n^2 < n^1$,
$x^2 > x^1$, and $C^2/n^2 > C^1/n^1$, respectively. (The income value of $x^1$
to each member of class 1 is shown as distance $[(C^1/n^1) + a]$, and the
income value of $x^2$ to each member of class 2 is $[(C^2/n^2) + b]$.)

Now let the two classes be integrated and required to share costs equally.
A new intermediate facility size, $n^3$, should be chosen, supplying all

¹⁶ I conjecture that this theorem is also valid for the case of populations with
continuous distributions of income tastes. As pointed out to me by Joanne Linnerooth, one
exception to this general rule exists: suppose the income transfers which are judged to be
socially desirable just happen to result in an income distribution such that diverse
individuals demand the same quantity of public good at an equal cost share; for this particular
case it is just as efficient to “force” integration and effect the desired income transfers
through discriminatory cost sharing as it is to redistribute income directly and allow
uniform homogeneous jurisdictions to arise “naturally.”

¹⁷ To determine the new “integrated” solution we need to know the relative sizes of
class 1 and 2. For simplicity we assume the two subpopulations have equal numbers.

¹⁸ Note that although we assume the two classes to be of equal size, each should organize
itself (if segregated from the other class) into groups of different size for purposes of public
good supply.

¹⁹ For illustration we continue to assume jurisdiction size to be inversely correlated
with level of public good supply.






[Figure 4]


members of both classes with $x^3$.²⁰ If individuals from both classes, once
integrated, pay equal cost shares $C^3/n^3$, each member of both classes will
be worse off in the integrated situation than in the segregated one, as
the diagram clearly shows. Note that this universal deterioration does not
stem from inefficient supply in the integrated groups. In each integrated
jurisdiction the efficiency conditions $C_n = C/n$ and $\sum MRS_x = C_x$ are
maintained. Hence, the joint utility loss stems solely from class integration.
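A minimal numerical sketch of this comparison follows. It assumes the hypothetical cost function $C(x, n) = x(F + An^2)$ and utilities $B \ln x + y$ with a taste parameter $B$ that differs by class; none of these forms or numbers appear in the text. Under this cost function the efficient group size is the same for both classes, so the example isolates the loss that comes from a compromise level of $x$ rather than reproducing the geometry of figure 4.

```python
import math

# Hypothetical technology, endowment, and class-specific taste parameters.
F, A, Y_BAR = 100.0, 1.0, 100.0
TASTES = {"class 1": 20.0, "class 2": 60.0}


def per_person_cost(x, n):
    """Equal cost share C(x, n)/n under the assumed cost function."""
    return x * (F + A * n ** 2) / n


def utility(b, x, n):
    """Utility of a member with taste b who pays an equal cost share."""
    return b * math.log(x) + Y_BAR - per_person_cost(x, n)


def efficient_package(avg_b):
    """Efficient (x, n) for a group whose members' average taste is avg_b.

    n solves n*C_n = C; x solves sum of MRS = C_x, i.e. n*avg_b/x = F + A*n**2.
    """
    n = math.sqrt(F / A)
    x = n * avg_b / (F + A * n ** 2)
    return x, n


# Segregated regime: each class picks the package that is efficient for itself.
segregated = {name: utility(b, *efficient_package(b)) for name, b in TASTES.items()}

# Integrated regime: classes of equal size share one efficiently run jurisdiction.
x_int, n_int = efficient_package(sum(TASTES.values()) / len(TASTES))
integrated = {name: utility(b, x_int, n_int) for name, b in TASTES.items()}

for name in TASTES:
    print(f"{name}: segregated U = {segregated[name]:.2f}, "
          f"integrated U = {integrated[name]:.2f}")
```

Both classes end up with lower utility in the integrated jurisdiction even though that jurisdiction satisfies both efficiency conditions.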

It may seem that the inefficiency of integration follows only because we
impose the severe restriction of equal cost shares. But this is not so.
Differential cost sharing in public good provision is identical to direct
income transfers. Consequently, direct transfers of income before integration,
and transfers via cost share distribution after integration, in

²⁰ This new solution point is determined by the intersection of the new average MRS
curve (demand for good $x$) with the marginal cost curve $E'(x)$ [marginal to the efficient
supply envelope $E(x)$]. The $E'(x)$ is calculated as

$$E'(x) = \frac{d}{dx}\left(\frac{C}{n}\right) = \frac{C_x}{n} + \frac{1}{n}\left(C_n - \frac{C}{n}\right)\frac{dn}{dx};$$

given that $C_n = C/n$: hence, $E'(x) = C_x/n$.










effect are identical redistribution instruments.²¹ Since we have identified
an “efficient” postintegration distribution at which all parties lose (in
comparison with the preintegration distribution), there is clearly no
redistribution from this inferior point which makes both classes better off.
Furthermore, this argument is valid for every possible preintegration
distribution of income, since our choice of such an initial point was totally
arbitrary. Therefore, the set of possible utility distributions among
segregated groups dominates (weakly) the set of such distributions among
integrated groups. For each of the latter possibilities, a utility distribution
among segregated groups exists which makes everyone better off (or no
worse off). This conclusion is illustrated in figure 5 for the two
homogeneous classes 1 and 2. Assuming we always equalize income among
individuals within each class, no matter what interclass distribution is
desired, we need only plot the utility ($U^1$ and $U^2$) of a representative
individual from each class. The set of utility possibilities under a
segregated regime is shown by the curve $S$; the set under an integrated and
tax-discriminatory regime is shown by $I$. There may be one or more
distributions for which the two regimes are equivalent (distributions under
which everyone would freely integrate and pay equal shares), but in
general $S$ is superior to $I$.

Thus, given our limiting assumptions, we are led to the conclusion
that for the provision of local public goods, class isolation is overall
efficient; it is more efficient to transfer income directly between classes
than to do so indirectly by manipulating jurisdiction membership and
cost shares paid for public goods. (Of course, income transfers will

²¹ For a given membership size within each local jurisdiction, the good provided is a
public good. Recently it has been shown for this case that manipulation of the level of
public goods supply, together with the distribution of the costs of supply among members
of the community, constitutes a perfectly efficient means of redistributing income, one that
is just as ideally desirable as lump sum transfers (see McGuire and Aaron 1969).




influence the ultimate configuration of jurisdictions.) By the nature of the
problem, various jurisdictions cannot export their public goods to others;
hence, equalization of marginal evaluations cannot come about via
regional or local trade. To give a simple intuitive explanation of this
conclusion, consider the case in which the optimal membership of a
local collective is independent of the level of public good supplied.
Obviously in this event segregating individuals with identical preferences is
superior economically to integrating them. We have extended this
conclusion to the case in which group size and public good supply interact.
Evidently the argument is generalizable beyond two classes.

We may also draw one positive conclusion from an otherwise normative
analysis, namely, that when in fact costs of local public good provision
are borne equally regardless of wealth, all classes have symmetrical
incentives to isolate themselves from each other, and migration between
groups (even ignoring relocation expenses) will not voluntarily occur
(if cost burdens are equally borne), since both groups lose from the
population transfer. Indeed, since property and income taxes obviously
are not levied equally in the real world, we get a partial explanation of
why devices such as zoning are used to keep poor people out of rich
communities and why (whatever the distributional consequences) zoning
for this purpose may be efficient.


IV. Diversity of Local Public Goods and Processes of Group 
Formation 

Let us now turn to a process by which, with “freedom of association,”²²
individuals of various preference structures and incomes might form
themselves into groups of diverse sizes supplying diverse quantities and
qualities of public good. A truly adequate positive theory of such
processes should have not only to explain grouping patterns, but also
differential burden-sharing outcomes as between different individuals within
each local collective, since equal taxation of various individuals in the
same local collectives is rarely even approximated. Unfortunately, such a
synthesis of differential taxation and group formation has not yet been
obtained. In this section, we examine instead a process of group formation
on the assumption of equal burden sharing within each group. To the
extent that tax differentials between groups far exceed those within
groups, this exercise will have empirical application, and in any case, the
methodological techniques of analysis developed for our more limited

²² The reader is reminded that individual preferences range only over goods and not
over other people per se or association with them. Thus, we are explicitly ignoring realistic
situations in which “freedom of association” is ambiguous, that is, situations in which one
group wants to associate with a second but the second wants to disassociate itself from the
first.




process should find use in the analysis of more complex local collective
and tax-sharing situations.

We take it as axiomatic that an individual choosing a community will
compare alternative cost-benefit packages and choose the one with the
greatest net gain. “In the short run,” where the alternative public facilities
available are arbitrarily limited in number, entry fees, user fees, or taxes
levied as a rationing device may result in some (low-income) groups being
totally excluded from access to particular types of facilities. For example,
consider two populations and two corresponding public facilities. If one
facility were reserved for each community and suboptimizing user fees
were charged, each group would enjoy some access to the public service
in question. But this outcome might be inefficient; one group might
profitably bribe the other to turn over its facility. In this latter case a
single price would prevail at both facilities, a price which might easily
exclude one group from use of any service altogether. Following this line
of thought suggests that in the interests of equity it may often be justifiable
to reserve some facilities for certain interest groups, to charge discriminatory
prices among groups for use of the same facility, or to deliberately
allow congestion itself (rather than a price-tax mechanism) to regulate
total demand.

But such limitations on overall efficiency as these are unjustifiable in
the long run when group formation and public facility provision are both
variables. In the long run, if the total society is big and varied, an
individual making the decision of which local collectives to join (and
therefore of his tax and public good consumption level) will be confronted
with a wide range of choices, each slightly different from the next. As one
basic assumption for the analysis of these choices, we take it that each
local group is willing to accept new members who pay the full additional
cost they would impose on the group;²³ in other words, the price of
admission to group $k$ is assumed to be $C_n(x^k, n^k)$. Now in comparing
any two groups, an individual will observe that the differential cost
between groups $k$ and $j$ is $C_n(x^k, n^k) - C_n(x^j, n^j)$, and the differential
benefit is $x^k - x^j = \Delta x$. Assuming the two groups to be fairly “close,”
the differential cost is approximated by

$$\Delta C = C_{xn}\,\Delta x + C_{nn}\,\Delta n.$$

Hence, any one individual who is “shopping” for a collective to join will
calculate a marginal cost-benefit ratio of

$$\frac{dC}{dx} = C_{xn} + C_{nn}\left.\frac{dn}{dx}\right|_{\text{supply}}$$

²³ This full additional cost is the amount of money a newcomer must contribute to
insure that his entry into the group leaves the established members’ benefits and costs
unaffected.





and will proceed in his shopping until he finds the group at which this
marginal rate of technical substitution equals his subjective marginal rate,
that is,

$$\frac{U_x}{U_y} = C_{xn} + C_{nn}\left.\frac{dn}{dx}\right|_{\text{supply}}.$$

As a second basic assumption to carry the analysis forward, we suppose
that every member of the population (including old members of
jurisdictions who mutually impose opportunity costs on one another) has the
option of shopping around among about the same range of packages
and is required to pay opportunity cost for his public services.
Given this further assumption, it follows that the entire (heterogeneous)
population will be led to a constellation of diverse groups, each (roughly)
homogeneous. Each group will tend to contain individuals with the same
subjective rate of substitution between money and the public good. This
clearly is a process of self-selection or segregation by taste and income
level. Each group will be characterized by an equation,

$$n\,\frac{U_x}{U_y} = nC_{xn} + nC_{nn}\left.\frac{dn}{dx}\right|_{\text{supply}}.$$

Last, let us further assume that each alternative jurisdictional option or
tax-expenditure package is efficiently sized, such that in every jurisdiction
$C/n = C_n$.²⁵ This assumption, together with free, utility-maximizing
search and marginal cost “membership charges,” implies an
overall-efficient constellation of segregated jurisdictions. For under these
assumptions, when a single individual compares benefits and costs of alternative
jurisdictions, he will observe that size and local service are correlated as
(see n. 11):

$$\left.\frac{dn}{dx}\right|_{\text{supply}} = \frac{C_x - nC_{xn}}{nC_{nn}}.$$

Substituting this expression into the individual consumer’s cost-benefit
equation, one obtains

$$n\,\frac{U_x}{U_y} = nC_{xn} + nC_{nn}\left(\frac{C_x - nC_{xn}}{nC_{nn}}\right) = C_x, \qquad \text{that is,} \qquad \sum MRS_x = MC_x.$$

Consequently, the process of individual search for the right cost-benefit
package will lead to efficient group formation on the demand side. In
short, given the above assumptions (including equal taxation within
groups) we can build a self-sustaining process whereby an initial
configuration of efficient jurisdictions is maintained. But are these
assumptions realistic? Or do individual incentives exist to violate the

²⁴ Again, we ignore relocation costs.

²⁵ As an assumption this has appeal in the context of equal (opportunity cost) taxes
for all members of the group. Unless the group is sized efficiently, total collections, $nC_n$,
will not equal total costs, and the group will show a deficit or surplus.






assumptions? And what if we start from an inefficient configuration:
would the process of individual choice we describe correct or reinforce
such inefficiency?

One incentive tending toward inefficiency is well known, namely, the
“free-rider” potential and the resulting tendency for collectives to underproduce
public goods. We can use figure 6 to illustrate the effect of
systematic underproduction of public goods by old residents upon the
selection-of-jurisdiction process by newcomers. In doing this we also will
illustrate one approach to the problem of the interaction between the
individual choice to join or quit a local group and the collective choice of
public good supply. Suppose that an “ideal” distribution of jurisdictions
is given by the line SS. Jurisdictions of type A would ideally be inhabited
by individuals with demands and would supply Xjt and similarly
for B, C, etc. Suppose all A-type, B-type, etc. residents, in attempting to
exploit the free-rider potential, reduce the public good supplied to

■*«> Therefore, a new potential member, Mr K, sees a set of 
possibilities S'S'. Mr K evidently will choose group A', for in doing so he

This tendency on the part of established members can coexist with rational selection
of benefit-cost packages by newcomers. Here one should emphasize the important distinction
between the collective decision of the membership as to the tax-expenditure package and the individual
decision of any one person as to which group to join. A local collective good, although “public”
to members within the group, is “private” as among groups. Thus, a system of local
collectives is created and sustained by economic forces stemming from two very different
types of decision process. As observed recently by Ellickson (1971), a rigorous theory of
local public goods must incorporate the interaction between these two types of choice.











maximizes his utility. Note that Mr K will (on the assumption of equal
cost shares):

1. Join a smaller group than he “should,” This
reinforces the tendency toward underproduction. (A downward
and leftward shift in SS has the same effect.)

2. In his own interest Mr K will not choose a demand-efficient
jurisdiction; that is, he will reject point This reinforces the
nonoptimal decision of the old residents.

3. Once a suboptimal level of public good is voted, all members
have a cost incentive to expand membership to a point such
as n'lt

4. As this process continues, a jurisdiction becomes “integrated.”
Obviously this static example has the makings of a dynamic
model of determination of public good supply.

Another incentive individuals may have to create a nonoptimal set of
jurisdictions, or to maintain the existence of such a nonoptimal set, will
arise from any opportunity to restrict the number of jurisdictions. If a
new entrant to a local jurisdiction pays the marginal cost he imposes on
his system, established members of collectives may benefit if they restrict
creation of new jurisdictions. For by restricting entry they will force new
members to join and thereby to overcrowd existing jurisdictions; in this
way old residents might turn a profit on their “old” low-cost memberships
(capitalized when they sell out). Established members of established local
governments may desire to develop and exploit such a monopoly position
whenever possible.

We can employ figure 7 to show the implications of such monopolistic
behavior by established residents. Uniform restrictions on freedom of
creating new jurisdictions would tend to produce “overcrowded” groups
as shown by the inefficient supply curve S"S". Clearly, we can repeat for
figure 7 the sort of analysis behind figure 6; this analysis again would
show that static, nonoptimal collective behavior (restrictions on entry) is
reinforced by the individual “dynamic” behavior of the newcomer’s
choice among the inefficient options. Specifically, Mr K in figure 7 will
choose a nonoptimal jurisdiction such as A", rather than K, the (unavailable)
optimum. Note that static “free-rider” behavior shifts the
curve SS downward, while static “monopolistic” behavior shifts the curve
SS to the right. One might think that these two tendencies cancel out, but
that would be wrong, since in fact they reinforce each other. In combination
these two types of nonoptimal behavior tend to shift groups from
optimal jurisdictions like A to nonoptimal outcomes like A* (B to B*, etc.).
Thus, the combination of these two sorts of nonoptimal individualism
just might result in an “efficient” range of jurisdictional sizes and
public-good outputs, but with each and every jurisdiction “inhabited”






by the “wrong” people. This analysis suggests, therefore, that one key to the
maintenance of efficient jurisdictions is a procedure for easily creating new ones.

But is ease of local government formation an unequivocal blessing? I
think not. Here we may borrow a leaf from Edward Chamberlin (1960),
who, in the context of Monopolistic Competition, also pondered the costs of
diversity in the presence of scale economies. Chamberlin showed that
when the group of competing monopolists is small (such that each
member avoids price-cutting competition) and when at the same time
entry to this small group is easy, a stable solution “high up” on each
individual producer’s average-cost curve may result. This notion is
obviously applicable to a theory of optimal local collectives; it suggests
that a lack of price competition among adjacent groups, when combined
with ease of entry, may result in overdiversification and overproliferation of
small, high-cost jurisdictions.

We must close this section of the paper, therefore, with a question. The
question is, How can one promote price competition among local jurisdictions?
The answer is important since it takes us halfway to an efficient
configuration of local governments.

Roger Noll and Thomas Schelling both suggested this analogy between this model
and Chamberlin’s. Evidently a cost and benefit analysis of diversification has widespread
application to economic systems in general, not merely local collectives.

See Chamberlin’s famous diagram of the individual producer’s demand curve
sliding up and down an aggregate market demand curve (1960, pp. 91-92). The solution
I have in mind is the tangency between the aggregate market curve and the single
producer’s average cost curve.


IV. Conclusion 




Just what it is that draws some people together and separates others will
never be entirely captured by a single theory or model. There is no “one
thing” which can account for the totality of group formation patterns.
No claim is made that this paper develops such a universal model. But our
consolidation of the Tiebout-Buchanan analysis does suggest that one
cause may be group formation for the provision of local collective goods and
services. Even if people are entirely devoid of any feelings toward each
other (sympathetic or antipathetic) at a personal level, they may find it in
their interests to set up and join associations for the provision of certain
collective goods and services: goods which cannot readily be supplied
and priced on a variable unit-of-service basis, but which are best provided
communally at the same agreed level to all members of the association,
in such a way that nonmembers are excluded from enjoying them
altogether. As a result the same forces which govern the production and
consumption of things can lead people into isolated homogeneous groups.
In conclusion, we have shown that incentive for segregation may stem
from an impersonal, anonymous, unnoticed economic interaction
among individuals and technology. Such segregation, unlike the forces
which call it forth, may be conspicuous, even blatant, and at the same
time wholly unintended. Ordinarily one might suppose that both kinds
of segregation may be present, our kind and Schelling’s (see n. 1). Where
he dealt with a “pure” case in which all the externalities are personal
and have to do with people’s preferences regarding whom they live
among, we have dealt with the opposite “pure” case in which nobody
cares in the slightest whom he is with, as long as the commodity value of
his “location” yields him the greatest net utility. Furthermore, both
kinds may often reinforce each other, and undoubtedly a good deal of


A precautionary remark should be added further to distinguish our model from a
world of personal, individualized interaction and conflict. First, while the term “local”
ordinarily connotes some physical, geographical proximity, a “local public good” in the
formal structure of our model may be merely subnational. When based on personal tastes
for association, segregation is bound to the ideal of physical propinquity and geographical
location, whereas the theory of segregated group formation and collective-good provision
developed in this paper does not contain an explicit spatial element. In principle our
model may apply to any good or service which in fact is (or could be) supplied uniformly
to a subset of a large (say national) population; the model requires that people make a
choice which identifies them with a particular version (quality or quantity) of the public
good that is available to choose among. It is true that such choice most emphatically does
arise when people decide to live in the same locale. It is also true that the instances of
segregation which concern society most often involve geographically concentrated, self-conscious
groups. Nevertheless, the fact that people are geographically bound plays no
explicit role in our analysis, and choice of locale is merely one instance (the most interesting
from a practical standpoint) of the larger class of decisions by which people identify
themselves with a quantity or quality of public good.





segregation by color, by language and accent, etc., contains elements of
income segregation, of taste segregation, and perhaps then a further
reinforcing segregation by sheer class, ethnic, or cultural identity.


References 

Buchanan, James. “An Economic Theory of Clubs.” Economica 32 (February
1965): 1-14.
---. “Private Goods as Public Goods.” In The Demand and Supply of Public
Goods. Chicago: Rand-McNally, 1968.
Chamberlin, Edward. Theory of Monopolistic Competition. Cambridge, Mass.:
Harvard Univ. Press, 1960.
Ellickson, Bryan. “Jurisdictional Fragmentation and Residential Choice.”
A.E.R. Papers and Proc. 61 (May 1971): 334-39.
McGuire, Martin. “Private Good Clubs and Public Good Clubs: Economic
Model of Group Formation.” Swedish J. Econ. 74 (February 1972): 84-99.
McGuire, Martin, and Aaron, Henry. “Efficiency and Equity in the Optimal
Supply of a Public Good.” Rev. Econ. and Statis. 60 (February 1969): 31-39.
Pauly, Mark. “Optimality, ‘Public Goods,’ and Local Government: A General
Theoretical Analysis.” J.P.E. 78, no. 3 (May/June 1970): 572-85. (a)
---. “Cores and Clubs.” Public Choice 6 (Fall 1970): 53-65. (b)
Rothenberg, Jerome. “Local Decentralization and the Theory of Optimal
Government.” In The Analysis of Public Output, edited by Julius Margolis.
New York: Nat. Bur. Econ. Res., 1970.
Samuelson, Paul. “Pure Theory of Public Expenditure and Taxation.” In Public
Economics. New York: St. Martin’s (for Internat. Econ. Assoc.), 1969.
Schelling, Thomas C. “Models of Segregation.” RM 6014-RC. Santa Monica,
Calif.: RAND Corp., 1969.
Tiebout, Charles M. “A Pure Theory of Local Expenditures.” J.P.E. 64 (October
1956): 416-24.
---. “Economic Theory of Fiscal Decentralization.” In Public Finances:
Needs, Sources and Utilization. Princeton, N.J.: Nat. Bur. Econ. Res., 1961.
Williams, Alan. “The Optimal Provision of Public Goods in a System of Local
Government.” J.P.E. 74, no. 1 (February 1966): 18-33.



Stability and the Quantity Theory 


P. W. Howitt

Northwestern University and University of Western Ontario


This paper presents a method for analyzing the microdynamics of
monetary exchange. The central feature of the analysis is that transactions
take place during the process of price adjustment. The method is
used to demonstrate the stability of a long-run equilibrium in which the
quantity theory is valid. If all goods are gross substitutes and the real
balance effect is operative, then the equilibrium is stable if (i) there is no
rationing, (ii) there is a form of competitive rationing, or (iii) there are
no distribution effects.


The neoclassical long-run quantity theory consists of two basic propositions.
The first is the familiar quantity theorem: that in equilibrium the level
of money prices will be directly proportional to the quantity of money.
The second proposition, without which the quantity theorem would have
little empirical significance, is that this equilibrium is stable.

Patinkin (1956) showed that the neoclassicals had no satisfactory
explanation of the stability proposition. He suggested that such an explanation
could be provided by considering the real balance effect on
excess demands. If there were no distribution effects, and if appropriate
restrictions were placed on the substitutability of commodities, it could
then be shown that a Walrasian tâtonnement would converge on a vector
of money prices proportional to the stock of money. Archibald and Lipsey
(1958) provided a method for extending this analysis to the case of


This paper is a revised version of one chapter of my doctoral dissertation, which was
written with financial support from the Canada Council. I am indebted to Robert
Clower, John Ledyard, Axel Leijonhufvud, and Joseph Ostroy for helpful comments and
discussions.

Patinkin 1956, pp. 342-44. Arrow and Hurwicz (1958) showed that a sufficient
condition is for all goods to be gross substitutes.



distribution effects. If we disturb a long-run equilibrium by, say, doubling
the quantity of money, the temporary equilibrium established by a
tâtonnement might contradict the quantity theorem unless each transactor’s
money balances are exactly doubled. However, these distribution
effects are transitory. Money balances will change hands from one
equilibrium to the next, until in the limit each transactor holds exactly
twice his original money holdings and the money price of each commodity
exactly doubles.

As amended by Archibald and Lipsey, Patinkin’s analysis does provide
an explanation of the stability proposition. It is in fact the only such
explanation in the literature. However, the analysis has the well-known
drawback of Walras’s original formulation of the tâtonnement process:
that no transactions occur until market clearing prices have been established.
To state the issue in familiar language, our understanding of the
quantity theory is still impaired by a dichotomy. There are two basic
forces that act to propel the economy from one long-run equilibrium to
the next. First, there is the force of competitive bidding that makes prices
adjust to their market clearing values. Second, there is the force of ongoing
transactions that makes each transactor’s money holding tend to its
stationary level. While it is commonly acknowledged that the quantity
theory is meant to apply to a world where these forces act simultaneously,
the most complete explanation available of the quantity theory requires
that we consider them separately: while the one is operating the other
must be suspended.

Clower (1965) and Leijonhufvud (1968) have demonstrated that the
basis of Keynes’s attack on orthodox monetary theory was the contention
that this dichotomy was invalid. They show how false trading gives rise
to the multiplier process that can make the economy move away from its
full-employment equilibrium, even with flexible wages and prices. If the
quantity theory is to be maintained in the face of the Keynesian challenge,
or if we are to have a monetary theory that can in fact reconcile Keynesian
economics with the quantity theory, we need some method for presenting
the stability proposition that does not rely on the dichotomy of price
adjustment and transactions. The purpose of this paper is to present such
a method.

In order to do away with this dichotomy we must reexamine the microfoundations
of the quantity theory. This is because, first, there is nothing

Archibald and Lipsey’s analysis was the subject of the symposium by Baumol et al.
(1960). Their model was extended by the exchange between Hahn (1962) and Clower
(1963) to include a richer description of intertemporal choice.

Grossman’s (1971) interesting analysis of price dynamics in a Patinkinesque model
with false trading fails to do away with the dichotomy because prices do not change until
quantities have all adjusted to a “fix-price” equilibrium; the effect is to retain the
dichotomy by reversing the relative speeds of price adjustment and transactions. The
same is true of Frevert (1970).




in accepted general equilibrium theory to tell us how transactions occur
even in equilibrium, much less when there are nonzero excess demands.
Because the time path of prices is affected by the false trading, it becomes
important to determine what transactions take place. Second, a meaningful
monetary theory must specify a money commodity that changes hands
with each transaction; this is not done by accepted general equilibrium
theory. Third, the excess-demand functions of general equilibrium theory
are usually derived under the implicit assumption that all planned trades
will in fact be made, an assumption which appears to be inconsistent with
the existence of false trading.

To deal with these issues we shall describe a highly simplified exchange
system organized by a group of shopkeepers. One of the consequences of
this approach is that we are able to handle these problems, which arise in
Patinkin’s Walrasian approach, with a model that is formally very similar
to Patinkin’s. Indeed the long-run equilibria of the two models are
identical. This allows us to demonstrate the stability proposition using
many of the tools developed for the Walrasian system.


I. The Exchange Environment 

In order to keep track of actual transactions, our description of the shopkeeper
system will be very explicit about how these transactions occur.
There are in the economy B transactors and C + 1 commodities. For
reasons that will soon be apparent, the (C + 1)st commodity is called
“money.” Each transactor b begins each period with the endowment
$W_c^b \ge 0$ of commodity c, c = 1, ..., C, and an endowment $M_b \ge 0$ of
money. The first C commodities are nondurable goods that disappear
after each period. These endowments reappear at the beginning of each
period in unchanging quantities. There are C shops in the economy. In
shop c the only exchanges that may be made are trades of money and
commodity c for each other. Each shop is presided over by a separate
shopkeeper whose job it is to set the price for his commodity and to
facilitate the exchange of that commodity from seller to buyer by acting
as middleman.

Trading proceeds once a period in each shop. During the “morning,”
shopkeeper c posts a money price, $p_c$, that will prevail for all exchanges on
that day. Later in the day each transactor goes to all the shops where he

This point is made very forcibly by Ostroy (1973).

See Edgeworth (1881, pp. 22-30) and Kaldor (1934).

See Clower 1967.

See Clower 1965, p. 117.

Even if money were not the only durable commodity it would still have the distinguishing
feature of being the unique medium of exchange (see below). However, the
nondurable nature of nonmoney commodities means that they will not be demanded for
speculative purposes.




wishes to do business, placing an order to buy or sell a specific quantity
of each commodity. We denote by $X_c^b$ the bth transactor’s order for
commodity c.

In order to make money a well-defined medium of exchange we
suppose that the shopkeepers require all purchase orders to be accompanied
by the total money payment. This requires that

$$\sum_{c \in P(b)} p_c X_c^b \le M_b \quad \text{for all } b, \tag{1}$$

where

$$P(b) = \{c \mid X_c^b \ge 0\}.$$

For a sell order the transactor must deposit with the shopkeeper the entire
quantity offered for sale. This requires that

$$X_c^b \ge -W_c^b \quad \text{for all } b \text{ and } c. \tag{2}$$

These conditions ensure that the transactor can only offer to trade commodities
that are available at the beginning of the period. This rules out
forward transactions and eliminates the possibility of bankruptcy.
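
Constraints (1) and (2) can be illustrated with a minimal sketch of how a shopkeeper might screen one transactor's orders. The function name `orders_feasible` and the list-based layout are assumptions made for illustration; they are not part of the paper's notation.

```python
from typing import Sequence


def orders_feasible(
    orders: Sequence[float],
    prices: Sequence[float],
    endowment: Sequence[float],
    money: float,
) -> bool:
    """Return True if one transactor's orders satisfy constraints (1) and (2).

    orders[c] > 0 is a purchase order and orders[c] < 0 a sell order for
    commodity c (hypothetical layout, one entry per nonmoney commodity).
    """
    # Constraint (1): payment deposited with purchase orders cannot exceed money held.
    purchase_cost = sum(p * x for p, x in zip(prices, orders) if x >= 0)
    # Constraint (2): the quantity deposited for sale cannot exceed the endowment.
    sales_covered = all(x >= -w for x, w in zip(orders, endowment))
    return purchase_cost <= money and sales_covered
```

Note that expected sales receipts do not relax the purchase constraint in this sketch; only money already held counts, which is the sense in which money alone serves as the medium of exchange.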

The sum of all orders for commodity c,

$$X_c = \sum_{b=1}^{B} X_c^b,$$

is called the “excess demand” for commodity c.
Once all orders are placed the shopkeepers have two tasks. First they
convert orders into transactions. If commodity c has a zero excess demand,
we assume all orders for commodity c are executed. If excess demand is
not zero, the shopkeeper has some rule to determine what transactions
occur. The quantity of commodity c the shopkeeper for that commodity
allows transactor b to exchange is $T_c^b$, which is positive (negative) for a
purchase (sale). The shopkeeper’s transaction rules make $T_c^b$ a function
of all orders for commodity c and the vector of money holdings. That
is, $T_c^b = T_c^b(X_c^1, \ldots, X_c^B; M)$ for all b and c, where $M = (M_1, \ldots, M_B)$.

We assume that for all b and c, $T_c^b$ is a continuous function of all arguments,
satisfying a Lipschitz condition on every compact set, and

$$T_c^b = X_c^b \quad \text{if } X_c = 0,$$

$$|T_c^b| \le |X_c^b|,$$

and

$$T_c^b X_c^b \ge 0.$$

This condition is similar to Clower’s (1967) “dichotomized budget constraint.”

The possible complications of bankruptcy have been stressed by Hahn (1965).

This rules out the sort of phenomenon that Phelps (1970) described with job
vacancies and involuntarily unemployed workers at the same time.

The reason for including money balances will become apparent in Section IV; in
short, it allows us to include the competitive-rationing restrictions (see below).




The first of these conditions has already been explained. The others say
that no orders will be overexecuted and no purchase (sale) order will be
converted into a sale (purchase). It follows from the last two that if no
order is placed, no transaction will occur.
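
One transaction rule that meets the three conditions just described is proportional rationing of the long side of the market. The sketch below is only an example of such a rule, not the rule the paper assumes: the name `proportional_rationing` is hypothetical, the rule ignores the money-holdings argument M that the paper allows, and the Lipschitz requirement is not checked here.

```python
from typing import List, Sequence


def proportional_rationing(orders: Sequence[float]) -> List[float]:
    """Convert all transactors' orders for one commodity into transactions.

    The short side of the market is executed in full; orders on the long
    side are scaled down by a common factor so that purchases equal sales.
    """
    demand = sum(x for x in orders if x > 0)
    supply = -sum(x for x in orders if x < 0)
    if demand > supply:  # positive excess demand: buyers are rationed
        scale = supply / demand
        return [x * scale if x > 0 else x for x in orders]
    if supply > demand:  # negative excess demand: sellers are rationed
        scale = demand / supply
        return [x * scale if x < 0 else x for x in orders]
    return list(orders)  # zero excess demand: every order is executed
```

Because the common scale factor lies between 0 and 1, no order is overexecuted and no purchase order becomes a sale, and a zero order always yields a zero transaction.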

In the “afternoon” the shopkeepers allow these transactions to occur
and return the deposits corresponding to unexecuted orders. Thus the
transactors’ money balances for the beginning of the next period are
given by the difference equations

$$\Delta M_b = -\sum_{c=1}^{C} p_c T_c^b \quad \text{for all } b. \tag{3}$$
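
The difference equation (3) amounts to the following bookkeeping step, shown here as a small sketch. The function name `update_money` and the nested-list layout `transactions[b][c]` are assumptions for illustration.

```python
from typing import List, Sequence


def update_money(
    money: Sequence[float],
    prices: Sequence[float],
    transactions: Sequence[Sequence[float]],
) -> List[float]:
    """Apply equation (3): each transactor pays for purchases and is paid for sales.

    transactions[b][c] is the executed quantity of commodity c for transactor b,
    positive for a purchase and negative for a sale.
    """
    return [
        m - sum(p * t for p, t in zip(prices, row))
        for m, row in zip(money, transactions)
    ]
```

When executed purchases and sales of each commodity balance across transactors, the update only redistributes money and leaves the total stock unchanged.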

The shopkeeper’s second task is to determine next period’s price. We
assume he follows the usual adjustment rule of raising (lowering) his
price when there is a positive (negative) excess demand. That is,

$$\Delta p_c = H_c \quad \text{for all } c, \tag{4}$$

where

$$H_c \gtreqless 0 \quad \text{as} \quad X_c \gtreqless 0.$$
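
A minimal sketch of a price rule with the sign property in (4) follows. The linear form and the `speed` parameter are assumptions; the paper requires only that the adjustment have the same sign as excess demand, and this simple version does nothing to keep prices positive.

```python
from typing import List, Sequence


def adjust_prices(
    prices: Sequence[float],
    excess_demands: Sequence[float],
    speed: float = 0.1,
) -> List[float]:
    """Raise (lower) each price in proportion to positive (negative) excess demand."""
    return [p + speed * x for p, x in zip(prices, excess_demands)]
```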

These tasks of the shopkeepers are not meant to be literal accounts of all
exchange and bargaining activities in a monetary economy. Instead they
are intended to personify the “forces of competition.” In this respect the
shopkeepers are much like the Walrasian auctioneer. When $X_c$ is positive
there will be some buyers who, in the absence of the shopkeeper, would
be unable to find a supplier of commodity c. In a Walrasian tâtonnement
all trading would be suspended, some frustrated buyers would suggest
that trading resume with a higher $p_c$, and orders would again be submitted
on the basis of the new price. The auctioneer is supposed to embody
these forces acting on prices. In the environment outlined above, the
bargaining cannot be suspended in the middle of a period. Excess demands
become apparent to the transactors only after trade has taken place; only
then can frustrated buyers and sellers make known their willingness to
trade at different prices. We assume that the shopkeepers have been
provided with some incentive to reflect these competitive forces in their
pricing decisions.


II. Excess-Demand Functions 

Within this shopkeeper system we can derive excess-demand functions
similar to Patinkin’s. However, the possibility of false trading means these
functions cannot be derived in the usual way because the typical transactor’s
transactions depend not only upon his orders but also upon how
the shopkeepers convert orders into transactions. Although shopkeepers

The Δ represents a first difference. Time subscripts are omitted here.




have deterministic transaction rules, the relationship between a transactor’s
orders and transactions will appear to him as being stochastic
because he does not have all the information available to the shopkeepers.

One implication of this false trading is that the transactor’s current
sales income can have a direct effect on his demand orders, which cannot
occur in the general equilibrium theory of Patinkin with no false trading.
With no false trading, a change in one transactor’s demands can
only affect another transactor’s demands if it somehow results in a price
change. With false trading, a reduction in some transactor’s demand for
one commodity can cause sellers of that commodity to reduce their
demands for other commodities, even at unchanged prices, because their
current sales income will have fallen. This intermarket connection that
arises from false trading has been interpreted by Clower (1965) and
others as the theoretical essence of the Keynesian consumption function.

Another implication of false trading is that the typical transactor will be
especially concerned with the state of his money balances. Money provides
him with a form of insurance against the possibility that sell orders might
not all be executed in the future, which would force a reduction in
purchase orders if sufficient money balances were not held. Even if some
commodities other than money were durable, money would still be
especially important to the transactor because of the constraint (1) that
assigns the role of medium of exchange to only the one commodity,
money. Because money is the only medium of exchange it is also the only
commodity that can provide that particular form of insurance. Even if
current sales income were to have no direct effect on excess demands this
special role of money would provide the same sort of intermarket connection
as explained above. An exogenous decrease in demand for one
commodity will reduce the money holdings of sellers of that commodity,
providing them with an incentive to reduce their demands for other
commodities even with no change in prices.

To be perfectly explicit about all this we would characterize the transactor’s
decision problem as the maximization of the expected value of an
intertemporal utility function, where actual transactions depended upon
orders placed but with random error terms to account for the shopkeepers’
activities. Money balances would not enter this utility function
directly but would be involved in the constraint (1), which would have to
hold in every time period. Money would become the “state variable” of
the decision problem. To include the direct effect of current sales income

This point is made very forcibly by Clower (1965, pp. 111-12).

Barro and Grossman 1971, and Leijonhufvud 1968.

I have shown elsewhere (1973) how this intermarket connection can produce, in a
model like the present one, the sort of Keynesian short-run behavior usually associated
with current income having an important effect on excess demands in the form of the
consumption function.




we could assume that the transactor revised his estimates of the probability
distribution of the error term every time a new observation was generated
by the shopkeepers. If a reduction in demand for a commodity he was
selling caused a decrease in his sales without an initial change in his sell
order, he would form more pessimistic expectations about how the
shopkeeper for that commodity was rationing him. This would cause him
to reduce his demands elsewhere because of a reduction in his future sales
prospects.

Rather than describe the intertemporal decision problem explicitly we
proceed in the following way. We make the simplifying assumption that
the transactor has no idea how the future relationship between orders and
transactions will be affected by variables about which he has any information,
including the current decisions of the shopkeepers. In this case
the subjective probability distribution of error terms is an unchanging
one, and there is no longer any direct effect of current sales income. To
incorporate the direct effect of money balances, we characterize the
decision process as a single period choice, where the choice variables are
this period’s orders and the money balances the transactor would have at
the end of the period if all orders were executed. The bth transactor’s
objective function is $U^b(X^b, M_b + m_b, p)$, where $X^b = (X_1^b, \ldots, X_C^b)$,
$p = (p_1, \ldots, p_C)$, and $m_b$, the intended change of $M_b$ over the period, is

$$m_b = -\sum_{c=1}^{C} p_c X_c^b. \tag{5}$$

Prices enter into this objective function because the usefulness of money
depends upon its future purchasing power. We make the usual homogeneity
assumption with respect to all nominal variables, namely, that
$U^b(X^b, M_b + m_b, p) = U^b(X^b, \lambda M_b + \lambda m_b, \lambda p)$ for all $\lambda > 0$. This is
equivalent to saying that real instead of just nominal balances are considered
important. The most convenient rationalization is to assume that
all future price expectations are unit elastic with respect to current prices.
Therefore if p, $m_b$, and $M_b$ were all to double, all future price expectations
would double, and the future purchasing power of the transactor’s end of
period money balances would be unaffected.


This way of accounting for the direct effect of sales income differs from Clower’s
(1965) “dual decision hypothesis” used by Barro and Grossman (1971) and Grossman
(1971). The difference is that for these writers sales income during the current period is
known before demand orders are placed. This is equivalent to making expected income
always equal to current income, implying a much closer relationship between income and
consumption than implied by modern consumption theories. This point has been made
by Leijonhufvud (1973). The present analysis would allow for a “looser” dependence of
consumption on income.

See Samuelson 1947, p. 119.

See Hicks 1946, p. 205.





The transactor’s decision problem is to choose $X^b$ and $m_b$ so as to
maximize $U^b$ subject to (1), (2), and (5). The solution to this decision
problem implicitly defines the transactor’s net demand functions:

$$X^b = X^b(p, M_b) = (X_1^b(p, M_b), \ldots, X_C^b(p, M_b)),$$

$$m_b = m_b(p, M_b) = -\sum_{c=1}^{C} p_c X_c^b(p, M_b).$$

We assume $X^b$ and $m_b$ possess continuous first-order partial derivatives.
The net demands may be summed to form the excess-demand function

$$X_c(p, M) = \sum_{b=1}^{B} X_c^b(p, M_b).$$

This is the same formal model of the transactor’s decision process as
Archibald and Lipsey’s (1958) except for the constraint (1). This constraint
makes the model very similar to Clower’s (1967). Including the
constraint allows us to avoid the anomalous result found by Clower
(1967) and Hahn (1965) in Patinkin’s general equilibrium theory: that
the economic system could proceed as usual even without the medium of
exchange because there is no peculiar characteristic of money that would
require any transactor to hold positive quantities of it in equilibrium.
This anomalous result is avoided with the constraint (1) because there is
now a compelling reason to hold money: a transactor needs to have some
at the end of this period in order to submit purchase orders at the beginning
of the next period. Although we have derived this demand for
money partly from putting money into the utility function, this is not
subject to the usual criticism that it begs the question of why transactors
hold money: they hold it in order to place purchase orders.

III. The Dynamical System

We may complete the description of price adjustment given in (4) by
assuming that there exist continuously differentiable functions $H_c(p, M)$,
for all c, such that $\Delta p_c = H_c(p, M)$. If we approximate the time paths
generated by (3) and (4) by their continuous time analogs, we get the
following system:

$$\dot{p}_c = H_c(p, M), \quad c = 1, \ldots, C,$$

$$\dot{M}_b = -\sum_{c=1}^{C} p_c T_c^b[X_c^1(p, M_1), \ldots, X_c^B(p, M_B); M], \quad b = 1, \ldots, B.$$

Clower's excess demand functions have been analyzed by Lloyd (1971) and Clower
(1971).

The "dots" refer to time derivatives.




We refer to this system as Q. An equilibrium to Q is a vector $(p, M) > 0$
such that the right-hand side of Q equals zero. We assume that at least
one such equilibrium exists. In equilibrium excess demands are all zero
and each transactor's money holding is stationary. This is the long-run
equilibrium analyzed by Archibald and Lipsey (1958).

The two stages of the Patinkin, Archibald-Lipsey process are merged
into one by Q. The tâtonnement stage could be represented by holding
M constant and allowing prices to adjust. If this were stable it would
establish a temporary equilibrium p with $X_c(p, M) = 0$ for all c. The
second stage could be represented by assuming the tâtonnement to take
place instantaneously, so we are always in a temporary equilibrium. The
adjustment of money balances would then be given by $\dot{M}_b = m_b(p, M_b)$,
because there would be no rationing in the temporary equilibrium.
The system Q satisfies the following two properties:

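1. Walras' law:

$$\sum_{c=1}^{C} p_c X_c(p, M) + \sum_{b=1}^{B} m_b(p, M_b) = 0;$$

2. Homogeneity of net demands: For any $\lambda > 0$,

$$X_c^b(p, M_b) = X_c^b(\lambda p, \lambda M_b) \quad \text{and} \quad m_b(p, M_b) = \frac{1}{\lambda}\, m_b(\lambda p, \lambda M_b).$$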

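In proving the stability proposition we make the following two
assumptions:

1. Gross substitutability:

$$\frac{\partial}{\partial p_k} X_c^b(p, M_b) > 0 \quad \text{and} \quad \frac{\partial}{\partial p_k} m_b(p, M_b) > 0, \quad \text{for all } k, b, c \ (c \neq k);$$

2. Real balance effect:

$$\frac{\partial}{\partial M_b} X_c^b(p, M_b) > 0 \quad \text{for all } b, c.$$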

The system Q together with gross substitutability and the real balance effect
is referred to as Q*. There is a very striking resemblance between Q and
the Walrasian system. Some of the results which have been developed in
the Walrasian framework carry over directly to Q, and we make use of
this strong analogy in proving the quantity theorem and the stability
proposition.


We use the following notation throughout: for any n-dimensional vectors $x = (x_1, \ldots, x_n)$
and $x' = (x'_1, \ldots, x'_n)$, $x > x'$ means $x_i > x'_i$ for all i; $x \geqq x'$ means $x_i \geq x'_i$
for all i; and $x \geq x'$ means $x \geqq x'$ and $x \neq x'$.




Before proving the quantity theorem we state a result very similar to
a lemma proved by Arrow, Block, and Hurwicz (1959, p. 88).

ABH lemma. Given gross substitutability and the real balance effect, the
relations $0 < (p', M') \leq (p'', M'')$; $p'_c = p''_c$ for $c \in R \subset \{1, \ldots, C\}$;
$M'_b = M''_b$ for $b \in S \subset \{1, \ldots, B\}$; imply that (a) $X_c(p', M') < X_c(p'', M'')$
for all $c \in R$, and (b) $m_b(p', M'_b) < m_b(p'', M''_b)$ for all
$b \in S$ if and only if $p' \neq p''$.

We may now prove a strong version of the quantity theorem.
Quantity theorem. Given any value $M > 0$ of the money supply, there
is exactly one equilibrium to Q* for which

$$\sum_{b=1}^{B} M_b = M.$$

Given two values of the money supply, M and M', if (p, M) and (p', M')
are the corresponding equilibria, then $(p', M') = (\beta p, \beta M)$, where
$\beta = M'/M$.

Proof. To prove uniqueness for a given M, suppose $(p, M)$ is an
equilibrium to Q* such that

$$\sum_{b=1}^{B} M_b = M.$$

Take any other vector $(\bar{p}, \bar{M})$ such that $(\bar{p}, \bar{M}) \neq (p, M)$, and such that

$$\sum_{b=1}^{B} \bar{M}_b = M.$$

We show that $(\bar{p}, \bar{M})$ cannot be an equilibrium. Define
$\lambda = \min\{\bar{p}_1/p_1, \ldots, \bar{p}_C/p_C, \bar{M}_1/M_1, \ldots, \bar{M}_B/M_B\}$. Since

$$\sum_{b=1}^{B} \bar{M}_b = \sum_{b=1}^{B} M_b,$$

it cannot be that $(\bar{p}, \bar{M}) = (\lambda p, \lambda M)$, for this would imply that $(\bar{p}, \bar{M}) = (p, M)$.
Therefore, $(\lambda p, \lambda M) \leq (\bar{p}, \bar{M})$. There are two cases to consider.

a) If there is some c such that $\lambda = \bar{p}_c/p_c$, then by part (a) of the ABH
lemma and by homogeneity, $0 = X_c(p, M) = X_c(\lambda p, \lambda M) < X_c(\bar{p}, \bar{M})$.
Therefore $(\bar{p}, \bar{M})$ is not an equilibrium.

b) If there is no such c, then $\lambda p \neq \bar{p}$, and there is some b such that
$\lambda = \bar{M}_b/M_b$. Therefore, by part (b) of the ABH lemma and by homogeneity,
$0 = \lambda m_b(p, M_b) = m_b(\lambda p, \lambda M_b) < m_b(\bar{p}, \bar{M}_b)$. Therefore, $(\bar{p}, \bar{M})$
is not an equilibrium.

This proof is similar to the proof of the analogous uniqueness theorem of Arrow
et al. (1959, pp. 89-90).




The rest of the proof follows directly from the homogeneity of net
demands. If $(p, M)$ is an equilibrium to Q* such that

$$\sum_{b=1}^{B} M_b = M,$$

and if $\beta = M'/M$, then by homogeneity $X_c(\beta p, \beta M) = X_c(p, M) = 0$
for all c, and $m_b(\beta p, \beta M_b) = \beta m_b(p, M_b) = 0$ for all b. Therefore
$(\beta p, \beta M)$ is an equilibrium to Q* and

$$\sum_{b=1}^{B} \beta M_b = M'.$$

It follows from the previous paragraph that $(\beta p, \beta M)$ is the only such
equilibrium for M'.

This is the quantity theorem in its fullest sense. If, starting from a
long-run equilibrium, the stock of money is doubled in the economy, then,
regardless of how this money is initially distributed, the economy will not
come to rest again until all prices have exactly doubled and every
transactor's money holdings have exactly doubled. All that is missing from
the quantity theory as stated in the introduction to this study is a proof
that the economy would indeed come to rest if it were disturbed in this or
in any other way.


IV. The Stability Proposition 

We interpret the stability proposition to mean that Q is asymptotically
stable. That is, for any initial position $(p^0, M^0) > 0$, every solution
$(p(t; p^0, M^0), M(t; p^0, M^0))$ to Q starting at $(p^0, M^0)$ converges to an
equilibrium as $t \to \infty$. We assume that for any initial position a unique
solution exists to Q, which is continuous with respect to the initial position
and which is bounded. We can demonstrate that Q* is asymptotically
stable under three different sets of restrictions, as in the following three
theorems.

For the first two theorems we shall use the result, due to Uzawa (1961,
p. 619), that Q* is quasi-stable if there exists a continuous function,

We have been unable to find in the literature a proof that the long-run quantity
theory equilibrium of stocks and prices is unique. Patinkin (1965, p. 45) assumed the
uniqueness of equilibrium in order to derive the quantity theorem. The same is true of
Archibald and Lipsey (1958). Arrow et al. (1959, pp. 89-90) proved the uniqueness of
prices for given stock holdings. Hadar (1965) proved the uniqueness of stock holdings
for given prices.

I have elsewhere (1973) shown that the system Q* satisfies all three assumptions
under the premises of theorems 1-3 below.

The system Q* is defined to be quasi-stable if for any initial position $(p^0, M^0) > 0$
any solution to Q* is bounded and every limit point of the solution is an equilibrium.




$V(p, M)$, called a "modified Lyapunov function," defined on the set of positive vectors $(p, M)$,
such that for every $(p^0, M^0) > 0$ the function $v(t) = V[p(t; p^0, M^0), M(t; p^0, M^0)]$
is a strictly decreasing function with respect to t unless
$(p(t; p^0, M^0), M(t; p^0, M^0))$ is an equilibrium to Q*.

First, suppose the shopkeepers can maintain stocks of the commodities
and do not ration the transactors. All excess demands are absorbed by the
shopkeepers' inventories. In this case we can use the modified Lyapunov
function $V(p, M) = \max J(p, M) - \min J(p, M)$, where $J(p, M) = \{p_1/\bar{p}_1, \ldots, p_C/\bar{p}_C, M_1/\bar{M}_1, \ldots, M_B/\bar{M}_B\}$, and $(\bar{p}, \bar{M})$ is an equilibrium
to Q. An intuitive sketch of this stability proof is helpful. If $\min J(p, M) = p_c/\bar{p}_c$, then $p_c$ is so low that $X_c > 0$ and therefore $\dot{p}_c > 0$. If $M_b/\bar{M}_b = \min J(p, M)$, then $M_b$ is so low that $m_b > 0$. Because there is no rationing,
the bth transactor's attempt to increase his money holding will be
successful, and $\dot{M}_b > 0$. Similarly, the maximum of the ratios will be
decreasing.
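
The modified Lyapunov function just described is the spread between the largest and smallest ratio of a current price or money holding to its equilibrium value. The following is a minimal sketch of that calculation only. The function name `ratio_spread` and all numerical values are illustrative assumptions and do not come from the paper.

```python
def ratio_spread(p, M, p_bar, M_bar):
    """Return max J - min J, where J collects p_c / p_bar_c and M_b / M_bar_b."""
    ratios = [pc / pbc for pc, pbc in zip(p, p_bar)]
    ratios += [mb / mbb for mb, mbb in zip(M, M_bar)]
    return max(ratios) - min(ratios)


# Hypothetical equilibrium with two commodities and two transactors.
p_bar, M_bar = [2.0, 4.0], [10.0, 30.0]

# Every price and money holding scaled by the same factor (1.5):
# all ratios coincide, so the spread is zero.
spread_proportional = ratio_spread([3.0, 6.0], [15.0, 45.0], p_bar, M_bar)

# Only some components scaled: the ratios differ, so the spread is positive.
spread_other = ratio_spread([3.0, 4.0], [15.0, 30.0], p_bar, M_bar)
```

The spread vanishes exactly when the current vector is a scalar multiple of the equilibrium vector, which by homogeneity is itself an equilibrium.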

Theorem 1. If $T_c^b = X_c^b$, Q* is asymptotically stable.

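Proof. In this case $\dot{M}_b(t) = m_b[p(t), M_b(t)]$ for all b. We define

$$\bar{v}(t) = \max J[p(t), M(t)],$$

$$\underline{v}(t) = \min J[p(t), M(t)],$$

$$v(t) = \bar{v}(t) - \underline{v}(t),$$

$$\omega(t) = \limsup_{d \to 0}\, [v(t + d) - v(t)]/d,$$

$$\bar{\omega}(t) = \limsup_{d \to 0}\, [\bar{v}(t + d) - \bar{v}(t)]/d,$$

$$\underline{\omega}(t) = \liminf_{d \to 0}\, [\underline{v}(t + d) - \underline{v}(t)]/d.$$

$V(p, M)$ is continuous on its domain. Take any t such that $(p(t), M(t))$ is not
an equilibrium. To prove $v(t)$ is strictly decreasing we show that
$\omega(t) < 0$. It can be shown that

(i) $\omega(t) \leq \bar{\omega}(t) - \underline{\omega}(t)$.

First consider $\underline{\omega}(t)$. It can be shown that (a) there is some c such that
$\underline{v}(t) = p_c(t)/\bar{p}_c$ and $\underline{\omega}(t) = \dot{p}_c(t)/\bar{p}_c$, or (b) there is some b such that
$\underline{v}(t) = M_b(t)/\bar{M}_b$ and $\underline{\omega}(t) = \dot{M}_b(t)/\bar{M}_b$. If (a) then, defining $\lambda = \underline{v}(t)$,
$(p(t), M(t)) \geq (\lambda\bar{p}, \lambda\bar{M})$ and $p_c(t) = \lambda\bar{p}_c$. Therefore, by homogeneity and
the ABH lemma, $X_c[p(t), M(t)] > X_c(\lambda\bar{p}, \lambda\bar{M}) = 0$; therefore $\underline{\omega}(t) > 0$.
If (b) and $p(t) \neq \lambda\bar{p}$, then $m_b[p(t), M_b(t)] > m_b(\lambda\bar{p}, \lambda\bar{M}_b) = 0$; therefore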

For any integer n, the domain is the set of all n-dimensional vectors x such that x > 0.

The arguments $(p^0, M^0)$ are omitted whenever no ambiguity results.

Arrow et al. 1959, pp. 96-97.




$\underline{\omega}(t) > 0$. If (b) and $p(t) = \lambda\bar{p}$, then $m_b[p(t), M_b(t)] = m_b(\lambda\bar{p}, \lambda\bar{M}_b) = 0$,
and $\underline{\omega}(t) = 0$. It follows that

(ii) $\underline{\omega}(t) \geq 0$ with equality only if $\underline{v}(t)\,\bar{p} = p(t)$.

By an analogous argument,

(iii) $\bar{\omega}(t) \leq 0$ with equality only if $\bar{v}(t)\,\bar{p} = p(t)$.

The equality cannot hold in both (ii) and (iii) because this would imply
$\underline{v}(t) = \bar{v}(t)$, in which case $(p(t), M(t))$ would be an equilibrium. Therefore,
combining (i), (ii), and (iii), $\omega(t) < 0$, and hence Q* is quasi-stable.
It can also be shown that $\bar{v}(t) - \underline{v}(t) \to 0$ as $t \to \infty$, so that
$(p(t), M(t)) \to (\alpha\bar{p}, \alpha\bar{M})$ for some $\alpha > 0$. Therefore Q* is asymptotically
stable.

Next, suppose that at least some rationing does take place out of
equilibrium. We can demonstrate the stability proposition again, provided
the transaction rules satisfy a restriction that we shall now describe. It is
clear from the proof of theorem 1 that Q* will also be stable if, for all b,
$T_c^b = X_c^b$ for all $X_c^b \leq 0$ whenever $M_b/\bar{M}_b \leq M_k/\bar{M}_k$ for all k, and
$T_c^b = X_c^b$ for all $X_c^b \geq 0$ whenever $M_b/\bar{M}_b \geq M_k/\bar{M}_k$ for all k. This
restriction is referred to as "competitive rationing." Given competitive
rationing then, referring to our intuitive sketch of theorem 1, if $M_b/\bar{M}_b = \min J(p, M)$,
then transactor b will not be rationed on sell orders and
$\dot{M}_b \geq m_b > 0$. Similarly, if $M_b/\bar{M}_b = \max J(p, M)$, then $\dot{M}_b \leq m_b < 0$.

We call this restriction competitive for two reasons. First, it says that
the shopkeepers must assign the most favorable queueing position to those
who have the greatest incentive to be first in line, in the sense that a
transactor with low money balances will want to sell more than the same
transactor with high money balances. Second, we generally think of the
forces of competition underlying the price adjustments as being the bids
and offers made by those who have been unsuccessful in trading at the
previous prices. Their competitive bidding should have the effect not only
of changing prices but also of allowing those who have been rationed
most in the past to be rationed least in the present, because unless there
are barriers to entry, the unsuccessful seller has only to underbid his
competitors in order to sell his goods. If the transactor with the
"lowest" money balances is also the one whose sell orders have been most
heavily rationed, the competitive rationing restriction can be interpreted

Uzawa 1961, p. 624.

This phenomenon would be more easily formalized in a model where each transactor
is allowed to set his own bid or asked price, which in turn would determine his share of
the rationing. This is the approach of Bushaw and Clower (1957, pp. 176-90) and

P “SUCT 




as reflecting his successful competitive bidding. Thus we have the second
version of the stability proposition.

Theorem 2. Given competitive rationing, Q* is asymptotically stable.

For the final version, suppose the shopkeepers hold no inventories at all,
in which case

$$\sum_{b=1}^{B} T_c^b = 0$$

for all c. With no inventories we cannot be sure that there exists a set of
transaction rules satisfying competitive rationing. Even the transactor
who is first in line might have to be rationed if, say, his net demand is
negative and exceeds in absolute value the sum of all positive demands for
that commodity. Therefore the stability proposition cannot be demonstrated
using the same method as in theorems 1 and 2. However, it can be
demonstrated for the case of no distribution effects, that is, $X_c(p, M) = X_c(p, M')$
for all p, M, and M' such that

$$\sum_{b=1}^{B} M_b = \sum_{b=1}^{B} M'_b.$$

Theorem 3. Given no distribution effects, if

$$\sum_{b=1}^{B} T_c^b = 0$$

for all c, then Q* is asymptotically stable.
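Proof. Take any $(p^0, M^0) > 0$. Define

$$M \equiv \sum_{b=1}^{B} M_b^0.$$

Let $(\bar{p}, \bar{M})$ be the unique equilibrium such that

$$\sum_{b=1}^{B} \bar{M}_b = M.$$

Because

$$\sum_{b=1}^{B} T_c^b = 0$$

for all c, we have

$$\sum_{b=1}^{B} \dot{M}_b(t) = 0.$$

Therefore

$$\sum_{b=1}^{B} M_b(t) = M$$

for all $t \geq 0$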




and, because of no distribution effects, $X_c[p(t), M(t)] = X_c[p(t), \bar{M}]$ for
all c. Suppose $(p(t), \bar{M})$ is not an equilibrium. Then by the ABH lemma,
if $p_c(t)/\bar{p}_c = \min_k \{p_k(t)/\bar{p}_k\}$, then $X_c[p(t), M(t)] = X_c[p(t), \bar{M}] > X_c(\bar{p}, \bar{M}) = 0$,
and $\dot{p}_c(t) > 0$. Similarly, if $p_c(t)/\bar{p}_c = \max_k \{p_k(t)/\bar{p}_k\}$,
then $\dot{p}_c(t) < 0$. It follows that (i) $p(t) \to \bar{p}$ as $t \to \infty$. Because of (i) and
the continuity of $X_c^b$, it follows that for all c, $X_c[p(t), M(t)] = X_c[p(t), \bar{M}] \to X_c(\bar{p}, \bar{M}) = 0$
as $t \to \infty$. Therefore, because $T_c^b$ and $X_c^b$
are continuous, and because $T_c^b = X_c^b$ whenever $X_c = 0$, for all
b, $\dot{M}_b(t) \to m_b[p(t), M_b(t)]$ as $t \to \infty$. It follows from (i) and the
continuity of $m_b$ that for all b, $m_b[p(t), M_b(t)] \to m_b[\bar{p}, M_b(t)]$ as $t \to \infty$.

Therefore the motion of $M_b$ for any b is given by the nonautonomous
system $\dot{M}_b(t) = m_b[\bar{p}, M_b(t)] + h_b(t)$, where $h_b(t) \to 0$ as $t \to \infty$. The
corresponding autonomous system, $\dot{M}_b = m_b[\bar{p}, M_b]$, is asymptotically
stable because, from the real balance effect, $(d/dt)(M_b - \bar{M}_b)^2 = 2(M_b - \bar{M}_b)\, m_b(\bar{p}, M_b) < 0$.
Therefore the nonautonomous system is
also stable and (ii) $M_b(t) \to \bar{M}_b$ for all b as $t \to \infty$. The conclusion
follows from (i) and (ii).


V. Relationship to the Nontatonnement Literature 

Mathematical economists who have studied the dynamics of nonrecontracting
exchange models have not had to impose gross substitutability on
the excess-demand functions in order to prove the convergence of prices
and commodity holdings. This has given rise to the conjecture that the
recontracting itself was a source of instability: that by permitting false
trades an unstable exchange model could be made stable. The question
naturally arises why we had to impose gross substitutability in the present
model.

There are two fundamental differences between the present model and
those of the "nontatonnement" literature. Both of these rule out the
stability proofs that have been used in that literature. First, prices enter
the utility functions in the present model but not in the nontatonnement
literature. Second, the nontatonnement literature has been concerned
exclusively with pure stock models, whereas we are dealing here with a
stock-flow model.

Uzawa's (1962) method of proof was to use the negative of the sum of
all transactors' utilities as a modified Lyapunov function. This function
will be decreasing over time in his model if every transactor refuses to

Uzawa 1961, p. 624.

Coddington and Levinson 1955, p. 327.

The two most important works in this area are those of Hahn and Negishi (1962)
and Uzawa (1962). The literature is summarized by Negishi (1962).

This conjecture is referred to (although not necessarily maintained) by Negishi
(1962, pp. 658-59) and Arrow and Hahn (1971, p. 328).




make any trade that results in a decrease in utility. However, once prices
enter the utility function, a transactor's refusal to accept a utility-decreasing
trade will no longer guarantee that his utility will be increasing over
time, for the price changes can make it decrease. There appears to be no
way of avoiding this in a model of monetary exchange where money is
only held with a view to its future purchasing power. Also, in a stock-flow
model where the flow commodities disappear after the trading period, the
transactor's utility might be decreasing if he is unable to engage in the
same trades as in the previous period, even if he refuses to engage in
utility-decreasing trades.

Hahn and Negishi's (1962) proof depends upon an assumption that the
value of this period's endowments for any transactor, evaluated at last
period's prices, is the same as last period's endowment evaluated at last
period's prices. In a pure stock model this assumption is natural because
this period's endowments are all the result of last period's trades, none of
which can change the value of the transactor's commodity holdings.
However, this assumption cannot be carried over to a stock-flow model
where production and consumption also change the endowments. Arrow
and Hahn (1971) and Fisher (1972) have transposed Hahn and Negishi's
analysis to a model of monetary exchange, but they too have a pure stock
analysis without prices in the utility function.


VI. Summary and Conclusion 

We have presented a method by which the stability proposition of the
quantity theory can be verified without the artificial separation of exchange
and price adjustment. The basic features of the formal model were
that sales income was not an argument of the excess-demand functions,
that all commodities were gross substitutes, and that the real balance
effect was operative on all transactors' net demands. It was shown that
the long-run equilibrium of the quantity theory was asymptotically stable
if there was no rationing, if there was a form of "competitive rationing,"
or if there were no distribution effects.

Of course it does not follow from acceptance of this stability proposition
that one must then accept the policy recommendations of the quantity
theorists, even if the proposition were to be proved in a much more
realistic model than the one used here. The main reason for this is that
the stability proposition refers only to the behavior of an unchanging
system as time goes to infinity. We still have no well-articulated theory to
tell us what to expect in the short run, the period of interest to policymakers.
What is needed, as Leijonhufvud (1968) has argued so convincingly,
is a short-run theory that can explain the dynamics of an
economy in which the coordination of economic activities is not costless.
Only after this has been accomplished can we begin to understand the




importance of Keynesian economics and its relation to the quantity
theory. All this study shows is that the quantity theory can still be valid
in the long run even in such a world of costly coordination.

One of the lessons of Keynesian economics is that the short-run behavior
of an economy is very much affected by the intermarket connections,
other than price changes, that make one transactor's decisions dependent
upon other transactors' expenditures. In particular, this is the essence of
the multiplier process. Our problem is that the only short-run theory
capable of describing such connections in a multimarket context is the
textbook Keynesian theory, which postulates that one set of prices (money
wages) is completely inflexible. On the other hand, the only theory that
allows all prices to adjust is the Walrasian theory of Patinkin, which does
not allow for any of the intermarket connections because it cannot handle
false trading. The difficulty that has stood in the way of allowing for
false trading in a model like Patinkin's is that the microfoundations have
not been strong enough to handle the problems mentioned above.
Because the "shopkeeper" approach used in this study attempts to deal
with these problems, it provides one way of allowing for the intermarket
connections without postulating any absolute price rigidities. Thus it
appears capable of handling some of the short-run questions that have
traditionally been the exclusive domain of Keynesian economics.
In fact, it can be shown by using this approach that a monetary
economy can behave in a Keynesian fashion in the short run even though
it has a stable neoclassical long-run equilibrium. In particular, there can
be a Keynesian multiplier process at work in the short run that amplifies
the effects of an exogenous change in aggregate demand. Many practical
macroeconomists have viewed Keynesian economics as important in the
short run when prices have not fully adjusted to their equilibrium levels,
and the quantity theory as important in the longer run when the economy
approaches its long-run equilibrium. The present approach appears to
provide a way of incorporating this view in a formal theory.


References 

Archibald, G. C., and Lipsey, R. G. "Monetary and Value Theory: A Critique
of Lange and Patinkin." Rev. Econ. Studies 26 (October 1958): 1-22.
Arrow, K. J.; Block, H. D.; and Hurwicz, L. "On the Stability of the Competitive
Equilibrium II." Econometrica 27 (January 1959): 82-109.


That is, costly in the sense that the activities of production, consumption, and
exchange cannot be suspended during the search for equilibrium prices.
See Leijonhufvud 1973.
See above.
Howitt 1973.

Of course this is not the view of the monetarists, who view the quantity theory as
important in the short run also.




Arrow, K. J., and Hahn, F. H. General Competitive Analysis. San Francisco:
Holden-Day, 1971.

Arrow, K. J., and Hurwicz, L. "On the Stability of the Competitive Equilibrium
I." Econometrica 26 (October 1958): 522-52.

Barro, R. J., and Grossman, H. I. "A General Disequilibrium Model of Income
and Employment." A.E.R. 61 (March 1971): 82-93.

Baumol, W. J.; Clower, R. W.; Burstein, M. L.; Hahn, F. H.; Ball, R. J.;
Bodkin, R.; Archibald, G. C.; and Lipsey, R. G. "A Symposium on Monetary
Theory." Rev. Econ. Studies 28 (October 1960): 29-56.

Bushaw, D. W., and Clower, R. W. Introduction to Mathematical Economics.
Homewood, Ill.: Irwin, 1957.

Clower, R. W. "Permanent Income and Transitory Balances: Hahn's Paradox."
Oxford Econ. Papers 15 (June 1963): 177-90.

---. "The Keynesian Counterrevolution: A Theoretical Appraisal." In The
Theory of Interest Rates, edited by F. H. Hahn and F. P. R. Brechling. London:
Macmillan, 1965.

---. "A Reconsideration of the Microfoundations of Monetary Theory."
Western Econ. J. 6 (December 1967): 1-9.

---. "The Microfoundations of Monetary Theory: Reply." Western Econ. J.
9 (September 1971): 304-5.

Coddington, E. A., and Levinson, N. Theory of Ordinary Differential Equations. New
York: McGraw-Hill, 1955.

Edgeworth, F. Y. Mathematical Psychics. London: Kegan Paul, 1881.

Fisher, F. M. "On Price Adjustment without an Auctioneer." Rev. Econ. Studies
39 (January 1972): 1-15.

Frevert, P. "On the Stability of Full Employment Equilibrium." Rev. Econ. Studies
37 (April 1970): 239-51.

Grossman, H. I. "Money, Interest, and Prices in Market Disequilibrium." J.P.E.
79 (September/October 1971): 943-61.

Hadar, J. "A Note on Stock-Flow Models of Consumer Behavior." Q.J.E. 79
(May 1965): 304-9.

Hahn, F. H. "Real Balances and Consumption." Oxford Econ. Papers 14 (June
1962): 117-23.

---. "On Some Problems of Proving the Existence of an Equilibrium in a
Monetary Economy." In The Theory of Interest Rates, edited by F. H. Hahn and
F. P. R. Brechling. London: Macmillan, 1965.

Hahn, F. H., and Negishi, T. "A Theorem on Non-Tatonnement Stability."
Econometrica 30 (July 1962): 463-69.

Hicks, J. R. Value and Capital. 2d ed. Oxford: Clarendon, 1946.

Howitt, P. W. "Studies in the Theory of Monetary Dynamics." Doctoral dissertation,
Northwestern Univ., 1973.

Kaldor, N. "A Classificatory Note on the Determinateness of Equilibrium." Rev.
Econ. Studies 1 (February 1934): 122-36.

Leijonhufvud, A. On Keynesian Economics and the Economics of Keynes. New York:
Oxford, 1968.

---. "Effective Demand Failures." Swedish J. Econ. 75 (March 1973): 27-48.

Lloyd, C. "The Microfoundations of Monetary Theory: Comment." Western Econ.
J. 9 (September 1971): 299-303.

Negishi, T. "The Stability of a Competitive Economy: A Survey Article."
Econometrica 30 (October 1962): 635-69.

Ostroy, J. M. "The Informational Efficiency of Monetary Exchange." A.E.R.
63 (September 1973): 597-610.




Patinkin, D. Money, Interest, and Prices. New York: Harper & Row. 1st ed., 1956;
2d ed., 1965.

Phelps, E. S. "Money Wage Dynamics and Labor Market Equilibrium." In
Microeconomic Foundations of Employment and Inflation Theory, by E. S. Phelps et al.
New York: Norton, 1970.

Samuelson, P. A. Foundations of Economic Analysis. Cambridge, Mass.: Harvard
Univ. Press, 1947.

Uzawa, H. "The Stability of Dynamic Processes." Econometrica 29 (October
1961): 617-31.

---. "On the Stability of Edgeworth's Barter Process." Internat. Econ. Rev. 3
(May 1962): 218-32.



Communications 


The Interrelation between Labor and Capital 
Components of Racial Income Differences 

George M. von Furstenberg
Indiana University


The isolation and projection of incomes from capital represents a recurring
problem in the study of income inequality. Almost without exception,
past studies of racial income or earnings differentials have attributed the
residual remaining after skill adjustment to current discrimination or to
other factors unconnected with property. If attempts to account for
differences in capital holdings were made at all, they remained indirect
and incomplete because of inadequate data. To compensate for some of this
lack, the relation between the earnings, capital endowments, and incomes
of the races is first clarified theoretically in this paper. The inference
drawn from the model is then matched with what evidence there is on the
actual racial distribution of incomes from capital.

It is shown that in distributional equilibrium with capital accumulation,
the earnings of whites relative to blacks equal the ratio of their total
incomes, provided the savings propensities, the rates of return on capital,
and the growth rates of labor (population) are identical. This initial
theorem and its less restricted forms yield equilibrium solutions of the
ratios of capital per man between the groups. The mathematical relations
involved can be useful to researchers dissatisfied with the statistical sources
on the racial breakdown of capital ownership and income. They should
also permit one to improve upon the rather indefinite treatment typically
accorded to incomes from capital and property in intertemporal income
comparisons for the two races. The two-sector model used in this paper is
adapted from the pioneering work of Joseph Stiglitz (1969).

I am indebted to a referee and to Franz Gehrels, Michele Fratianni, Eric Herr, and
Ralph Walter for helpful comments on earlier drafts.

See Bergmann 1971, p. 305.



The General Deduction

If $y_i$ and $w_i$ are the annual income and wage per capita of group i, $c_i$ is
the capital per man, and $r_i$ is the rate of return on capital owned by
group i rather than group j,

$$y_i = w_i + r_i c_i. \tag{1}$$

With savings per capita, $s_i$, assumed to be proportional to $y_i$, and with the
savings propensity of group i denoted by $m_i$,

$$s_i = m_i y_i. \tag{2}$$

Since employment (and population) grow at the rate of $n_i$ percent per
year in group i, it follows from the definition of $c_i$ that the time rate of
change is

$$\dot{c}_i/c_i = s_i/c_i - n_i. \tag{3}$$

Hence, the difference, D, in the rates of capital accumulation between
groups i and j is

$$D = \dot{c}_i/c_i - \dot{c}_j/c_j = s_i/c_i - s_j/c_j + n_j - n_i. \tag{4}$$

With the passage of time, capital per man would be equalized completely
between the two groups if the initial inequality $c_i > c_j$ implied
that D is negative (and vice versa). If $w_i$ equals $w_j$ and the population
growth rates, r values, and savings propensities are also identical, as
Stiglitz assumed, $c_i/c_j$ must be greater than $s_i/s_j$ as long as $c_i$ exceeds $c_j$.
Hence D is negative and $c_j$ inevitably grows faster than $c_i$ until it catches
up with $c_i$. However, equalization may remain incomplete in the racial
context. Substituting from (1) and (2) for $s_i$ and $s_j$ in equation (4) yields
the general expression

$$D = m_i w_i/c_i - m_j w_j/c_j + r_i m_i - r_j m_j + n_j - n_i. \tag{5}$$

In the special case where $m_i = m_j = m$, $r_i = r_j = r$, and $n_i = n_j = n$,

$$D = m(w_i/c_i - w_j/c_j). \tag{6}$$

If $(c_i/c_j) > (w_i/w_j)$, D remains negative as long as the degree of inequality
of capital per man in groups i and j exceeds the corresponding inequality
of the wage ratio. Conversely, if capital endowments were more equal
than the wage rates, $c_i$ would grow faster than $c_j$ until in equilibrium

$$(c_i/c_j)^* = w_i/w_j. \tag{7}$$

Using (7), the ratio between the equilibrium levels of per capita income
of groups i and j is also found to be equal to the wage ratio. For,

$$(y_i/y_j)^* = (w_i + r c_i)/(w_j + r c_j) = w_i/w_j. \tag{8}$$
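
The convergence result in (7) can be illustrated numerically. The sketch below assumes the special case of equation (6), constant wages, and n > mr, so that capital per man in each group settles at a stationary level. Under those assumptions, equations (1)-(3) give dc/dt = m(w + rc) - nc, which has a simple closed-form solution. All parameter values and function names are illustrative assumptions, not figures from the paper.

```python
import math


def capital_path(c0, w, m, r, n, t):
    """Closed-form solution of dc/dt = m*(w + r*c) - n*c, assuming n > m*r."""
    decay = n - m * r            # speed of adjustment; positive by assumption
    c_star = m * w / decay       # stationary capital per man
    return c_star + (c0 - c_star) * math.exp(-decay * t)


# Hypothetical common parameters for both groups (special case of equation (6)).
m, r, n = 0.10, 0.05, 0.01
# Hypothetical wages and initial capital per man: wage ratio 1.7, capital ratio 8.
w_i, w_j = 1.7, 1.0
c_i0, c_j0 = 8.0, 1.0


def capital_ratio(t):
    """Ratio of capital per man, group i over group j, at time t."""
    return capital_path(c_i0, w_i, m, r, n, t) / capital_path(c_j0, w_j, m, r, n, t)


initial_ratio = capital_ratio(0.0)    # equals c_i0 / c_j0
late_ratio = capital_ratio(5000.0)    # approaches the wage ratio, as in (7)
wage_ratio = w_i / w_j
```

With these assumed values the capital ratio starts above the wage ratio and declines toward it, which is the sense in which D stays negative while capital is more unequally distributed than wages.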




8 (7.90) for the two races. Hence, the passage of time alone should
reduce the ratio of white to black capital incomes per person from 8 to
between 4 and 2. The ratio would fall further to 1.7, the last term in
(7a), if population growth rates as well as the rates of return on capital
were equalized but the per capita earnings (and total incomes) of whites
remained permanently 70 percent higher than those of blacks. As
explained in the next section, a ratio of no more than 2 could also be the
long-term growth equilibrium value, as opposed to the instantaneous
redistribution equilibrium value, even if the growth rates of population differ.


Intertemporal Interpretation 

From census data for families and unrelated individuals, the 1970 labor
incomes per head were estimated as \(w_{i0} = w_i W_i/N_i\) and \(w_{j0} = w_j W_j/N_j\),
yielding $2,635 and $1,550 for whites and blacks, respectively. The
corresponding capital incomes per head were $300 and $38. Assuming
the average rate of return on privately held capital assets was 5 percent
for both races, \(c_{i0}\) equals $6,000 and \(c_{j0}\) equals $760. The weighted
average of the two estimates of capital per head amounts to little more
than half of $10,000. This is to be expected since the census measure of

In a related estimate standardized by age, Soltow (1972, p. 83) reports wealth at age
50 of $8,070 for all families and unrelated individuals in 1963, compared with $1,180
for nonwhite households alone. Though these figures are in 1870 dollars and include
many more items such as household furnishings, the ratio is again over 7. A lower estimate
of the difference is given for families in Kain and Quigley (1972, p. 276). Home equities
appear to be more equally distributed. From the 1970 Census of Housing (1971, pp. 10,
41), current per capita home values of $3,603 can be estimated for whites compared with
$1,205 for blacks. Since the latter use mortgage financing to a lesser degree, the ratio of
the much smaller home equities per head is probably less than 3.

This may be considered unlikely since the partial equalization of object capital held
will presumably be paralleled by a reduction in the inequality of human capital
accumulated. Differences in human-capital endowments, however, which are frequently
attributed to past discrimination, are one of the reasons for the wage differential between
the races. Moreover, the crude labor participation rate, \(W_j/N_j\), may itself be a function of
racial income differences and birth rates.

Disregarding the approximately 2 million nonwhites not classified as blacks, the
average level of capital income per head is therefore $270 for all persons. Multiplying by
the 1970 population of 200 million yields $54 billion. This is less than one-half of $114
billion, the sum of dividends, net interest, rental income of persons, and half of proprietors'
incomes reported in the national income accounts for 1970. (By contrast, the wage and
salary income of $541 billion is underestimated by only 10 percent.) As on tax returns,
underreporting of dividends and interest income seems to be widespread in the census
surveys. However, as long as the percentage of capital incomes (or assets) excluded is
similar for blacks and whites, underreporting does not vitiate our estimates. With regard
to self-employment income, alternative ways of inferring the property share are discussed
by Bronfenbrenner (1971, p. 29).

In the United States, the ratio of dividends to the price of stocks has been consistently
just above 3 percent. Including capital gains raises the real return to about 4.5 percent
per annum. The sensitivity of results to variations in r is examined in n. 12.





capital incomes suffers from underreporting and does not include real
capital gains, imputed rent, and returns on consumer durables.¹⁰

To determine the speed with which the relative amount of white-owned
capital per head may be expected to fall, the differential equation
solutions are derived for equations (9) and (10). From equations (1)-(3)
it is found that

$$\dot{c}_i = m w_i + (m r_i - n_i) c_i \qquad (9)$$

and

$$\dot{c}_j = m w_j + (m r_j - n_j) c_j. \qquad (10)$$


Assuming the average productivity of labor, g, grows at the rate of
3 percent per annum and g equals the constant rate of growth in per
capita real wages for whites and blacks, it follows that the ratio of per
capita incomes at time t, with r_i = r_j = r, is given by


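$$\frac{c_i(t)}{c_j(t)} = \frac{\left[c_{i0} - \dfrac{m w_{i0}}{g - mr + n_i}\right] e^{(mr - n_i)t} + \dfrac{m w_{i0}}{g - mr + n_i}\, e^{gt}}{\left[c_{j0} - \dfrac{m w_{j0}}{g - mr + n_j}\right] e^{(mr - n_j)t} + \dfrac{m w_{j0}}{g - mr + n_j}\, e^{gt}} \qquad (11)$$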


Substituting the initial values and the values of the parameters m, r,
g, n_i, and n_j previously selected shows that the equilibrium ratio of c_i and
c_j approaches the ratio of w_i0 to w_j0 as time goes to infinity. More precisely,


$$\frac{c_i(t)^*}{c_j(t)^*} = \frac{(g - mr + n_j)\, w_{i0}}{(g - mr + n_i)\, w_{j0}} = 2.0 \qquad (12)$$
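The convergence described by equations (9)-(12) can be traced numerically. The following is only a minimal sketch, assuming wages grow at the constant rate g so that each group's capital per head has the closed-form path used below. The values of m, r, g, and the initial capital stocks are those quoted in the text and notes; the population growth rates and initial wage levels are hypothetical placeholders chosen only so that the wage ratio is 1.7, and all function names are illustrative.

```python
import math

def capital_per_head(t, c0, w0, m, r, n, g):
    """Closed-form solution of dc/dt = m*w0*exp(g*t) + (m*r - n)*c with c(0) = c0.

    Requires g - m*r + n != 0.
    """
    a = m * w0 / (g - m * r + n)  # coefficient on the wage-driven term
    return (c0 - a) * math.exp((m * r - n) * t) + a * math.exp(g * t)

def capital_ratio(t, p):
    """Ratio c_i(t)/c_j(t) as in equation (11)."""
    ci = capital_per_head(t, p["c_i0"], p["w_i0"], p["m"], p["r"], p["n_i"], p["g"])
    cj = capital_per_head(t, p["c_j0"], p["w_j0"], p["m"], p["r"], p["n_j"], p["g"])
    return ci / cj

def equilibrium_ratio(p):
    """Long-run limit of the ratio as in equation (12)."""
    k = p["g"] - p["m"] * p["r"]
    return (k + p["n_j"]) * p["w_i0"] / ((k + p["n_i"]) * p["w_j0"])

params = {
    "m": 0.085, "r": 0.05, "g": 0.03,   # quoted in the text
    "c_i0": 6000.0, "c_j0": 760.0,      # quoted in the text and notes
    "n_i": 0.010, "n_j": 0.016,         # hypothetical growth rates
    "w_i0": 2550.0, "w_j0": 1500.0,     # hypothetical wages, ratio 1.7
}

if __name__ == "__main__":
    for year in (0, 9, 18, 50):
        print(year, round(capital_ratio(year, params), 2))
    print("limit", round(equilibrium_ratio(params), 2))
```

Because the transient term decays relative to the wage-driven term whenever g - mr + n is positive, the ratio approaches the limit in (12) regardless of the initial endowments.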

The larger g relative to n_j, the closer the approximation of the asset ratio
to the ratio of labor earnings (1.7) for any given excess of n_j over n_i. This
is a consequence of our assumption that as long as the aggregate rate of
growth of population remains constant, g is invariant to differences in the
growth rates of the white and black subgroups.¹¹ If the black population
increases faster than the white, its share of total employment rises also.
Hence, while the capital stock per man is diluted by faster relative rates
of population growth within any separate group, labor income is not.
If mr is less than both n_i and n_j, so that net savings out of capital incomes


¹⁰ While imputed rent is at least included in the national income accounts, both
construction and consumer durables were contained in the capital-stock measures
previously cited.

¹¹ To maintain the aggregate rate of population growth, both group rates must fall
over time since the share of the faster-growing group in the total population rises. On the
other hand, if the aggregate rate of growth of population were allowed to vary, the rate
of growth of the average productivity of labor would be affected. For the last 2 decades,
estimates of g range from 2.4 percent to 3.2 percent per annum. Fortunately, it is not
necessary to be completely consistent in these respects, since changing the population
growth rates has little effect on the ultimate equilibrium solution. Sensitivity tests for g
are reported in the next note.




alone do not suffice to sustain the initial level of capital per man, as our
numbers imply, the cumulative savings out of labor incomes determine the
ultimate ratios of capital per man. As time goes by, differences in population
growth rates and in the initial capital endowments become unimportant.
Even if mr < n_j but mr > n_i, so that savings out of capital
incomes more than suffice to maintain c_i but not c_j, savings out of labor
incomes remain decisive.

Assuming c_i(t)/c_j(t) falls from its initial level of 8 to the long-run
equilibrium value of 2.0, it is interesting to gauge how many years would
have to pass for some specified part of the disequilibrium gap to be closed.
From equation (11) it is found that it would take 9 years to reduce the
ratio from 8 to 4, and another 9 years to reduce it from 4 to 3.¹² In all, it
would take 50 years for the ratio of white to black capital per man to be
brought down to 2.2. Looking backward, it is therefore entirely possible
that the process of the partial equalization of capital holdings, begun by
the Emancipation Proclamation¹³ and reinforced in subsequent world
wars, has still not run its course.
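The number of years needed to close part of the gap can be recovered by solving equation (11) for t numerically. The sketch below uses bisection and assumes the ratio falls monotonically over the search interval. As before, m, r, g, and the initial capital stocks are taken from the text, while the population growth rates and wage levels are hypothetical stand-ins, so the printed years are illustrative rather than a reproduction of the paper's computation.

```python
import math

# Values quoted in the text
M, R, G = 0.085, 0.05, 0.03
C_I0, C_J0 = 6000.0, 760.0
# Hypothetical values (not given in this section)
N_I, N_J = 0.010, 0.016
W_I0, W_J0 = 2550.0, 1500.0

def capital(t, c0, w0, n):
    """Capital per head at time t for one group (closed-form solution)."""
    a = M * w0 / (G - M * R + n)
    return (c0 - a) * math.exp((M * R - n) * t) + a * math.exp(G * t)

def ratio(t):
    """White-to-black ratio of capital per head at time t."""
    return capital(t, C_I0, W_I0, N_I) / capital(t, C_J0, W_J0, N_J)

def years_to_ratio(target, lo=0.0, hi=200.0, tol=1e-6):
    """Bisection for the t at which ratio(t) == target.

    Assumes ratio(t) declines monotonically between lo and hi.
    """
    if not (ratio(hi) < target < ratio(lo)):
        raise ValueError("target is not bracketed by the search interval")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if ratio(mid) > target:
            lo = mid   # ratio still too high: look later
        else:
            hi = mid   # ratio already below target: look earlier
    return 0.5 * (lo + hi)

if __name__ == "__main__":
    for target in (4.0, 3.0, 2.2):
        print(target, round(years_to_ratio(target), 1))
```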

Since net asset data by race are incomplete for earlier periods,¹⁴ it
is not possible to test whether the convergence in capital holdings has
actually continued during the last few decades at the rate suggested by the
model. However, some indirect inferences can be drawn. Extrapolating the
numerator of equation (11) backward for 31 years (t = −31) yields a
c_i(t) of $2,315 for 1939 compared with $6,000 for 1970. The capital stock
per black, c_j(t), would fall even more; it becomes negative, indicating net
indebtedness. This result appears at variance with the findings of other
researchers.¹⁵ Hence the previous parameters of the model appear to
exaggerate the speed with which capital has been accumulated during
the last 31 years. Clearly the parameters chosen are not suitable for
backcasting.


¹² The results are not sensitive to the choice of parameters. For instance, increasing
the rate of return r from 5 to 10 percent without reducing c_i0 or c_j0 raises the time required
to halve the inequality in capital per man from 9 to 9.4 years. Reducing the rate of
growth in productivity, g, from 3 percent to 2.6 percent has about the same effect.

¹³ At the end of the Civil War in 1865, “an estimated 80 percent of Negroes were
totally illiterate and most were landless and destitute” (Kain 1969, p. 7). Southern
blacks, in particular, did not enjoy much security of property for decades thereafter, and
significant net savings were unlikely to materialize under these conditions. For estimates
of wealth for adult white and nonwhite males in 1870, see also Soltow (1972, p. 83).

¹⁴ Data on the racial distribution of some capital incomes (dividends, interest, net
rental income, income from estates and trusts) were first reported in 1968, with net
royalties added in 1970. The degree of instability shown by the figures for the first 2 years,
which is characteristic of beginning data series, implies a caution against overinterpretation
of the numerical estimates.

¹⁵ Sterner (1943, p. 92) finds a positive surplus of income over expenditures for most
classes of blacks even in 1935-36 (see also Lansing and Sonquist 1969, pp. 47-53).
Significant net indebtedness is, of course, unlikely to be obtainable for low-income
households.




One possible reason for this is that the savings functions have negative
intercepts. Then even with identical marginal propensities, the average
savings propensity of blacks (m_j) falls short of the average propensity for
whites (m_i), and the more so the greater the difference in incomes. For
forecasting purposes m_j was therefore halved, and for backcasting it was
reduced by 70 percent from its original level of 8.5 percent. With m_i of
0.085 and m_j of 0.0425 substituted for m in the numerator and denominator
of equation (11), respectively, the equilibrium ratio of capital per man is
found to rise from 2.0 to 4.2. Partly because the equilibrium level itself has
risen, it now would take 30 rather than 6 years to reduce the ratio of
capital holdings per man from 8 to 4.8. Similarly, backcasting with m_i
of 0.085 and m_j of 0.025 implies that while whites owned an estimated
$2,315 of capital per man in 1939, blacks owned $219, yielding a c_i(t)/c_j(t)
ratio of over 10.

Another reason for overestimating the speed with which capital holdings
converge may be the disregarding of differences in the rates of return on
capital between the races in the previous derivation. Even though Kain
and Quigley (1972, p. 270) reported that price appreciation of homes
may be higher for blacks than for whites, the net return on home equities
may still be lower if reduced benefits from leverage and tax deductions
are taken into account. Furthermore, the alternative investments of
most blacks, such as U.S. savings bonds and savings and loan shares and
bank deposits, may be expected to yield a lower rate of return than the
average return obtained by more diversified white investors. To test for
sensitivity to differences in r, r_j was therefore lowered from 5 percent to
3 percent. The equilibrium ratio of white to black asset holdings per head
rose only from 2.0 to 2.1, but the convergence of capital holdings was
retarded considerably.¹⁷ Once savings propensities are restored to
equality, the capital holdings of blacks would again be implied to be
negative in 1939. In other words, accumulation would still be sufficiently
rapid to require negative wealth holdings in 1939 if blacks are to end up
with no more than what they actually owned in 1970 under the assumed
savings behavior. Hence varying r_j does not make backcasting any more
feasible.

Not surprisingly, the forecasting results were sensitive to differences in
the average savings propensities assumed for whites and blacks. However,
because savings out of capital incomes are insufficient to keep up with


¹⁶ Kain and Quigley (1972, p. 276) report that home equities amount to about 40
percent of the net worth of lower-middle-income white families and to 67 percent of the
net worth of black families in the Survey of Economic Opportunity sample.

¹⁷ Since the capital stock was inferred from capital incomes, as explained at the
beginning of this section, c_j0 rises from $760 to $1,267, yielding an initial asset ratio of
below 4.8 for white to black. It would take 55 years to decrease this ratio by one-half.
I owe this entire paragraph to the referee.




population growth in this model, the sensitivity to differences between r_i
and r_j was found to be smaller.¹⁸ While it is interesting to let the average
savings propensities and even the rates of return on capital coincide in
long-term forecasting experiments, they must have differed systematically
in the past, and they may still differ. The reduction in the inequality of
capital holdings per head would be slowed accordingly. Since the adjustment
is gradual and since property incomes amount to a small fraction of
total incomes, little immediate relief can be expected from the ongoing
convergence of capital holdings. Nonetheless, it is important, if not very
comforting, to keep in mind that time will cure a significant part of one
of the most glaring distributional inequities originally created by the
condition of slavery. All of our estimates suggest that this part is at least
one-half, as when the ratio of asset holdings is reduced ultimately from
8 to 4, and it may be as high as three-quarters on the most optimistic
assumption that all parameters (n, r, and m) other than labor incomes
converge for the two races.

An Empirical Derivation of the Equilibrium Distribution of Assets 

So far, all our results implied that the capital holdings of blacks are far
from their distributional equilibrium. It is possible to test this inference
empirically by asking what the capital endowment of blacks would
have been if they held the same amount of assets in each income class as
the U.S. population as a whole. This test may be ambiguous to the extent
transition probabilities between income classes are higher for whites than
for blacks. On the other hand, except for recent immigrants, low-income
whites may have been adversely selected from the generally higher-income
white population. Hence the comparison need not be biased: if
low-income whites have larger negative transitory income components
than blacks, they would be expected to own more assets in any income
group. However, whites would be expected to hold lower nonhuman
wealth assets in equilibrium if poor whites are more likely to be the
product of adverse self-selection than poor blacks,¹⁹ who have fewer
options to start with.

On balance it is assumed in table 1 that blacks would have about the 
same amount of assets in any income class as the population as a whole if 

¹⁸ In long-term growth the sensitivity to differences in r is far less than in the
instantaneous equilibrium discussed in the first sections of this paper because labor productivity
is allowed to grow. In the earlier model, equilibrium would have to be achieved through
forced redistribution generating a current ratio of asset holdings that could be maintained
without further redistribution at given labor incomes (and growth rates) for the races.

¹⁹ Hanoch (1967) has suggested that private rates of return on higher education are
lower for blacks than for whites. For other references see Brimmer and Harper (1970, pp.
804-5). In a portfolio-balance model with imperfect substitutability between human and
nonhuman capital, one would thus expect blacks to demand less human and more nonhuman
capital than similarly situated whites if it is assumed that returns on nonhuman
capital are identical for the two races. For a comparison of rates of return on human and
nonhuman capital see Kendrick (1972, p. 120).





TABLE 1

Size of Wealth by Income Class and Inferred Equilibrium
Wealth Holdings for Whites and Blacks, 1962

                     Average    Y Distribution,     Product    Y Distribution,     Product
Household            Wealth     White Households    (1 × 2)    Black Households    (1 × 4)
Income (Y)           ($)        (%)                 ($)        (%)                 ($)
($)                  (1)        (2)                 (3)        (4)                 (5)

0-2,999                7,609       25.80             1,963        53.80             4,094
3,000-4,999           10,025       18.10             1,815        22.70             2,276
5,000-7,499           13,207       24.70             3,262        14.20             1,875
7,500-9,999           19,131       15.20             2,908         5.10               976
10,000-14,999         28,021       11.60             3,250         3.40               953
15,000-24,999         62,966        3.70             2,330         0.70               440
25,000-49,999        291,317        0.74             2,156         0.08               233
50,000-99,999        653,223        0.15               980         0.02               131
100,000 plus       1,698,021        0.01               170          ...               ...

Total                    ...      100.00            18,834       100.00            10,978


Source: Projector and Weiss (1966, p. 110); U.S., Department of Commerce, Bureau of the Census,
“Income of Families and Persons in the United States: 1962,” Consumer Income, Series P-60, no. 41
(Washington: Government Printing Office, 1963); U.S., Treasury Department, Internal Revenue Service,
Statistics of Income, 1962: Individual Income Tax Returns (Washington: Government Printing Office, 1965).
(The latter source is used as a guide for distributing households with incomes above $25,000, the cutoff
in the census source.)

Note: In the source followed, the difference between the concept of wealth and that of net worth is small
since wealth is reported net of the debts secured by specific assets. Only unsecured personal indebtedness is
not yet subtracted.


equilibrium had already been reached in 1962. For that year, Projector
and Weiss (1966) have provided estimates of the net wealth of households
by income classes, which can be matched with the income distribution
of black and white families and unrelated individuals. If only income
matters and race plays no part, the predicted equilibrium ratio of capital
per white over black household is $18,834/$10,978, or 1.7, while conversion
to a per capita base yields $5,955/$3,090, or 1.9, for 1962.
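The household figures in table 1 are income-share-weighted averages of wealth by income class, and the arithmetic can be checked directly. The sketch below recomputes the two totals from columns (1), (2), and (4); it assumes the blank entry for black households in the top income class can be treated as a zero share, and the variable names are illustrative. Small differences from the printed totals arise because the table sums rounded products.

```python
# Column (1): average wealth by household income class, in dollars
avg_wealth = [7609, 10025, 13207, 19131, 28021, 62966, 291317, 653223, 1698021]
# Column (2): percentage distribution of white households by income class
white_pct = [25.80, 18.10, 24.70, 15.20, 11.60, 3.70, 0.74, 0.15, 0.01]
# Column (4): percentage distribution of black households (top class assumed 0)
black_pct = [53.80, 22.70, 14.20, 5.10, 3.40, 0.70, 0.08, 0.02, 0.00]

def expected_wealth(wealth_by_class, pct_shares):
    """Wealth per household if each class holds the population-average wealth."""
    return sum(w * s / 100.0 for w, s in zip(wealth_by_class, pct_shares))

white_total = expected_wealth(avg_wealth, white_pct)
black_total = expected_wealth(avg_wealth, black_pct)
household_ratio = white_total / black_total

if __name__ == "__main__":
    print(round(white_total), round(black_total), round(household_ratio, 1))
```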

This result confirms dramatically that the existing inequality in wages
or incomes implies a hypothetical equilibrium ratio in capital stocks per
man of around 2. At least this would follow if blacks had asset holdings
equal to those for the rest of the population with incomes distributed like
those of blacks. Since the actual asset ratio is probably close to 8 and almost
certainly no less than 4, current income differences do not provide a
sufficient explanation for differences in capital holdings between the races,
so that blacks are not just like poor whites. Rather, reference must be
made to the historical conditions and externally imposed handicaps
specific to blacks if the continuing distributional disequilibrium in capital
accumulation is to be accounted for.


References 

Bergmann, B. R. “The Effect on White Incomes of Discrimination in Employment.”
J.P.E. 79 (November/December 1971): 294-313.

Bishop, G. A. “Income Redistribution in the Framework of the National Income
Accounts.” Nat. Tax J. 19 (December 1966): 378-90.









Brimmer, A. F., and Harper, H. “Economists’ Perception of Minority Economic
Problems: A View of Emerging Literature.” J. Econ. Literature 8 (September
1970): 785-806.

Bronfenbrenner, M. Income Distribution Theory. Chicago: Aldine, 1971.

Gaathon, A. L. “Economic Productivity in Israel, 1958-1965.” Rev. Income and
Wealth 16 (March 1970): 1-18.

Galenson, M. “Do Blacks Save More?” A.E.R. 62 (March 1972): 211-16.

Hanoch, G. “An Economic Analysis of Earnings and Schooling.” J. Human
Resources 2 (Fall 1967): 310-29.

Kain, J. F. Race and Poverty: The Economics of Discrimination. Englewood Cliffs,
N.J.: Prentice-Hall, 1969.

Kain, J. F., and Quigley, J. M. “Housing Market Discrimination, Homeownership,
and Savings Behavior.” A.E.R. 62 (June 1972): 263-77.

Kendrick, J. W. “The Treatment of Intangible Resources as Capital.” Rev.
Income and Wealth 18 (March 1972): 109-25.

Lansing, J. B., and Sonquist, J. “A Cohort Analysis of Changes in the Distribution
of Wealth.” Studies in Income and Wealth, vol. 33. Six Papers on the Size
Distribution of Wealth and Income. Edited by L. Soltow. New York: Nat. Bur.
Econ. Res., 1969.

Projector, D. S., and Weiss, G. S. Survey of Financial Characteristics of Consumers.
Washington: Board of Governors, Federal Reserve System, 1966.

Soltow, L. “A Century of Personal Wealth Accumulation.” In The Economics of
Black America, edited by H. G. Vatter and T. Palm. New York: Harcourt
Brace Jovanovich, 1972.

Sterner, R. The Negro’s Share. New York: Harper, 1943.

Stiglitz, J. E. “Distribution of Income and Wealth among Individuals.”
Econometrica 37 (July 1969): 382-97.

Tice, H. S. “Depreciation, Obsolescence, and the Measurement of the Aggregate
Capital Stock of the United States, 1900-1962.” Rev. Income and Wealth 13
(July 1967): 119-54.

U.S., Bureau of the Census. “Income in 1970 of Families and Persons in the
United States.” Current Population Reports. Series P-60, no. 80. Washington:
Government Printing Office, 1971. (a)

U.S., Bureau of the Census. 1970 Census of Housing. “General Housing
Characteristics: United States Summary.” Washington: Government Printing
Office, 1971. (b)



An Optimal Program for Managing Common 
Property Resources with Congestion 
Externalities 

Gardner Brown, Jr. 

University of Washington


The purpose of this paper is to derive an optimum program for managing
a common-property natural resource (fish, fowl, and some groundwater
resources) whose rate of growth of the resource stock depends on the
resource stock level and the current rate of “extraction.” Looking forward
in time, the dynamic property of this problem is provided by the fact that
an instantaneous increase in the natural stock of a resource forever increases
the physical productivity of that resource, while the externality
aspect is of course inherent in the common-propertyness assumption.
Congestion externalities arise from a specification of the aggregate production
function which implies interdependent microproduction functions.
It will be shown that if the resource stock initially is sufficiently small it
will be socially optimal to have a period of no extraction. When extraction
begins, it and the variable factor increase faster than the resource stock is
increasing. Charges reflecting the imputed marginal social value of the
resource are a decreasing function of the level of the stock. These results
differ from Vernon Smith’s recent static treatment of the same problem
(Smith 1968)¹ and are discussed below.

The Model 

The rate of resource extractions, x, depends on the stock of that resource, 
X, and a variable factor, K: 

$$x = G(X, K). \qquad (1)$$

It is assumed that G is strictly concave and homogeneous of degree 1.
The natural (percentage) rate of growth of the resource stock is denoted
by

$$f(X), \quad f' < 0, \quad f'' \le 0, \quad f(\bar{X}) = 0, \qquad (2)$$

I am indebted to my colleagues F. Mills and Orris Herfindahl and to a referee for
helpful comments, but am responsible for all errors.

¹ But see Quirk and Smith (1969) for the correct formulation and solution of a similar
problem. Arnold Zellner brought this paper to my attention after my unrevised version
was completed.





Fig. 1


where “prime” refers to the derivative of f with respect to X. In the case
of the fishery, 0 ≤ X ≤ X̄, as is illustrated in figure 1. For a rough approximation,
fishery population-dynamics models express the population
equation as f(X) = r(1 − X/X̄), where r and X̄ are parameters. Referred
to as the Verhulst-Pearl logistic, it has been used for analyzing
halibut (Crutchfield and Zellner 1962), yellowfin tuna (Schaefer 1967),
and other species (Beverton and Holt 1957). An alternative form for f(X)
is the Beverton-Holt relationship 1/(γ + βX), where γ and β are parameters.
Originally this was applied to fish populations, but recently a
slightly modified version seemed to provide an empirically adequate
characterization for the North American mallard waterfowl population
(Brown and Hammack 1971). Groundwater optimization models in
practice generally assume that the natural absolute yield is independent
of the stock level (Brown and McGuire 1967; Burt 1967) and therefore
can be described by the Beverton-Holt relationship with β = 0. In fact,
the geohydrology of aquifers is so varied that the signs of f' and f'' depend
on the particular basin studied (Walton 1970).
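The two growth specifications named above are easy to compare numerically. The following minimal sketch writes the percentage growth rate f(X) for the logistic and Beverton-Holt forms and the implied natural yield Xf(X). The parameter values are hypothetical and chosen only for illustration; a coarse grid search then locates the stock at which the logistic yield peaks, which is half the maximum stock.

```python
def logistic_rate(X, r, X_bar):
    """Verhulst-Pearl percentage growth rate f(X) = r * (1 - X / X_bar)."""
    return r * (1.0 - X / X_bar)

def beverton_holt_rate(X, gamma, beta):
    """Beverton-Holt percentage growth rate f(X) = 1 / (gamma + beta * X)."""
    return 1.0 / (gamma + beta * X)

def sustained_yield(X, rate, **params):
    """Natural absolute growth X * f(X) at stock level X."""
    return X * rate(X, **params)

# Hypothetical parameters for illustration only
r, X_bar = 0.5, 100.0
grid = [float(i) for i in range(0, 101)]
X_max_yield = max(
    grid, key=lambda X: sustained_yield(X, logistic_rate, r=r, X_bar=X_bar)
)

if __name__ == "__main__":
    print(X_max_yield, sustained_yield(X_max_yield, logistic_rate, r=r, X_bar=X_bar))
```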

It is assumed that the agency in control of the resource wishes to
maximize the present discounted value of profit over time, and it pursues






this objective assuming a constant product price, P, a constant variable
factor price, w, and a constant discount rate, ρ:

$$\int_0^\infty [P G(X, K) - wK]\, e^{-\rho t}\, dt. \qquad (3)$$


More generally, the net value function under the integral can be denoted
by V(x, X, K), where K can be defined in terms of boats in the case of
the fishery, number and location of wells in the instance of groundwater,
and hunters in the case of waterfowl. In these examples a good case for
the existence of congestion externalities can be made. The use of thinning
and cutting crews and doses of fertilizer by managers of forests does not
appear to cause externalities.

From the homogeneity assumption, G(X, K) = XG(1, k) = Xg(k),
where k = K/X. Rewriting (3) in terms of k, the agency now wishes to
maximize

$$\int_0^\infty \{[P g(k) - wk] X\}\, e^{-\rho t}\, dt, \qquad (3')$$

subject to

$$K \ge 0, \qquad (4)$$

$$\dot{X} = X f(X) - x, \qquad (5)$$

$$x = X g(k), \qquad (6)$$


and given X(0). This problem can be solved by the application of
Pontryagin’s Maximum Principle (Pontryagin et al. 1962). The appropriate
current value Hamiltonian (Arrow 1968, p. 94) is

$$H = \{[P g(k) - wk] X + \lambda [X f(X) - X g(k)]\}. \qquad (7)$$

The auxiliary variable, λ, may be interpreted as the imputed marginal
value of the stock resource.

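In addition to satisfying (4), (5), and (6), the necessary conditions for
a maximum are

$$\frac{\partial H}{\partial K} = (P - \lambda) g' - w \le 0, \qquad (8)$$

$$\dot{\lambda} - \rho\lambda = -\frac{\partial H}{\partial X} = -[(P - \lambda)(g - kg') + \lambda (X f' + f)], \qquad (9)$$

and the transversality conditions,

$$\lim_{t \to \infty} \lambda X e^{-\rho t} = 0, \qquad \lim_{t \to \infty} \lambda e^{-\rho t} \ge 0. \qquad (10)$$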






Fig. 2


The sufficient conditions for an optimal policy established by Arrow
(1968, pp. 91, 92, 95, 97) are satisfied in this problem essentially by assuming
that the production function is concave, the conditions on natural
resource growth specified by (2), and the assumption that P > λ.²

Equations (5), (6), (8), and (9) comprise an autonomous, two-differential-equation
system in the sense that time is not an explicit variable
(Arrow 1968, pp. 91, 95, 96; Arrow and Kurz 1970, pp. 64-66). Thus, the
optimal path is independent of initial values of X. The system can be
characterized in the following phase diagram (fig. 2). The underlying
mathematics are in the Appendix. The motion of this system in phase
space has the following economic interpretation. A manager could establish
a charge per unit of resource extracted sufficient to keep the stock level
of the resource unchanged. If he levies a lower charge per unit extracted,
harvest in excess of the natural amount of growth will occur, and the stock
level will fall. This explains the leftward direction of the horizontal arrows

² Define H⁰(X, λ, t) = max_k H(X, k, λ, t). The assumption that H⁰ is concave in X for
given λ and t implies that the necessary conditions are sufficient for optimality. It is easy
to show that ∂²H/∂X² = (P − λ)(g''k²)/X + λ(2f' + Xf'') < 0.







in quadrants III and IV. Correspondingly, a charge too high reduces
harvest below the physical equilibrium level, and the stock builds up as is
indicated by the rightward horizontal arrows in quadrants I and II.

For a given X there exists a λ such that λ̇ = 0. Should the manager
charge that same λ for a higher X, more extraction than is desired will
take place (because cost is falling in the stock variable), the stock will be
reduced below the desired levels, its natural marginal productivity will
rise, and λ will rise to reflect this fact, expressed geometrically by the
rising vertical arrows in quadrants I and IV. Falling vertical arrows in
quadrants II and III follow from analogous reasoning.

It bears emphasizing that other phase diagrams can be drawn for this
problem. Clearly, there are natural resources for which it never is profitable
to harvest because the unit extraction cost is higher than the price
consumers are willing to pay. Just as obviously, there can be a discount
rate large enough, a biological growth rate low enough for any level of X,
and a w small enough to warrant economically the extinction of a species
(Quirk and Smith 1969, pp. 20-21). The blue whale may be an example
of the latter case if society believes the cost of losing this species is relatively low.


Transient States

The natural place to begin the descriptive analysis is at low initial levels of
the stock resource, X(0) < X*, where the asterisk refers to the optimal
stationary value. This condition arises when valuable natural resources
in the past have been regarded as common property by the users. For X
sufficiently small, its own marginal rate of return, ∂Ẋ/∂X = f + Xf', is
very high. The marginal stock resource value, λ, will be greater than the
market price of the harvestable product. Therefore, from (7) and (8),
the variable factor K = 0 and there should be no extraction. Since it is
specified that the marginal product of natural capital falls with increases
in X, ultimately the optimal policy calls for harvesting, the inequality in
(8) no longer is binding, and we follow the familiar rule that k should be
so chosen that marginal cost, w/g', equals the net price of P − λ.

From the individual user’s point of view, when X is small and unmanaged,
his marginal cost of extraction is high and he cannot fully capture
in the future what he does not harvest today, so his marginal valuation
of natural capital is low and rises as the stock of natural resources
grows. Of course the evolution of λ from the manager’s point of view is
just the reverse. Initially the manager should set λ high because in future
periods he can capture the high return from investing in the present. As
capital accumulates, investment in natural resources becomes less attractive
relative to ρ, the rate of return available elsewhere. Accordingly λ
falls. The optimal path in the phase diagram simply corroborates good
economic sense.
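The harvesting rule implied by condition (8) can be illustrated with a specific production function. The sketch below assumes the Cobb-Douglas intensive form g(k) = k^a with 0 < a < 1, which is an illustrative choice rather than a form required by the model, and all numbers are hypothetical. When the imputed stock value λ is at least the product price P there is no extraction; otherwise k is set so that marginal cost w/g' equals the net price P − λ.

```python
def optimal_factor_intensity(P, lam, w, a):
    """Return k = K/X satisfying condition (8) for g(k) = k**a, 0 < a < 1.

    If lam >= P the net price is nonpositive, so the corner k = 0 applies.
    Otherwise (P - lam) * a * k**(a - 1) = w is solved for k.
    """
    if lam >= P:
        return 0.0
    return (a * (P - lam) / w) ** (1.0 / (1.0 - a))

def marginal_cost(w, k, a):
    """Marginal cost of output, w / g'(k), for g(k) = k**a and k > 0."""
    return w / (a * k ** (a - 1.0))

if __name__ == "__main__":
    P, w, a = 10.0, 3.0, 0.5          # hypothetical price, factor cost, exponent
    for lam in (12.0, 4.0, 2.0):      # falling imputed stock value
        k = optimal_factor_intensity(P, lam, w, a)
        print(lam, k)
```

As λ falls along the optimal path, the chosen k rises, which matches the statement that the variable factor grows faster than the stock once harvesting begins.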





As the resource grows, the rate of extraction, x, and the variable factor,
K, grow over time until the optimal stationary levels x*, X*, and K* are
reached, given that x > 0. Then the rate of harvest equals nature’s
periodic yield, X*f(X*).

When X < X* and harvest is positive, both x and K increase faster than
the stock resource is increasing. These conclusions follow from rewriting
(6) as

$$\frac{x}{X} = g\!\left(\frac{K}{X}\right). \qquad (6')$$

Since λ falls over time, g' must be decreasing in order for (8) to hold, so K
must increase faster than X. Therefore, from (6'), output must be growing
relatively faster than the stock resource.

The economic interpretation of these changes is that a rise in the net
value of the harvested resource, (P − λ), through a fall in λ induces an
expansion of output and an increase in k, thereby increasing the marginal
cost of output (through a decrease in the marginal productivity of k) in
order to restore equality between net value and marginal cost. This
particular qualitative relationship between optimal changes in x and K
with respect to the state variable, X, is a result of the homogeneity assumption
on the harvest production function. But for these conclusions, the
content of the analysis throughout the paper is unchanged for any G
which is concave.

In those fortunate instances when a manager comes upon a resource
whose stock level is above the optimum, X(0) > X*, the imputed price
(λ) of the stock is initially relatively small and should grow over time along
the optimum path, and the ratio of extraction to the stock level (x/X) falls
over time since K/X is falling, until the stock
resource declines to the steady-state level X*. This is illustrated by the right-hand portion of
the stable branch in figure 2.


Stationary Values 

It is of interest to inquire about the relationship between the steady-state
stock level, X*, and that level for which the annual physical yield of the
resource is a maximum, X°. In his static treatment of a comparable
problem, Vernon Smith concluded that the optimal stock level is always
to the right of X° (see fig. 1) because the marginal extraction cost is falling
in X (Smith 1968, p. 427). In the dynamic context, such a conclusion
cannot be reached because there is an opportunity cost involved in waiting
for larger stock levels. We know that X* > X̂, where f(X̂) + X̂f'(X̂) =
ρ, but this fact exhausts our information about the location of X*.³

³ From (9), λ̇ = −(P − λ)(g − kg') when f(X̂) + X̂f'(X̂) = ρ. Either λ ≥ P, in
which case X obviously is growing since there is no harvest, or λ < P, in which case
λ̇ < 0 since g − kg' > 0 because g is strictly concave.






Although it is true that extraction costs fall with increasing X, they may
not be falling sufficiently fast to warrant waiting while stocks build up.
If X* < X° < X(0), the optimal management plan calls for progressively
higher unit costs of extraction over time as X decreases. Referring
to figure 3, had the adjustment process toward a steady-state value of X*
begun at X, it should not stop at X**, where the instantaneous rate of
profit is greater than at X*, but it should go to X*. The “short-run”
profit obtained from driving the stock level from X** to X* more than
compensates for the “long-run” additional costs of extracting f(X*).
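The stock levels X° and X̂ discussed in this section have simple closed forms under the logistic growth function, which makes the ordering concrete. The sketch below assumes f(X) = r(1 − X/X̄) with hypothetical parameter values. The own rate of return f + Xf' then equals r(1 − 2X/X̄), which is zero at the maximum-yield stock X° and equals the discount rate ρ at X̂, so that X̂ lies below X° whenever ρ is positive. The model places X* above X̂ but does not fix its position relative to X°.

```python
def own_rate_of_return(X, r, X_bar):
    """d[X f(X)]/dX = f + X f' for the logistic f(X) = r * (1 - X / X_bar)."""
    return r * (1.0 - 2.0 * X / X_bar)

def max_yield_stock(X_bar):
    """X°: stock at which the natural yield X f(X) is largest (own rate = 0)."""
    return X_bar / 2.0

def discount_rate_stock(r, X_bar, rho):
    """X-hat: stock at which the own rate of return equals rho (needs rho < r)."""
    return 0.5 * X_bar * (1.0 - rho / r)

if __name__ == "__main__":
    r, X_bar, rho = 0.5, 100.0, 0.1   # hypothetical parameters
    X_hat = discount_rate_stock(r, X_bar, rho)
    print(X_hat, max_yield_stock(X_bar))
```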

This solution may be compared with the results for a related problem
obtained by C. G. Plourde (1960). In his model, the utility of consumption
U(C), or utility of harvest, is maximized over time in a context of no
cooperating inputs. Plourde shows that the steady-state optimum natural
resource stock level is that for which the own rate of return equals ρ. Thus
his stationary level of stock always is lower than that prescribed in this
model. The steady-state harvest also is lower compared with the levels
obtained in the present analysis provided X* < X°. That is, the optimum
stock is not to the right of the level which produces maximum physical
sustained yield. In general, optimal harvest can be greater than, equal to,
or less than the solution value satisfying Plourde’s model. Only by placing




further restrictions on the functions can we make the comparative results
unique.

The policy prescriptions of the two models also differ when natural
resource stocks are comparatively low. Since U'(C) → ∞ as C → 0,
there should always be some consumption in Plourde’s model, whereas in
this study the imputed marginal value of the stock resource for small X
more than matches the instantaneous rate of profit for any positive rate
of extraction, so harvesting should be postponed. Apart from this minor
distinction, the behavior of the two systems in terms of λ̇/λ and Ẋ/X
during the transient states is qualitatively equivalent.


Resource Management Policy 


Efficient natural resource use can be achieved within a decentralized
competitive system if the management agency enunciates two pricing
policies, one which reflects the scarcity value of the resource stock and
another one designed to capture the opportunity cost of congestion. Given
the common-property aspects of this problem, it is necessary to levy a
charge of λ per unit extracted, since λ is not a market phenomenon.

The congestion externality may be accounted for by charging a tax for
the use of a unit of the variable factor. The congestion externality
arises from the fact that variable factors in a competitive
economy will be committed to the harvesting activity until


$$(P - \lambda)\,\frac{g}{K} = w, \tag{11}$$


since average output per variable factor in the resource industry is the
firm's marginal output as viewed by it when the number of firms harvesting
is large.⁴ To induce the correct level of K, a unit charge, t,

$$t = (P - \lambda)\left(\frac{g}{K} - g'\right), \tag{12}$$


on the variable factor is computed from the requirement of equality of (8)
and (11).⁵
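The unit charge can be checked numerically. The sketch below is illustrative only: it assumes that condition (8) reads (P - λ)g'(K) = w, that the competitive entry condition is (P - λ)g(K)/K = w + t once the charge is levied, and that g(K) = K^α as in the footnote on the tax. The names `factor_tax`, `efficient_K`, and the symbol `w` for the factor price are hypothetical labels, not the author's.

```python
# Sketch under stated assumptions: g(K) = K**alpha, efficiency condition
# (P - lam) * g'(K) = w, and a per-unit factor charge equal to the gap
# between the value of average and marginal product.

def factor_tax(P, lam, K, alpha):
    """Charge per unit of K: (P - lam) * (g/K - g')."""
    g = K ** alpha
    g_prime = alpha * K ** (alpha - 1)
    return (P - lam) * (g / K - g_prime)


def efficient_K(P, lam, w, alpha):
    """K that solves the assumed efficiency condition (P - lam) * g'(K) = w."""
    return (w / ((P - lam) * alpha)) ** (1.0 / (alpha - 1))


if __name__ == "__main__":
    w, alpha = 2.0, 0.5
    for P, lam in [(10.0, 3.0), (20.0, 5.0)]:
        K = efficient_K(P, lam, w, alpha)
        # At the efficient K the charge should equal w * (1 - alpha) / alpha
        # regardless of P and lam.
        print(K, factor_tax(P, lam, K, alpha), w * (1 - alpha) / alpha)
```

Under these assumptions the charge evaluated at the efficient K depends only on w and α, which is the independence property the footnote states.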

I am unaware of any domestic natural resource management program
remotely resembling the approach recommended here. Public natural
resource managers in this country use neither a charge which varies with


⁴ The condition that there are a large number of firms probably is met adequately for
natural resources such as the various domestic fisheries and groundwater. There may be
some doubt about its validity in the instance of forestry and in some international fisheries
such as whaling.

⁵ The tax is a simple expression if the extraction production function is x =
for then g = K^α and from (12) and (8), t = w(1 - α)/α. The variable factor tax is
therefore independent of X and K.




rate of use nor a charge for the right of entry, nor do they have the
necessary enabling legislation to restrict the number of firms entering the
fishery or groundwater basins. Additionally, the number of waterfowl
hunters is not restricted.⁶ There are two exceptions to these statements.
In Colorado and New Mexico, application for a new right to use groundwater
can be denied if it impairs unreasonably existing rights in terms of
quantity or quality.⁷ Only in California can a public water district charge
a pumping tax, and only one district makes use of this taxing power.⁸


Conclusion 

The static treatment and solution of an externality problem which is
inherently dynamic generally will lead to prescriptions which cause social
losses. This is illustrated in Vernon Smith's article (1968, p. 426), where his
artificial restriction of λ = 0 is binding in the dynamic formulation,
hence costly.

Omitted from this analysis are costs of exit in the case of overexploited
stocks. If, for example, the social cost of moving boats out of the fishery or
of displacing loggers in the forestry industry is greater than the profit
obtained from pursuing the policy described above, obviously the best
social policy is to do nothing in the context of the present model. Given
exit costs smaller than this magnitude and assuming income transfers are
politically inappropriate, the "second-best" policy is apparent although
not formally developed. Some absolute rate of extraction over time greater
than that derived earlier is called for until the steady state is reached, and
this happy event will be postponed in time.


Appendix 


Differentiating both (8) and (5) after substitution of (6) into (5) yields, after 
substitution, 


dX 

dX 


Xs 0 


(P - Xlff 

(gV 


0,F> 


A. 


⁶ No one can seriously argue that the cost of a season duck stamp is set with an eye to
managing hunter congestion. There are restrictions on the right of entry to shoot game in
the public forests of Germany. Curiously, the restriction is based on the ranking of
individuals in terms of their performance on an exam which tests hunting proficiency and
knowledge of "good" conservation practices (see Webb 1961). As for international
resources, the right to harvest pelagic seals is restricted in some instances (Kasahara
1972, p. 66). Entry of Japanese fishermen is restricted by the central government or
fishermen cooperatives in the inshore, offshore, and high seas fisheries (ibid., chap. 1,
and pp. 110-11).

⁷ See Fundingsland v. Colorado Groundwater Commission, 468 P.2d 835 (1970);
Mathers v. Texaco, Inc., 77 New Mexico 239, 421 P.2d 771 (1966). For further
discussion see Corker 1971, chap. 5.

⁸ Interview with Langdon Owen, manager, Orange County Water District, December
28, 1970.



since $f'' \le 0$, $g'' < 0$;


dX 


x-a 


jP - 

{gV 


for X > 0. 


Differentiating both (8) and (9) and simplifying through application of (8)
gives


dX 


A=o 


km' + xD 

P - (/+ Xf’) + g 


<Q,P > k> 0. 


The numerator is negative because f' < 0 and f'' ≤ 0. The denominator is
positive because (9) can be rewritten as


P - (Xf' + f) = (g - Kg') > 0,


since strict concavity of g makes g - Kg' > 0.

From (10) and (8) it is easy to show that, when A = 0, 


dX 


= -km' + Xf) > 0 , 


l^lconstanc 

and from (5) and (8), it is readily proved that, when X = 0, 


dk 


■ilV 


> Q,P > k 


liconiunt {P 

The motion of this system is characterized by (Al) and (A2) 


(Al) 


(A2) 


References 

Arrow, K. J. "Applications of Control Theory to Economic Growth." Stanford,
Calif.: Stanford Univ. Inst. Math. Studies Soc. Sci., 1968.

Arrow, K. J., and Kurz, Mordecai. Public Investment, the Rate of Return and Optimal
Fiscal Policy. Baltimore: Johns Hopkins Press, 1970.

Beverton, R. J. H., and Holt, S. J. On the Dynamics of Exploited Fishery Populations.
London: Her Majesty's Stationery Office, 1957.

Brown, Gardner, Jr., and Hammack, Judd. "Economic Evaluation of a Fugitive
Recreational Resource: Migratory Waterfowl." Paper prepared for Resources
for the Future Multi-disciplinary Workshop on Research in Wildlands,
Wildlife and Science Resources, Missoula, Montana, August 4-5, 1971.

Brown, Gardner, Jr., and McGuire, C. B. "A Socially Optimum Pricing Policy
for a Public Water Agency." Water Resources Res. 3 (1967): 33-43.

Burt, Oscar. "Groundwater Management under Quadratic Criterion Functions."
Water Resources Res. 3 (1967): 673-82.

Corker, Charles. "Groundwater Law, Management and Administration."
Review draft of Legal Study 6, prepared for National Water Commission,
October 1971.

Crutchfield, James, and Zellner, Arnold. "Economic Aspects of the Pacific
Halibut Fishery." Fishery Ind. Res. 1 (April 1962): 1-173.

Kasahara, Hiroshi. "Future of Japanese Distant-Water Fisheries." Mimeographed.
University of Washington, 1972.

Plourde, C. G. "A Simple Model of Replenishable Natural Resource
Exploitation." American Econ. Rev. 60 (June 1970): 518-22.

Pontryagin, L.; Boltyanskii, V.; Gamkrelidze, R.; and Mishchenko, E. The
Mathematical Theory of Optimal Processes. New York: Interscience, 1962.

Quirk, James, and Smith, Vernon. "Dynamic Economic Models of Fishing."
Research Paper no. 22, Research Papers in Theoretical and Applied Economics,
Univ. Kansas, Dept. Econ., June 1969.

Schaefer, Milner. "Fishery Dynamics and Present Status of the Yellowfin Tuna
Population of the Eastern Pacific Ocean." Inter-American Tropical Tuna
Commission 12, no. 3 (1967): 89-136.

Smith, Vernon. "Economics of Production from Natural Resources." American
Econ. Rev. 58 (June 1968): 409-31.

Walton, William. Groundwater Resource Evaluation. New York: McGraw-Hill, 1970.

Webb, William. "Forest Wildlife Management in Germany." J. Wildlife
Management (April 1961), pp. 147-61.



Cognitive Range in the Theory of 
Revealed Preference 

Anthony Y. C. Koo
Michigan State University

Peter Schmidt
University of North Carolina at Chapel Hill


In some recent work (Koo 1963; Koo and Hasenkamp 1972), empirical
investigation has been made of the extent to which the behavior of
households is in accordance with the axioms of revealed-preference theory.
These investigations have found that, with few exceptions, almost all
families made at least some inconsistent choices. Of course, this is not
surprising to economists, and the literature abounds with explanations.
However, few of these explanations are testable against market data. In
this paper we attempt to test two possible explanations for observed
inconsistent behavior. The hypotheses to be tested are proposed in
Section I, and the tests are described in Sections II and III.

I 

Let $q^i$ and $p^i$ represent n-dimensional vectors representing a “bundle” of
goods and their prices, respectively, and let $p^i q^i$ be the vector product
representing expenditure on bundle $q^i$ when prices are $p^i$. Then the basic
postulate of revealed preference states that if bundle $q^i$ is chosen when
prices are $p^i$, and bundle $q^j$ is chosen when prices are $p^j$, and if

$$p^i q^i > p^i q^j \tag{1}$$

and

$$p^j q^j < p^j q^i, \tag{2}$$

then the choice of $q^i$ is said to be a “consistent” choice. On the other hand,
the choice of $q^i$ and $q^j$ is “inconsistent” if

$$p^i q^i > p^i q^j \tag{3}$$

and

$$p^j q^j > p^j q^i \tag{4}$$

both hold. The basic postulate of revealed-preference theory is therefore
just that such inconsistencies do not occur.

The authors wish to acknowledge with gratitude the services of Art Havenner in
supervising the programming and computation of the project.
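The pairwise definition can be stated as a small function. This is a minimal sketch, not the authors' program: `expenditure` and `classify_pair` are hypothetical names, bundles and prices are plain tuples, and the handling of exact ties is an assumption because the text does not discuss them.

```python
# Sketch of the consistency check for one ordered pair (i, j).
# Inputs are price and quantity vectors of equal length.

def expenditure(p, q):
    """Inner product p.q, the cost of bundle q at prices p."""
    return sum(pk * qk for pk, qk in zip(p, q))


def classify_pair(p_i, q_i, p_j, q_j):
    """Classify the choice of q_i (at p_i) against q_j (at p_j)."""
    if expenditure(p_i, q_i) <= expenditure(p_i, q_j):
        return "no comparison"      # q_i is not revealed preferred to q_j
    own = expenditure(p_j, q_j)     # cost of q_j at its own prices
    other = expenditure(p_j, q_i)   # cost of q_i at prices p_j
    if own < other:
        return "consistent"
    if own > other:
        return "inconsistent"
    return "tie"                    # assumption: ties are set aside


if __name__ == "__main__":
    print(classify_pair((1, 1), (4, 4), (1, 2), (2, 3)))
    print(classify_pair((2, 1), (4, 1), (1, 2), (1, 4)))
```

In the second call each bundle was affordable when the other was chosen, which is the inconsistent case.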

Two hypotheses can immediately be suggested in an attempt to explain
the inconsistencies which are observed to occur. The first is that consumers
cannot rank in a consistent manner alternatives which are very similar.
For example, a consumer may not be able to decide if he prefers 2 oz. of
beer and 2 oz. of wine, or 2.001 oz. of beer and 1.999 oz. of wine. The
assumption here is that an individual may only have discrete levels of
perception (or preference), and that alternatives can be ranked
consistently only if they differ by some minimum “threshold” amount. This
hypothesis, which will be called the “threshold hypothesis,” is very
similar to the threshold notion as used in sensory psychology.

The second hypothesis is that consumers cannot rank in a consistent
manner alternatives which are very dissimilar. A consumer may have a
“typical” consumption pattern and may not adequately consider alternative
consumption patterns which differ widely from the one he is used to.
Hence, he would not be likely to make errors when he is ordering bundles
in some immediate neighborhood of his past experience; he is likely to
make errors in ordering bundles which are some distance apart.

The assumption is equivalent to the definition of marginal rate of
substitution between various goods as describing the individual's scale of
preference, and the errors in ordering imply that the differential equation
for indifferent changes in purchases may not be integrable.¹ This
hypothesis will be called the “cognitive-range hypothesis,” since it essentially
implies that a consumer has a limited cognitive range.²

These two hypotheses will now be tested against market data of the
Michigan State University consumer food panel, consisting of a sample of
about 250 families. In the present study, only families that made all 52
weekly reports for at least 1 calendar year in 1955-58 were chosen; the
number of families considered was 87 in 1955 and 1956, and 127 in 1957

¹ For an early brief introduction to the subject of integrability, see Allen (1938,
pp. 438-42, 509-17). In economic terms, why a succession of infinitesimal indifferent
displacements could not lead to an indifference variety of some sort rests in the absence
of transitivity (see Georgescu-Roegen [1967, pp. 149-50 and sec. IV], Hurwicz [1971],
and Samuelson [1950]). That transitivity in the revealed-preference sense also requires
integrability was first shown by Houthakker (1950, pp. 173-74).

² One may argue that the cognitive-range hypothesis is really testing the income effects
on consumer behavior. When his income remains unchanged, the consumer can only
substitute one bundle of goods for another. It follows that he is less likely to make a big
adjustment and will, accordingly, commit fewer mistakes. On the other hand, a large
adjustment of his consumption pattern frequently follows from an income change, in
which case he is more likely to make mistakes. This income variation hypothesis was tested
in Koo and Hasenkamp (1972) and rejected. The number of inconsistent bundles for
each family was regressed against family characteristics including annual family income.
We have a separate equation for each year (1955, 1956, 1957, and 1958) based on
cross-sectional data. The signs for the coefficients of family annual income are equally
divided between plus and minus for the four equations.



and 1958. The 52 weekly purchases were grouped into 13 observations.
Details of the data are described in Quackenbush and Shaffer (1960).


II

Consider two bundles $q^i$ and $q^j$, and suppose that $p^i q^i > p^i q^j$. Then
the choice between $q^i$ and $q^j$ is consistent if $p^j q^j < p^j q^i$, and is
inconsistent if $p^j q^j > p^j q^i$. Now, the threshold hypothesis could be
interpreted as implying that inconsistencies would be more likely when
$|p^j q^j - p^j q^i|$ is small than when it is large; that is, inconsistencies might
occur when $p^j q^j$ and $p^j q^i$ are almost imperceptibly different. Note that
this can be interpreted as a threshold effect in the perception of
expenditure levels, representing an inability or unwillingness to make sufficiently
fine expenditure calculations. Alternately, since $|p^j q^j - p^j q^i| =
|p^j (q^j - q^i)|$, it can be interpreted as a threshold effect in the preference
ordering of various commodity bundles, if $|p^j (q^j - q^i)|$ is interpreted as
a measure of the “difference” between $q^i$ and $q^j$. In either case, the
conjecture is that $|p^j q^j - p^j q^i|$ should tend to be smaller for inconsistent
choices than for consistent ones.

To test this conjecture, the following procedure was employed. For
each family year, $p^i q^i$ was compared with $p^i q^j$ for all $i \ne j$; $i, j =
1, 2, \ldots, 13$. If $p^i q^i < p^i q^j$, nothing more is done. On the other hand, if
$p^i q^i > p^i q^j$ (so that $q^i$ is revealed preferred to $q^j$), the observation is
placed in one of two categories:

If $p^j q^j < p^j q^i$, the choice is consistent and the difference
$|p^j q^j - p^j q^i| = p^j q^i - p^j q^j$ is entered into the consistent list.

If $p^j q^j > p^j q^i$, the choice is inconsistent and the difference
$|p^j q^j - p^j q^i| = p^j q^j - p^j q^i$ is entered into the inconsistent list.
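The list-building procedure for one family year can be sketched as follows. This is an illustration under assumptions, not the original program: `dot` and `build_lists` are hypothetical names, observations are given as parallel lists of price and quantity vectors, and exact ties are skipped because the text does not say how they were treated.

```python
# Sketch: sort all ordered pairs (i, j), i != j, into the consistent and
# inconsistent lists of expenditure differences for one family year.

def dot(p, q):
    return sum(pk * qk for pk, qk in zip(p, q))


def build_lists(prices, quantities):
    consistent, inconsistent = [], []
    n = len(prices)
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            # Keep the pair only if q^i is revealed preferred to q^j.
            if dot(prices[i], quantities[i]) <= dot(prices[i], quantities[j]):
                continue
            own = dot(prices[j], quantities[j])      # p^j q^j
            other = dot(prices[j], quantities[i])    # p^j q^i
            if own < other:
                consistent.append(other - own)
            elif own > other:
                inconsistent.append(own - other)
    return consistent, inconsistent


if __name__ == "__main__":
    prices = [(2, 1), (1, 2), (1, 1)]
    quantities = [(4, 1), (1, 4), (1, 1)]
    print(build_lists(prices, quantities))
```

With 13 observations per family year the double loop visits 156 ordered pairs, so no special data structure is needed.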

We then tested to see whether the mean of the entries in the consistent
list equaled the mean of the entries in the inconsistent list; the above
conjecture is, of course, that the mean of the entries should be larger in
the consistent list. This test is carried out using the test statistic

$$t = \frac{\bar{x}_1 - \bar{x}_2}{s_p \sqrt{1/n_1 + 1/n_2}}, \quad \text{where } s_p^2 = \frac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2},$$

and where $n_1$, $\bar{x}_1$, and $s_1^2$ are the number of entries, estimated mean, and
estimated variance for the consistent list; $n_2$, $\bar{x}_2$, and $s_2^2$ correspond to
the inconsistent list. The relevant distribution (subject to the usual
conditions) is the t-distribution with $n_1 + n_2 - 2$ df. Clearly, a positive
test statistic is needed to confirm the conjecture made above.
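A two-sample statistic of this kind can be computed with the standard library alone. The sketch assumes the usual pooled-variance form with sample variances (divisor n - 1); the function name `pooled_t` and the choice to return `None` when no statistic can be formed are assumptions for illustration.

```python
# Sketch: pooled two-sample t statistic for the consistent and
# inconsistent lists, with n1 + n2 - 2 degrees of freedom.
from math import sqrt
from statistics import mean, variance


def pooled_t(consistent, inconsistent):
    n1, n2 = len(consistent), len(inconsistent)
    if n1 == 0 or n2 == 0 or n1 + n2 <= 2:
        return None                      # no statistic can be formed
    s1 = variance(consistent) if n1 > 1 else 0.0
    s2 = variance(inconsistent) if n2 > 1 else 0.0
    df = n1 + n2 - 2
    sp2 = ((n1 - 1) * s1 + (n2 - 1) * s2) / df
    if sp2 == 0:
        return None                      # all entries identical
    t = (mean(consistent) - mean(inconsistent)) / sqrt(sp2 * (1 / n1 + 1 / n2))
    return t, df


if __name__ == "__main__":
    print(pooled_t([1, 2, 3], [4, 5, 6]))
```

A negative value, as in this toy call, means the consistent list has the smaller mean, which is the direction the threshold conjecture does not predict.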

One such test was performed for each family year, and the results were
rather mixed. Perhaps the best method of summarizing the results is just
to compare the number of positive statistics to the number of negative
statistics. This is done in table 1. Of 422 family years, 402 had
observations in both the consistent and inconsistent groups; among these
there were 129 positive statistics and 273 negative statistics. In other
words, the average entry actually tended to be larger in the inconsistent
group than in the consistent group, contrary to the conjecture based on the
threshold hypothesis. Indeed, if we test the null hypothesis that positive
and negative statistics are equally likely, it is rejected at any reasonable
level; the reason is that under the null hypothesis the number of negative
statistics has mean 201 and standard deviation 10.025, and the observed
number (273) is over 7 standard deviations away from 201. Hence, the
preponderance of negative statistics is statistically significant.


Table 1

Pairwise Comparisons: Absolute Differences

Year    Positive Statistics*    Negative Statistics†    No Statistics‡
1955            29                      52                    3
1956            27                      51                    6
1957            30                      92                    5
1958            43                      78                    6
Sum            129                     273                   20

* Indicates larger mean for consistent group than for inconsistent group.
† Indicates smaller mean for consistent group than for inconsistent group.
‡ Indicates no observations in either the consistent or inconsistent group.


Table 2

Pairwise Comparisons: Percentage Differences

Year    Positive Statistics     Negative Statistics     No Statistics
1955            24                      58                    3
1956            25                      55                    6
1957            29                      95                    5
1958            36                      79                    7
Sum            114                     287                   21
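The figures quoted for the sign test follow from the binomial distribution with equal probabilities. The short check below uses only the counts reported in table 1; the function name `sign_test_summary` is a hypothetical label.

```python
# Sketch: mean, standard deviation, and standardized distance for the
# number of negative statistics under the null of equal probabilities.
from math import sqrt


def sign_test_summary(positive, negative):
    n = positive + negative          # family years with a statistic
    mean = n / 2                     # binomial mean with p = 1/2
    sd = sqrt(n) / 2                 # binomial s.d. with p = 1/2
    z = (negative - mean) / sd
    return n, mean, sd, z


if __name__ == "__main__":
    n, mean, sd, z = sign_test_summary(129, 273)
    print(n, mean, round(sd, 3), round(z, 2))
```

The same function applied to the column sums of the other tables gives the corresponding distances for those comparisons.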

It is therefore apparently true that $|p^j q^j - p^j q^i|$ is typically greater
for inconsistent choices than for consistent ones. This result can be
interpreted as supporting the cognitive-range hypothesis and contradicting the
threshold hypothesis.

As a further check, the same procedure was followed except that in
this case we used percentage differences

$$\frac{|p^j q^j - p^j q^i|}{\tfrac{1}{2}(p^j q^j + p^j q^i)},$$

rather than just the absolute differences $|p^j q^j - p^j q^i|$. The results are
given in table 2 and are very similar to those reported in table 1.
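The percentage measure divides the absolute difference by the average of the two expenditures. A minimal sketch, assuming that reading of the formula and using the hypothetical name `percentage_difference`:

```python
# Sketch: absolute difference relative to the mean of the two
# expenditure levels (assumed to be strictly positive).

def percentage_difference(own_cost, other_cost):
    return abs(own_cost - other_cost) / (0.5 * (own_cost + other_cost))


if __name__ == "__main__":
    # Expenditures of 9 and 6 differ by 3 against an average of 7.5.
    print(percentage_difference(9.0, 6.0))
```

Because the measure is symmetric in its two arguments, it does not matter which bundle is taken as the base.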











Table 3

Pairwise Comparisons: Absolute Differences

Year    Positive Statistics     Negative Statistics     No Statistics
1955            34                      46                    2
1956            34                      43                    6
1957            50                      73                    4
1958            55                      66                    7
Sum            173                     228                   19


III

In the tests above, we considered bundles $q^i$ and $q^j$ such that $p^i q^i > p^i q^j$,
and then looked at the difference (in absolute or percentage terms)
between $p^j q^j$ and $p^j q^i$. It might be objected that the difference between
$p^i q^i$ and $p^i q^j$ is also relevant, since it, too, can be used to measure the
“difference” between $q^i$ and $q^j$. Therefore, the first test reported in
Section II was rerun, using the smaller of $|p^i q^i - p^i q^j|$ and $|p^j q^j - p^j q^i|$
rather than just $|p^j q^j - p^j q^i|$. The results are given in table 3. Once
again, the average difference is typically greater for the inconsistent group.

The differences which are being used in these tests can be illustrated
graphically in the two-goods case ($X_1$ and $X_2$). Suppose that a consumer
purchases three bundles of goods (at three different points in time), $q^0$
at price $p^0$, $q^1$ at price $p^1$, and $q^2$ at price $p^2$. The three bundles are
represented by one point each in figure 1. To avoid cluttering the diagram,
we let the price lines $p^0$ and $p^2$ coincide in the diagram.

It is easily seen that $q^0$ is revealed preferred to $q^1$ and that the choice
is consistent. Now, if we draw a (dotted) line through $q^0$, parallel to $p^1$,
then the difference $|p^1 q^1 - p^1 q^0|$ can be seen (in terms of $X_2$) as the
radius of the smaller circle on the vertical axis, or (in terms of $X_1$) as the
radius of the smaller circle on the horizontal axis. Similarly, the choice
between $q^1$ and $q^2$ is inconsistent, and the difference $|p^1 q^1 - p^1 q^2|$ can
be seen as the radius of the larger circle on the vertical axis or the larger
circle on the horizontal axis. Note that the figure is constructed so as to
support the cognitive-range hypothesis.

In conclusion, our evidence seems to support the hypothesis that the
cognitive range of consumers is limited, and that the consumer seems to
be able to discriminate better between small changes from his established
purchases than between widely different sets of purchases. The finding
thus points to the empirical relevance of theoretical work on integrability
conditions.

References 

Allen, R. G. D. Mathematical Analysis for Economists. London: Macmillan, 1938.

Georgescu-Roegen, N. "The Pure Theory of Consumer's Behavior." Q.J.E. 50
(August 1936): 545-93; reprinted in Analytical Economics: Issues and Problems.
Cambridge, Mass.: Harvard University Press, 1967.

Houthakker, H. S. "Revealed Preference and Utility Functions." Economica, n.s.,
17, no. 66 (May 1950): 159-74.

Hurwicz, L. "On the Problem of Integrability of Demand Functions." In
Preference, Utility and Demand, edited by J. S. Chipman, L. Hurwicz, M. K.
Richter, and H. F. Sonnenschein. New York: Harcourt Brace Jovanovich, 1971.

Koo, A. Y. C. "An Empirical Test of Revealed Preference Theory." Econometrica
31, no. 4 (October 1963): 646-64.

Koo, A. Y. C., and Hasenkamp, G. "Structure of Revealed Preference: Some
Preliminary Evidence." J.P.E. 80, no. 4 (July-August 1972): 724-44.

Quackenbush, G. G., and Shaffer, J. D. Collecting Food Purchase Data by Consumer
Panel. Agricultural Experiment Station, Technical Bull. 279, Michigan State
Univ., 1960.

Samuelson, P. A. "The Problem of Integrability in Utility Theory." Economica 17,
no. 68 (November 1950): 355-85; reprinted in The Collected Scientific Papers of
Paul A. Samuelson, edited by J. E. Stiglitz. Cambridge, Mass.: M.I.T. Press,
1966.


Consumers' Durable Goods, Human Capital, 
and the Pareto Optimality of the 
Lange-Lerner State 

Michael R. Darby
University of California, Los Angeles


This note shows that, in the presence of consumers' durable goods or
human capital, the Lange-Lerner state fails to achieve Pareto optimality
even in the limited sense of the current period's consumption and
production. This follows because different prices exist in the same market,
and this difference leads to distortions in the production-consumption
decision, entirely apart from the question of optimal intertemporal
allocation.

It is well known that Oskar Lange and Abba Lerner developed an
economic theory of socialism in which civil servants operate firms as if
profit maximizing in competitive factor and product markets. It has been
generally accepted that, under the assumptions of the model and aside
from certain highly special conditions, static Pareto optimality in the
current period's production and consumption will obtain, so that the
discussion has focused on other points.¹ This assertion is easily shown to
be in error, however, if it is recalled that some of the goods being purchased
by the household are consumers' durable goods.

In Lerner's The Economics of Control, consumers were allowed to
participate in the government capital market by borrowing or lending at the
same rate of interest as applied to firms.² This has generally been taken to
be inconsistent with socialist ideology, so I will be considering the
dominant assumption of Lange in which private lending and borrowing
are outlawed and the interest on savings deposits at the government
bank is arbitrary and “would not have any necessary connection with the
marginal net productivity of capital” (Lange 1938, pp. 85-86). That is,
the government firms bid in the capital market for the arbitrarily fixed

The author would like to acknowledge helpful comments from two anonymous referees
which have significantly improved the exposition.

¹ See, for example, I. M. D. Little (1957, pp. 260-66) and F. M. Bator (1958, pp.
353-56). A recent article by P. C. Roberts (1971) contains a survey of the literature.

² However, there were to be restrictions on total amounts (Lerner 1946, p. 269), so for
those individuals at the maximum, the following discussion applies.


(by the central planning board) amount of capital, thus determining the
interest rate, $r$. The interest rate on savings is assumed to be $r_s < r$.³

Consider the purchase and rental prices of a consumers' durable good
under the socialist regime. Take $X_1$ and $p_1$ as the quantity and price of a
pure service which enters directly into the utility function or can be
combined with the consumers' durable good $X_3$, say at a price $p_3$, to
produce a flow of services, $X_2$. Assume that $p_3$ correctly reflects the social
marginal cost.⁴ The production function of $X_2$ is

$$X_2 = f(X_1, X_3). \tag{1}$$

Now the cost function of a firm producing $X_2$ is

$$C(X_2) = p_1 X_1 + (r + \delta) p_3 X_3, \tag{2}$$

where the durable good depreciates (by evaporation) at the rate $\delta$ per
period. The profit function to be maximized is

$$\pi = p_2 f(X_1, X_3) - p_1 X_1 - (r + \delta) p_3 X_3. \tag{3}$$

The first-order conditions for a maximum require that

$$\frac{\partial \pi}{\partial X_1} = p_2 \frac{\partial f}{\partial X_1} - p_1 = 0, \tag{4}$$

$$\frac{\partial \pi}{\partial X_3} = p_2 \frac{\partial f}{\partial X_3} - (r + \delta) p_3 = 0. \tag{5}$$

Therefore, the service flow $X_2$ will be priced such that

$$p_2 = \frac{r + \delta}{\partial f / \partial X_3}\, p_3. \tag{6}$$


Now broaden the consumers' choice to include not only $X_1$ and $X_2$
but also $X_3$. Consider the consumer who can save or borrow at the state
bank and thus has an internal rate of discount equal to $r_s$.⁵ If he were to reduce


³ If $r_s = r$, then the results of this paper will not hold, and the lack of Pareto optimality
in the socialist state will be limited to the fact that the arbitrarily determined rate of
savings does not coincide with the optimum rate of savings in a time-preference sense.
Lange acknowledges this problem but denies its importance; surely a patent on that line
of debate would be worth a fortune. It is of course ridiculous for $r_s > r$ in a socialist
state.

⁴ The following argument shows that the failure of Pareto optimality follows from the
divergence of $r$ and $r_s$, and it makes no difference to the failure which particular subset
of prices properly reflects social marginal cost, if any. The socialist-firm prices are assumed
optimal only to abstract from the question of optimal intertemporal choice by assuming
that the government-determined rate of time preference is socially correct.

⁵ It is not clear if borrowing is allowed. The case of no borrowing will be considered
below.



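his savings so as to purchase some $X_3$, he could produce $X_2$ for himself at a
cost of

$$C_c(X_2) = p_1 X_1 + (r_s + \delta) p_3 X_3. \tag{7}$$

He will want to produce any level of $X_2$ in the cheapest way, so form the
Lagrangian expression

$$V = p_1 X_1 + (r_s + \delta) p_3 X_3 + \lambda [X_2 - f(X_1, X_3)]. \tag{8}$$

The first-order conditions for a minimum are

$$\frac{\partial V}{\partial X_1} = p_1 - \lambda \frac{\partial f}{\partial X_1} = 0, \tag{9}$$

$$\frac{\partial V}{\partial X_3} = (r_s + \delta) p_3 - \lambda \frac{\partial f}{\partial X_3} = 0, \tag{10}$$

$$\frac{\partial V}{\partial \lambda} = X_2 - f(X_1, X_3) = 0. \tag{11}$$

Since $\lambda$ is the shadow price of $X_2$, substitute $\hat{p}_2 = \lambda$ in equation (10) and
rearrange terms to get the consumer relationship⁶

$$\hat{p}_2 = \frac{r_s + \delta}{\partial f / \partial X_3}\, p_3. \tag{12}$$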
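The wedge between the firm price and the household shadow price can be illustrated with numbers. The sketch below assumes the two pricing relations take the form (rate + δ)p₃ divided by the marginal product of the durable, with the firm discounting at r and the household at the lower savings rate; the function names and all numerical values are hypothetical.

```python
# Sketch: firm rental price versus household shadow price of the service
# flow, and the household marginal product implied when the two prices
# are forced to be equal. All numbers are illustrative.

def service_price(rate, delta, p3, marginal_product):
    """(rate + delta) * p3 / (df/dX3)."""
    return (rate + delta) * p3 / marginal_product


def implied_marginal_product(rate, delta, p3, price):
    """Marginal product of the durable consistent with a given service price."""
    return (rate + delta) * p3 / price


if __name__ == "__main__":
    r, r_s, delta, p3, mp = 0.10, 0.04, 0.10, 100.0, 2.0
    firm = service_price(r, delta, p3, mp)
    household = service_price(r_s, delta, p3, mp)
    # Same marginal product: the household shadow price is lower.
    print(firm, household)
    # Same price: the household marginal product must be lower.
    print(implied_marginal_product(r_s, delta, p3, firm))
```

The first comparison holds the marginal product fixed and the second holds the price fixed; intermediate outcomes combine both adjustments.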


Since $r_s$ is less than $r$, either (1) the shadow price $\hat{p}_2$ must be less than the
firm price $p_2$, or (2) the household marginal product of $X_3$ must be less
than the firm marginal product, or (3) both must occur.⁷ Since the social
marginal rate of transformation between $X_2$ and, say, good $X_1$, produced
by the socialist firms,⁸ is given by $p_2/p_1$, there will be failure of Pareto
optimality. In case 1, this is because the consumer is consuming according
to inappropriate price ratios so that there is inefficiency in consumption
(a shift of resources from the household production of $X_2$ to the firm
production of $X_1$ would result in increased utility). In case 2, there is
inefficiency in production since a shift of production of $X_2$ from the
household to the firms would result in increased output. In case 3, there
is inefficiency both in consumption and in production. Summarizing, in
cases 1 and 3, consumers shift savings from the savings bank and
consumption from the firm-produced output to consumption of service flows


⁶ A similar discussion could be made about inefficiencies in the use of $X_1$, but as it
need not be an argument of the production function, that argument is left aside.

⁷ These cases correspond to the output effect, substitution effect, and both effects in the
theory of production. I exclude the possibility that $X_3$ is an inferior factor.

⁸ For some products, the optimum scale of output will be sufficiently large that
household production would not be worthwhile despite the lower cost of capital to the household
than the firm. This is particularly likely to be the case where capital investment is small
relative to total costs.



from consumers' durable goods, which are made relatively attractive by
the divergence of $r_s$ from $r$.⁹ In cases 2 and 3, there will be inefficient
production in the household so that the marginal return to investment is
reduced to or toward $r_s$. Not surprisingly, the closure of the market to
private producers leads to increased nonmarket production and the
resulting inefficiencies.

As noted above, it is not clear whether borrowing is to be allowed,
so consider also the case in which it is prohibited.¹⁰ For those individuals
who would still have savings in the state bank, the analysis is unaltered.
For those who would want to borrow at $r_s$ but not at $r_i < r$, the analysis
is altered to the extent that the internal rate of discount is increased to $r_i$,
so that savings in the form of consumers' durable goods yields a return at
the rate $r_i$, but otherwise is unaffected. Only for those individuals who
would borrow even at a rate of interest $r$ does Pareto optimality hold,
since they purchase only the rental services and produce no $X_2$
themselves.¹¹

If the last class contains all citizens, so that at the market rate of
interest no one wishes to save even in the form of durables, then Pareto
optimality with respect to current consumption and production is achieved.
Comparison with the real world makes me, at least, doubt that, in that
case, the intertemporal distortions would be of the second order of
magnitude.

Another way around the problem might be to sell only rental services.
These must not be leased over more than one period, of course. But then
the comparison with the capitalist state is rather altered, since there
is reason to believe that, at least for some items such as automobiles or
clothes, rental of services is considerably less efficient than consumer
ownership.¹²

Basically we have been talking about distortions due to allowing
consumers to save by investing in market goods, which alters either their
production or demand conditions or both. There is no basic difference in
this respect between an investment in human capital and an investment in
a durable good. So even were the socialist state to get around the problem
of consumers' durable goods, there is still the problem of human capital.
Thus the state would have to determine each individual's education and
⁹ It is an interesting question, which need not be considered in detail here, whether
investment for other purposes is to be affected by a shift of production of services of
consumers' durable goods from the firms to the household sector. It would appear that some
sort of “production illusion” is present if it is allowed to increase total social investment,
but this problem has apparently not been considered by socialist writers.

¹⁰ Borrowing at a different interest rate than $r_s$ is a straightforward extension of the
analysis.

¹¹ Only those who would neither borrow nor save at exactly $r$ might produce some $X_2$
themselves.

¹² Alchian and Demsetz (1972) discuss some aspects of this problem.



occupation, and this is not only a contradiction with Lange's assumption
of “freedom of choice of occupation” (1938, p. 83), but returns to the
forefront the requirements of huge (impossible?) amounts of information
on the part of the central planning board. If socialism is to be efficient
even in the single period, then control over all produced means of
production, including human capital, is required, and this is backbreaking
both in terms of information requirements and freedom.


References 

Alchian, Armen A., and Demsetz, Harold. "Production, Information Costs, and
Economic Organization." A.E.R. 62 (December 1972): 777-95.

Bator, Francis M. "The Anatomy of Market Failure." Q.J.E. 72 (August 1958):
351-79.

Lange, Oskar. "On the Economic Theory of Socialism." In On the Economic
Theory of Socialism, edited by Benjamin E. Lippincott. Minneapolis: Univ.
Minnesota Press, 1938.

Lerner, Abba P. The Economics of Control. New York: Macmillan, 1946.

Little, I. M. D. A Critique of Welfare Economics. 2d ed. Oxford: Oxford Univ. Press,
1957.

Roberts, Paul Craig. "Oskar Lange's Theory of Socialist Planning." J.P.E. 79
(May/June 1971): 362-77.



A Test of the Permanent-Income 
Hypothesis: Comment 

James M. Holmes

State University of New York at Buffalo


In a recent article in this Journal, Prem Laumas (1969) proposes a new
test, which has been repeated elsewhere by Laumas and Mohabbat
(1972), of Friedman’s permanent-income hypothesis (PIH), using
Canadian national income data. He concluded that this hypothesis
should be rejected. In the present paper, we shall demonstrate that
Laumas’s test is invalid on both a theoretical and an empirical basis. In the
Laumas model, it is possible to test seven assumptions which constitute
Friedman’s hypothesis. In fact, five of Friedman’s seven assumptions are
violated in Laumas’s tests of the remaining two. Thus, even if Laumas’s
theoretical analysis were satisfactory, it could not be used to reject
Friedman’s theory. Moreover, Laumas’s empirical evidence suffers from
one major and one minor defect. Finally, we present a rigorous direct test of
Friedman’s hypothesis which violates none of the PIH assumptions, using
exactly the same Canadian national income data as used by Laumas,
except that our data will be deflated. This direct test leads to a rejection of
the PIH when consumption is defined to include consumer-durable
expenditures. However, when consumption is defined to exclude such
expenditures, this part of Friedman’s theory is not easily rejected on the
basis of the Canadian evidence. This result is in sharp contrast to Laumas’s
claim and to that obtained from the United States evidence previously
reported by Holmes (1970).


I. Friedman’s Theory and Laumas’s Test 

Friedman (1957, chap. 3) formulated the following model, where
$C_p$ = permanent consumption, $C_T$ = transitory consumption, $C$ =
measured consumption, $Y_p$ = permanent income, $Y_T$ = transitory
income, and $Y$ = measured income. All variables are contemporaneous
unless indicated otherwise by a bracketed $t$. The term $\rho_{XY}$ denotes the theoretical


I wish to acknowledge the aid of my research assistant Emmanuel Tzanalus, Bryon
Spencer of McMaster University for his help with the data, and Donald Ebbeler, David
Smyth, and Nagesh Revankar for their comments.



correlation between X and Y.


$$C_p = kY_p, \tag{1}$$

$$Y = Y_p + Y_T, \tag{2}$$

$$C = C_p + C_T, \tag{3}$$

$$\rho_{Y_T Y_p} = 0, \tag{4}$$

$$\rho_{C_T C_p} = 0, \tag{5}$$

$$\rho_{Y_T C_T} = 0. \tag{6}$$


From equation (1), we infer that transitory income has no effect upon
permanent consumption. From equations (1) and (4), we deduce equation
(7), and from equations (1) and (5), we deduce that permanent income
has no effect on transitory consumption. Thus,

$$\rho_{C_p Y_T} = 0, \tag{7}$$

$$\rho_{C_T Y_p} = 0. \tag{8}$$
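The two deductions can be written out in one line each. The following is only a sketch, assuming $k > 0$ (so that multiplying by $k$ leaves a correlation coefficient unchanged); the source states the results but not these intermediate steps.

```latex
% Equation (7): substitute C_p = k Y_p from (1), then apply (4).
\rho_{C_p Y_T}
  = \frac{\operatorname{cov}(kY_p,\, Y_T)}
         {\sqrt{\operatorname{var}(kY_p)\,\operatorname{var}(Y_T)}}
  = \rho_{Y_p Y_T} = 0 .

% Equation (8): write Y_p = C_p / k from (1), then apply (5).
\rho_{C_T Y_p}
  = \frac{\operatorname{cov}(C_T,\, C_p/k)}
         {\sqrt{\operatorname{var}(C_T)\,\operatorname{var}(C_p/k)}}
  = \rho_{C_T C_p} = 0 .
```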

Although Friedman did not specify the distribution of the error term
$C_T$ in the regression of $C$ on $Y_p$, his statistical tests implicitly assumed that
$C_T$ was distributed normally with expectation zero and constant variance:

$$C_T \sim \Phi(0, \sigma_T^2). \tag{9}$$

Friedman ran the regression of $C$ on $Y_p$, which is derived from equations
(1) and (3):

$$C(t) = kY_p(t) + C_T(t). \tag{10}$$

The estimates of the marginal propensity to consume $k$, granted the
assumptions above, will be best linear and unbiased if $Y_p(t)$ is defined.
Friedman defined $Y_p(t)$ (1957, pp. 142-46) as

$$Y_p(t) = \beta \int_{-\infty}^{t} e^{(\beta - \alpha)(\tau - t)}\, Y(\tau)\, d\tau, \tag{11}$$

where $\beta$ is the coefficient determining expectations adjustments and $\alpha$ is
the secular rate of growth of permanent income.¹ Friedman assumed
$\beta = .4$ and $\alpha = .02$.
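As a rough numerical illustration of equation (11), the weights $\beta e^{(\beta-\alpha)(\tau-t)}$ integrate to $\beta/(\beta-\alpha)$ over the infinite past, so they total exactly one only when $\alpha = 0$. The sketch below evaluates that total for parameter values mentioned in the text; the function names are ours and purely illustrative, and the numerical check truncates the integral at an assumed horizon.

```python
import math

def total_weight(beta, alpha):
    """Closed-form integral of beta*exp((beta-alpha)*(tau-t)) for tau from -inf to t."""
    if beta <= alpha:
        raise ValueError("the integral converges only when beta > alpha")
    return beta / (beta - alpha)

def total_weight_numeric(beta, alpha, horizon=200.0, steps=200_000):
    """Midpoint-rule check of the same integral, truncated `horizon` years back."""
    h = horizon / steps
    return sum(beta * math.exp(-(beta - alpha) * (i + 0.5) * h) * h
               for i in range(steps))

# Parameter pairs taken from the text: Friedman's values, the growth-adjusted
# Canadian case, and a no-growth case.
for beta, alpha in [(0.4, 0.02), (0.4, 0.035), (0.35, 0.0)]:
    print(beta, alpha, round(total_weight(beta, alpha), 4))
```

With $\beta = .4$ and $\alpha = .035$ the total is about 1.096, and with $\alpha = 0$ it is exactly one, whatever the value of $\beta$.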

The model Laumas uses to test Friedman’s theory consists of basically
two regression equations:

$$C_p^L = \alpha_0 + \alpha_1 Y_p + e, \tag{12}$$

$$C_T^L = B_0 + B_1 Y_T + V, \tag{13}$$

¹ I have demonstrated that this assumption contradicts the previous assumptions. The
theoretical defects of the PIH may, in fact, be more limiting than the empirical, since they
may apply to any use of the permanent income concept (see Holmes 1971). For exceptions,
see Ebbeler (1971).




where $C_p^L$ and $C_T^L$ are permanent and transitory consumption as defined
by Laumas. The $e$ and $V$ are stochastic error terms, which, we may infer
on the basis of Laumas’s statistical tests (1969, p. 859), are each normally
distributed with expectation zero and constant variance. The measure of
permanent consumption is given by a scheme similar to permanent
income:

$$C_p^L(t) = \lambda \int_{-\infty}^{t} e^{\lambda(\tau - t)}\, C(\tau)\, d\tau. \tag{14}$$

Laumas assumed $\lambda = .6$ for consumption defined both to include and
exclude expenditures on consumer durables.² For these two differing
definitions of consumption and thus permanent consumption, he assumed,
respectively, that $\beta = .45$ and $\alpha = 0$, and $\beta = .35$, $\alpha = 0$, in equation (11)
defining permanent income. Thus, equations (12) and (13) apply for two
definitions of consumption and permanent income.

Laumas claims that Friedman’s theory implies that $E(\alpha_1) > 0$ and
$E(B_1) = 0$ in equations (12) and (13) (i.e., in terms of correlation
coefficients, the assumptions in eqq. [1] and [6]). He tested these two
statements and found the latter unacceptable on the basis of the evidence
from the Canadian economy.

Friedman’s theory does not imply that $E(B_1) = 0$. Equation (12)
is not equivalent to equation (1), that is, $C_p^L \neq C_p$. Permanent
consumption according to Friedman is estimated to be exactly proportional
to permanent income. Laumas assumes in equation (14) that $C_p^L$ is
estimated as a moving average of past consumption and that this estimate
contains an error, that is, $E(e) \neq 0$. Thus, in equation (13), $C_T^L \neq C_T$.
According to Friedman, $C_T$ is also not estimable independent of income,
a result contrary to Laumas.

It is possible to prove, using the model in Holmes (1973), that $C_p^L \neq C_p$,
$C_T^L \neq C_T$, $C_p^L > Y_p$, whereas Friedman’s theory always does well in
“explaining” consumption, that is, $C_p(t) = C(t)$ and $C_T = 0$.

The empirical evidence presented by Laumas contains one major error:
the weights used to construct permanent income sum to less than unity.³
If there is secular growth in income, that is, $\alpha > 0$, these weights
should instead sum to more than unity. If one regresses current consumption
including durables in either real or nominal terms on either permanent
² Laumas (1969, p. 859 n.) states: “We finally chose disposable income series with
$\beta = .35$ and $\beta = .45$ by using consumption series excluding durable goods, and
consumption series including durable goods, respectively, by the goodness of fit criteria.
Permanent consumption was also computed similarly. The coefficients turned out to
be .6 for both consumption series (including and excluding durable goods).”

³ Laumas states (ibid.): “With $\beta = .35$, fourteen years (1930-43) were used up in
computing permanent income. With $\beta = .35$, the weights used were (4 decimal places)
.2953, .2081, .1466, .1033, .0728, .0513, .0362, .0255, .0180, .0127, .0089, .0063, .0044,
.0031, .0022, .0015, and .0011. The sum of these weights equals unity.” The sum of the
first 15 weights is .9973 − .0026 = .9947. These 15 weights should be normalized, and
$\alpha$ should be > 0.





TABLE 1

Some Estimatable Statistics in the Laumas Version of PIH

N = 23                      Including Durables    Excluding Durables
                            ($\beta = .45$)       ($\beta = .35$)

$E(C_T^L)$                  1,463.21              1,242.17
  Standard error            (559.49)              (472.72)
$\rho_{C_T^L C_p^L}$        .8424                 .8396
$\rho_{C_T^L Y_p}$          .8458                 .8430
$\rho_{C_p^L Y_T}$          .8519                 .8763
$\rho_{Y_T Y_p}$            .8593                 .8815

Note: The data used were Canadian National Accounts series 30014, 30033, 30300, 30340, 30972, and
3105ft, kindly supplied to me by Bryon Spencer of McMaster University, and with the exception of the price
index correspond to Laumas’s data. The weights used in constructing the moving averages are corrected to
sum to unity, however.


income using $\beta = .35$ or $\beta = .45$, after the weights of such series are
normalized, the marginal propensity to consume is always greater than
one. Moreover, for several years in the 1960s, consumption was greater
than permanent income using Laumas’s weights.
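The weights quoted from Laumas are reproduced to four decimal places by the discrete form $(1 - e^{-\beta})e^{-\beta i}$, $i = 0, 1, 2, \ldots$, with $\beta = .35$. That functional form is our inference from the quoted figures, not something stated in the source, and the helper names below are hypothetical. The sketch shows why a truncated set of such weights falls short of unity and how normalization repairs it.

```python
import math

def discrete_weights(beta, n_terms):
    """Assumed form: (1 - e^-beta) * e^(-beta*i) for i = 0 .. n_terms-1."""
    first = 1.0 - math.exp(-beta)
    return [first * math.exp(-beta * i) for i in range(n_terms)]

def normalize(weights):
    """Rescale the weights so that they sum to one."""
    total = sum(weights)
    return [w / total for w in weights]

w17 = discrete_weights(0.35, 17)   # the 17 weights quoted from Laumas
w15 = discrete_weights(0.35, 15)   # the 15 weights Holmes says should be kept

print([round(w, 4) for w in w17[:4]])        # compare with the quoted weights
print(round(sum(w17), 4), round(sum(w15), 4))  # both fall short of unity
print(round(sum(normalize(w15)), 4))
```

Because a truncated geometric series sums to $1 - e^{-\beta n}$, any finite number of terms leaves a shortfall; dividing by the truncated total is the normalization referred to in the text.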

The direct empirical evidence concerning the validity of the Laumas
tests supports our theoretical criticism of these tests. In Friedman’s
regression equation (10), the estimates of $E(C_T)$, $\rho_{C_T C_p}$, and $\rho_{C_T Y_p}$ are
constrained to be zero. In the Laumas model, equations (12) and (13),
these statistics can be estimated. Moreover, we can estimate $\rho_{C_p Y_T}$ and
$\rho_{Y_T Y_p}$ in the Laumas model. Table 1 presents the estimates of these
statistics using the same nominal Canadian national income data and time
period (1944-66) as Laumas uses.

One must conclude on the basis of this evidence that the assumption
that $E(C_T^L) = 0$ is not acceptable and that the correlation coefficients are
significantly different from zero at the .01 level of significance. Clearly,
if a statistical test of a hypothesis violates any of the assumptions under
which that hypothesis is supposed to hold, the test is invalid. In the case
at hand, every assumption of the PIH which is not tested (i.e., $\rho_{C_p Y_p} > 0$
and $\rho_{C_T Y_T} = 0$) is violated. The fact that $B_1 \neq 0$ cannot be considered
as evidence upon which to reject (or accept) the PIH under these
circumstances.

There is a further theoretical difficulty in this test. The estimated
correlation between permanent and transitory income is positive. As I
demonstrated elsewhere (1971), under certain circumstances, the theoretical
correlation between these two variables has a positive expectation when
the permanent series is defined by an equation such as (11) or (14). Thus,
we can expect that, under such circumstances, $\rho_{C_p^L C_T^L} > 0$ and $\rho_{Y_p Y_T} > 0$.
According to Friedman’s theory, $\rho_{C_p Y_p} > 0$. Thus, if $C_T$ is a stochastic
term as Friedman assumed, then equations (12) and (13) each contain
part of the effect of $C_p$ in the dependent variable. This may explain why





TABLE 2

Alternative Regressions of Consumption on Both
Transitory and Permanent Income

Line  $Y_p^*$ series                       $\rho$    $\gamma_0$   $\gamma_1$   $\gamma_2$   D-W      $R^2$

A. Consumption including durables:

 1    $Y_p^*(\beta = .35)$                 0         -3,036       .261         1.130        .658     .993
                                                     (446.3)      (.244)       (0.034)
 2    $Y_p^*(\beta = .35)$                 .6030     -2,380       .137         1.116                 .981
                                                     (838.9)      (.166)       (0.044)
 3    $Y_p^*(\beta = .45)$                 0         -3,077       .215         1.116        .749     .993
                                                     (464.6)      (.283)       (0.032)
 4    $Y_p^*(\beta = .45)$                 .6045     -2,326       .077         1.097                 .981
                                                     (832.5)      (.176)       (0.041)
 5    $Y_p^*(\beta = .4, \alpha = .035)$   0         -3,141       .242         1.056        .8672    .9937
                                                     (444)        (.235)       (0.019)
 6    $Y_p^*(\beta = .4, \alpha = .035)$   .4965     -2,306       .345         1.024                 .9856
                                                     (714)        (.160)       (0.029)

B. Consumption excluding durables:

 7    $Y_p^*(\beta = .35)$                 0         -806.8       .304         0.889        .804     .994
                                                     (318.7)      (.174)       (0.024)
 8    $Y_p^*(\beta = .35)$                 .5341     -484.6       .134         0.903                 .985
                                                     (542.7)      (.129)       (0.030)
 9    $Y_p^*(\beta = .45)$                 0         -852.5       .223         0.893        .749     .994
                                                     (321.5)      (.196)       (0.023)
10    $Y_p^*(\beta = .45)$                 .5528     -469.2       .065         0.890                 .988
                                                     (540.8)      (.132)       (0.028)
11    $Y_p^*(\beta = .4, \alpha = .035)$   0         -880         .288         0.848        .9031    .99504
                                                     (316)        (.168)       (0.0136)
12    $Y_p^*(\beta = .4, \alpha = .035)$   .4829     -383         .272         0.828                 .98859
                                                     (513)        (.119)       (0.0210)

Note: In the first column the value of $\beta$ in parentheses refers to the weights used to construct the $Y_p$
series. In the second column the value of $\rho$ is given which was utilized in conjunction with eq. (16) to estimate
the regression equation. Figures in parentheses, in columns 3-5, represent standard errors.


$B_1 > 0$. It also implies that, if $B_1 > 0$, we cannot distinguish between this
being caused by $\rho_{C_T Y_T} > 0$ or $\rho_{Y_p Y_T} > 0$.


II. A Correct Test of Friedman’s Theory 

The PIH summarized in equations (1) through (11) implies in equation
(15) that $\gamma_0 = 0 = \gamma_1$ and $1 > \gamma_2 > 0$. We utilize real instead of
nominal values of these variables, following Friedman. We star such
variables. We can test if $\rho_{Y_p^* Y_T^*} = 0$.

$$C^* = \gamma_0 + \gamma_1 Y_T^* + \gamma_2 Y_p^* + C_T^*. \tag{15}$$

The estimates of equation (15), assuming $C_T^*$ is distributed as in equation
(9) and alternatively assuming that

$$C_T^*(t) - \rho\, C_T^*(t - 1) \sim \Phi(0, \sigma_1^2), \tag{16}$$

are presented in table 2. We estimated equation (15) using the two
measures of consumption and the two measures of permanent income
used by Laumas after the weights were normalized. In addition, we




constructed a permanent income series using the weights derived for
$\beta = .4$ and $\alpha = .035$, which is an estimate of the postwar growth in
Canadian income. In this latter case the sum of the weights exceeded
unity; they equaled approximately 1.096.⁴

We note that Friedman’s PIH must always be rejected when the
dependent variable includes durables expenditure, in the first six equations.
First, because $\gamma_0 \neq 0$ for any specification of $Y_p$ or for either specification
of the disturbance term, $C_T^*$ (the smallest t-value in absolute terms is −2.794,
in line 4). More critical to Friedman’s theory is the fact that $\gamma_2$ is significantly
greater than one at the .05 level in the first five equations. Friedman’s theory
cannot be rejected on the basis that the marginal propensity to consume out of
transitory income ($\gamma_1$) is different from zero in these first five equations.
However, in equation (6), the hypothesis that transitory income has no
effect upon permanent consumption must be rejected. Equation (6) is,
in my opinion, the one most relevant to the PIH, as it seems silly to assume
the expected secular rate of growth of income in postwar Canada was
zero.
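The significance statements above can be checked against table 2 by dividing each estimate's distance from its hypothesized value by its standard error. The sketch below does this for $\gamma_2$ in lines 1-6; the one-sided 5 percent critical value of 1.725 assumes 20 degrees of freedom (23 observations less three coefficients), which is our assumption and not a figure given in the source.

```python
# (gamma_2 estimate, standard error) for lines 1-6 of table 2.
LINES = {
    1: (1.130, 0.034), 2: (1.116, 0.044), 3: (1.116, 0.032),
    4: (1.097, 0.041), 5: (1.056, 0.019), 6: (1.024, 0.029),
}

# One-sided 5% point of Student's t with 20 d.f. (assumed degrees of freedom).
T_CRIT = 1.725

def t_against(estimate, std_error, null_value):
    """t-ratio for the hypothesis that the coefficient equals null_value."""
    return (estimate - null_value) / std_error

for line, (g2, se) in LINES.items():
    t = t_against(g2, se, 1.0)
    print(line, round(t, 2), "exceeds one" if t > T_CRIT else "not distinguishable from one")

# The intercept t-value cited in the text for line 4.
print(round(t_against(-2326.0, 832.5, 0.0), 3))
```

On these figures the first five lines clear the critical value and line 6 does not, which matches the statement that $\gamma_2$ is significantly greater than one only in the first five equations.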

When real consumption excluding durables expenditures is the dependent
variable, the PIH is rejected when it is assumed, following Friedman,
that $C_T^* \sim \Phi(0, \sigma_T^2)$, since $\gamma_0$ is significantly less than zero at the .05 level.
If one is willing to drop this portion of Friedman’s theory, then the
estimates of the marginal propensities to consume out of permanent and
transitory income are in good agreement with the PIH. However, the
D-W statistic indicates considerable serial correlation in the residuals of
these three equations (lines 7, 9, and 11). When the error term $C_T^*$ is
assumed to be generated from a first-order Markov process, Friedman’s
PIH cannot be rejected if we use Laumas’s weights, that is, if one assumes
the expected secular rate of growth of income is zero. However, if we do
assume that the secular rate of growth of the real income of Canada is
3.5 percent, our results (eq. [12]) are similar to those obtained for the
United States economy: transitory income has a significant effect on
permanent consumption.
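For readers unfamiliar with the two devices used here, the following sketch computes the Durbin-Watson statistic from a list of residuals and applies the quasi-difference $x(t) - \rho\,x(t-1)$ implied by equation (16). The source does not describe the estimation procedure actually used, so this is only an illustration of the definitions; the function names and the sample residuals are hypothetical.

```python
def durbin_watson(residuals):
    """Sum of squared successive differences divided by the sum of squared residuals."""
    num = sum((residuals[i] - residuals[i - 1]) ** 2
              for i in range(1, len(residuals)))
    den = sum(e * e for e in residuals)
    return num / den

def quasi_difference(series, rho):
    """Return x(t) - rho * x(t-1); the first observation is dropped."""
    return [series[i] - rho * series[i - 1] for i in range(1, len(series))]

# Hypothetical residuals with strong positive serial correlation:
# each value is 0.8 times the one before it.
resid = [1.0, 0.8, 0.64, 0.512, 0.4096]
print(round(durbin_watson(resid), 3))      # far below 2
print(quasi_difference(resid, 0.8))        # essentially zero once rho is removed
```

A statistic near 2 indicates no first-order serial correlation, and values well below 2, like those in the D-W column of table 2, indicate positive serial correlation. Applying the quasi-difference to both sides of equation (15) leaves a disturbance of the form in equation (16).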

Last, we can test if $\rho_{Y_T Y_p} = 0$. For $\beta = .35$, $\rho_{Y_p Y_T} = 0.7856$, and
$\rho_{Y_p Y_T} = .7595$ for $\beta = .45$, both of which imply that this assumption is not
acceptable at the usual level of significance. It is a result consistent with
our theoretical expectations (Holmes 1971).

In conclusion, we have demonstrated that Laumas’s new measure of
permanent consumption does not correspond to Friedman’s. Moreover,
we showed that Laumas’s test of the PIH violated all of the assumptions
⁴ The fact that as $\alpha \gtreqless 0$ the sum of the weights should exceed, equal, or be less than
unity, as I demonstrate in Holmes (1973), is not widely recognized by previous researchers.
In my opinion, there appears to be, on both theoretical and empirical grounds, sound basis
for considering the role of $\alpha$, the expected secular rate of growth of income, more critical
to the PIH than that of $\beta$, the coefficient of expectations adaptation.





underlying that hypothesis which were not being tested. In the second
section of this paper, we presented a test of Friedman’s PIH consistent
with his assumptions and found that, if consumption is defined to exclude
durable expenditures, the experience of the Canadian postwar economy
could be considered in good agreement with this hypothesis if we follow
Laumas and assume $\alpha = 0$. However, if we assume that $\alpha = .035$, in
our opinion a more reasonable assumption, we find that transitory income
is a significant explanatory variable. We can conclude on the basis of this
evidence that there may be a critical difference empirically between the
role played by transitory income and the expected rate of secular growth
of income in the United States and Canadian economies.


References 

Ebbeler, D. H. “On the Measurement of the Permanent Component of a Series.”
Claremont Economic Papers no. 6 (July 1971), revised in J. American Statis.
Assoc., in press.

Friedman, M. A Theory of the Consumption Function. Princeton, N.J.: Princeton
Univ. Press, 1957.

Holmes, J. M. “A Direct Test of Friedman’s Permanent Income Theory.” J.
American Statis. Assoc. 65 (September 1970): 1159-62.

-----. “A Condition for Independence of Permanent and Transitory
Components of a Series.” J. American Statis. Assoc. 66 (March 1971): 13-15.

-----. “The Role of Secular Growth in the Permanent Income Hypothesis.”
Discussion Paper no. 274, State Univ. New York at Buffalo, 1973.

Laumas, Prem S. “A Test of the Permanent Income Hypothesis.” J.P.E. 77, no. 5
(September 1969): 857-61.

Laumas, Prem S., and Mohabbat, Khan A. “The Permanent Income Hypothesis:
Evidence from Time-Series Data.” A.E.R. 62 (September 1972): 730-35.



A Test of the Permanent-Income
Hypothesis: Reply

Prem S. Laumas
Northern Illinois University


Holmes raises three main issues with respect to my (1969) test of the
permanent-income hypothesis (PIH): (a) that my definition of permanent
consumption is not equivalent to that of Friedman; (b) that the weights
used to derive permanent income and permanent consumption do not
sum to unity and hence the values of permanent income, permanent
consumption, transitory income, and transitory consumption used
in my test are not the “correct” values; (c) that I ignored the average-
trend rate of growth in calculating permanent income.

Before I consider these issues, let me state that the conclusions arrived
at by Holmes are not inconsistent with mine. He states: “This direct test
leads to a rejection of the PIH when consumption is defined to include
consumer durable expenditure. However, when consumption is defined
to exclude such expenditures, this part of Friedman’s theory is not easily
rejected on the basis of Canadian evidence” (1974).

Holmes himself points out that his equations (6) and (12) in table 2
are the really relevant equations. The results of these equations lead to a
mild rejection of the PIH for Canada. Thus, according to equation (6) of
table 2, when consumption is defined to include durable goods, the
marginal propensity to consume out of transitory income is equal to .345,
and the marginal propensity to consume out of permanent income is
equal to 1.024. When consumption excludes durable goods (according to
eq. [12] of table 2), the marginal propensity to consume out of transitory
income is equal to .272 and out of permanent income is equal to .828.
Since consumption, according to the PIH, is defined to include the value
of all nondurable goods plus the rental value of durable goods, one might
expect that the true marginal propensities to consume transitory and
permanent incomes would lie within these limits. It is likely that the true
marginal propensities to consume out of transitory and permanent incomes
would be approximately .30 and .90, respectively. These results are not
very different from those that I reported in my paper and are very similar
to those reported in tables 2 and 3 below.

Let us now turn to the three issues raised by Holmes.

The author is thankful to Gurcharan Laumas and Eldon Roiling for helpful comments
on an earlier draft of this paper.





Definition of Permanent Consumption 

Holmes argues that my definition of permanent consumption is not the
same as Friedman’s. I do not fully agree with this. I made several series
of permanent consumption, according to different values of the weighting
coefficient, and chose that particular series of permanent consumption ($C_p$)
which showed the highest correlation with the permanent income ($Y_p$) series
which had already been selected. I justify this procedure on the ground that I
provisionally assume that Friedman is right in that $C_p$ and $Y_p$ are most
highly correlated and then try to test if the proportionality hypothesis
and its most important assumption, namely, the supposed marginal
propensity to consume out of transitory income being zero, is valid or not.
A series of $C_p$ generated in this way is unacceptable to Holmes. Rather,
he suggests that according to the PIH, $C_p$ is by definition proportional to
$Y_p$. I cannot agree with Holmes on this point. Obviously, one can
calculate $C_p$ as $kY_p$ (as reported below). If the parameter $k$ is truly constant,
then $C_p$ is perfectly correlated with $Y_p$, not just highly correlated, as I
assumed. This method of calculating $C_p$ is acceptable to Holmes. But is
this a correct test of the PIH? My answer is no. How can one claim to
test a hypothesis when the testing procedure is predesigned to validate the
hypothesis?

Friedman’s PIH comes out with flying colors if one were to assume that
permanent consumption is proportional to permanent income, that is,
$C_p = kY_p$. Permanent consumption was found by taking the estimated
marginal propensity to consume and multiplying it into the constructed
permanent-income series. Transitory income and transitory consumption
were found by deducting permanent income and permanent consumption
from their respective measured values. Estimates were made for the
following equations:

$$C_p^* = \alpha_0 + \alpha_1 Y_p + \mu_1, \tag{1}$$

$$C_p = \beta_0 + \beta_1 Y_p + \mu_2, \tag{2}$$

$$C_t^* = \gamma_0 + \gamma_1 Y_t + \mu_3, \tag{3}$$

$$C_t = \delta_0 + \delta_1 Y_t + \mu_4, \tag{4}$$

where $C_p^*$ is aggregate permanent consumption, including durable goods,
and $C_p$ is permanent consumption excluding durable goods; $C_t^*$ and $C_t$
are the corresponding transitory consumptions; $Y_p$ and $Y_t$ represent
permanent and transitory incomes, respectively; and $\mu_1$, $\mu_2$, $\mu_3$, and $\mu_4$
are the error terms of the regression equations. In table 1, we present
the results on this basis; weights are normalized to sum to unity, and,
following Holmes, $\alpha$ was assumed to be 3.5 percent.
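The construction just described can be sketched as follows: permanent income is a weighted sum of current and past measured income, permanent consumption is $k$ times permanent income, and each transitory component is the measured value less its permanent counterpart. The function names, the two-term weights, the value of $k$, and the data are all hypothetical and chosen only to keep the arithmetic visible.

```python
def permanent_series(measured, weights):
    """Weighted sum of current and past values; weights[0] applies to the current year.
    The first len(weights) - 1 observations are used up in forming the average."""
    n = len(weights)
    return [sum(w * measured[t - i] for i, w in enumerate(weights))
            for t in range(n - 1, len(measured))]

def decompose(income, consumption, weights, k):
    """Split income and consumption into permanent and transitory parts,
    with permanent consumption defined as k times permanent income."""
    offset = len(weights) - 1
    y_p = permanent_series(income, weights)
    c_p = [k * v for v in y_p]
    y_t = [y - p for y, p in zip(income[offset:], y_p)]
    c_t = [c - p for c, p in zip(consumption[offset:], c_p)]
    return y_p, y_t, c_p, c_t

# Hypothetical three-year series and weights, for illustration only.
income = [100.0, 110.0, 120.0]
consumption = [90.0, 95.0, 100.0]
y_p, y_t, c_p, c_t = decompose(income, consumption, weights=[0.5, 0.5], k=0.9)
print(y_p, y_t)
print(c_p, c_t)
```

Because $C_p$ is an exact multiple of $Y_p$ by construction, its correlation with $Y_p$ is one, which is the point at issue in the surrounding discussion.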

The values of the $MPC_t$ are not just exceedingly small, but also
insignificant at the 5 percent level. The correlation between the transitory


TABLE 1

Marginal Propensities to Consume out of Permanent
and Transitory Income for Canada (1944-66)

                     $MPC_p$     $MPC_t$     $r_{Y_t C_t}$

(1) C* = f(Y)        .9^192      .1967       J44
                                 (.1868)
(2) C = f(Y)         .88801      .0926       .271
                                 (.0716)

Note: Figures in parentheses indicate standard errors of the coefficients. Intercepts of all equations were
insignificant. C* = consumption including durable goods, C = consumption excluding durable goods,
MPC = marginal propensity to consume. Subscripts p and t represent permanent and transitory variables,
Y = disposable income, r = coefficient of correlation. Estimates of $MPC_p$ are significant at the 5% level.


components of income and consumption is also very small. Clearly, the
most important assumption of Friedman’s PIH is valid if we grant that
permanent consumption is proportional to permanent income.

The Weighting Pattern 

Holmes makes the point that there is a serious error in my calculations
insofar as the weights used to derive permanent income do not sum to
unity. Thus, when $\beta = .35$, my weights add up to .9973. It appears to be a
very weak criticism of my paper. It is no doubt true that weights should,
ideally, sum to unity. Ideally, if one were to go back several more years
beyond what I did, no doubt the sum of the weights would add to unity.
But would that have been a worthwhile exercise? In part, Friedman
answers this question with respect to his own calculations: “The retention
of as many as 17 terms is doubtless an excess of precision. It is dubious that
the results would be appreciably affected by retaining, say, only 9 terms
and adjusting the weights for them to sum to unity.”¹

Holmes is right in stating that if the weights do not sum to unity there
is a bias introduced in the calculation of permanent and transitory incomes,
and, therefore, in the calculations of the marginal propensity to consume out
of permanent and transitory incomes. It may be added, however, that if
one normalizes the weights, there is another type of bias introduced into
the calculation of permanent and transitory income due to the
normalization factor. Values of permanent and transitory incomes may go slightly
up or slightly down depending on how the weights were normalized.

However, I recalculated my estimates for permanent and transitory
variables with normalized weights and found that the new results are very
similar to the ones reported in my original paper. Thus, for example,
when the weights are normalized, the $MPC_p$ including durable goods was
.970, as compared with .968 when the weights were not normalized.
Again, the $MPC_p$ excluding durable goods was .867 when weights were

¹ Friedman (1957, p. 146, n. 26).







normalized and .871 when the weights were not normalized. These
results, in part, support Friedman’s intuitive hunch reported above.

Parenthetically, we may note that Holmes’s view of the role of the weighting
scheme in the PIH makes one uncomfortable. Throughout his paper, he
states that Friedman assumed $\beta = .4$ and $\alpha = .02$ and that I assumed
certain $\beta$ values in my calculations. This is not true. Neither Friedman
nor I made such assumptions, but found these weights from the data sets.
In his own equations (6) and (12) of table 2, which he considers as the
most satisfactory equations, Holmes states that $\beta = .4$. He tries to justify
this “because this was Friedman’s original assumption or that it is the
average of two alternative assumptions of $\beta$ assumed by Laumas”
(Holmes 1974). I believe that this is an arbitrary and unscientific
procedure. The value of $\beta$ ultimately determines the length of the “horizon,”
which is an integral part of the PIH. The “horizon” in turn reflects the
state of economic development of a country, its history, economic structure,
and how people form their expectations. In the absence of any comparison
between the United States and Canada along these lines, I feel that
Holmes is incorrect in following Friedman and assuming $\beta = .4$ for
Canada. Again, it appears strange that Holmes should consider an average
of the two values of $\beta$ that I reported in my original paper for calculating
permanent income relevant to the consumption of all goods, and
consumption excluding durable goods, as somehow a more appropriate figure
for estimating consumption functions including durable goods and
excluding durable goods in testing the validity of the permanent-income
hypothesis for Canada. There is no justification for this in his paper.

A hypothesis as elegant and as powerful as the PIH appears rarely in the
literature. It is unfair to the author of such a theory to accept without
reservations the central tenets of such a theory. The exact value of $\beta$, and
thus the length of the horizon, is still a controversial issue in the literature.²
It influences the extent to which the permanent value of a variable is
likely to differ from its measured value, and thus the whole question of
the validity of the distinction between permanent income and transitory
income.


Use of the Trend Factor 

Without the trend factor ($\alpha$), an equation like equation (11) in Holmes’s
paper (1974), when applied to a steadily growing series, yields estimated
values of a permanent variable systematically below their measured
values. If one were to suppose that the PIH in its strict version is valid, it
makes no difference if proper allowance is made for the trend factor.

² See, for example, Tobin and Swan (1969), Wright (1969), Holbrook (1967), and
Landsberger (1971).





TABLE 2

Marginal Propensities to Consume Permanent
and Transitory Income in Canada (1944-66)

                     $MPC_p$     $r_{Y_p C_p}$     $MPC_t$     $r_{Y_t C_t}$

(1) C* = f(Y)        .977        .99               .306        .83
                     (.008)                        (.043)
(2) C = f(Y)         .843        .99               .367        .86
                     (165)                         (.047)

Note: For meaning of symbols, please see note at the end of table 1. Intercepts of the $MPC_p$ equations with and without
durable goods were insignificant at the 5% level, while those of the $MPC_t$ equations were significant at the
5% level.


However, following Holmes, by using a value of $\alpha = .035$ for both $Y_p$
and $C_p$ and by normalizing weights, I present the new results in table 2
for equations (1)-(4) above. Permanent consumption was computed as
in my 1969 paper and as explained above, and not as $C_p = kY_p$.
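The downward bias from omitting the trend factor can be seen in a small numerical sketch. It uses a discretized stand-in for equation (11) of Holmes's paper: normalized exponentially declining weights, each scaled up by $e^{\alpha i}$ when a trend allowance is made. The discretization, the function names, and the artificial income series growing at exactly 3.5 percent are our assumptions, not the procedure of either paper.

```python
import math

def normalized_weights(beta, n_terms):
    """Exponentially declining weights, rescaled to sum to one."""
    raw = [math.exp(-beta * i) for i in range(n_terms)]
    total = sum(raw)
    return [w / total for w in raw]

def permanent_value(series, t, beta, alpha, n_terms):
    """Weighted average of series[t], series[t-1], ..., with each lagged
    value scaled up by exp(alpha * lag) to allow for trend growth."""
    w = normalized_weights(beta, n_terms)
    return sum(w[i] * math.exp(alpha * i) * series[t - i] for i in range(n_terms))

growth = 0.035
# Artificial income series growing steadily at 3.5 percent per year.
income = [100.0 * math.exp(growth * t) for t in range(40)]
t = 39

no_trend = permanent_value(income, t, beta=0.4, alpha=0.0, n_terms=17)
with_trend = permanent_value(income, t, beta=0.4, alpha=growth, n_terms=17)
print(round(no_trend / income[t], 3))    # below one: permanent < measured
print(round(with_trend / income[t], 3))  # equal to one on a pure trend
```

On a series that grows at exactly the assumed trend rate, the trend-adjusted estimate coincides with measured income, while the unadjusted estimate sits several percent below it. The adjusted weights $w_i e^{\alpha i}$ sum to more than one, which is the property Holmes emphasizes.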

Results of table 2 indicate that the true $MPC_t$ would be approximately
one-third of the true $MPC_p$. It may be noted that this modifies my results
reported earlier (1969). There I had found $MPC_t$ to be approximately
one-half of $MPC_p$. Thus, I wish to point out that the PIH is more valid for
Canada than I thought on the basis of my previous calculations.

Holmes argues that this conclusion needs further modification due to the
existence of high correlation between $Y_t$ and $C_p$. The essence of his point
is that since my $C_p$ is an imperfect measure of the true $C_p$ and since my
estimation procedure leads to a certain systematic element of the true $C_p$
being transferred to $C_t$, we do not know if my estimated $MPC_t$ is due to
$r_{Y_t C_t}$ or to $r_{Y_t C_p}$. He suggests that at least part of $MPC_t$ may be due to high
$r_{Y_t C_p}$. In my calculations relating to table 2, I find $r_{Y_t C_p^*} = .768$ and
$r_{Y_t C_p} = .639$. If one were to grant Holmes’s point, one would have to
modify the conclusion that $MPC_t$ in Canada is approximately one-third
of $MPC_p$. However, in the absence of a valid method of calculating the
degree of upward bias in $MPC_t$ (due largely to our ignorance of the true
$C_p$), one does not know if our $MPC_t$ and $MPC_p$ values differ significantly
from their true counterparts.³

In addition to the above tests, following Holmes, I made a direct test
of the PIH in which measured consumption is the dependent variable and
the transitory and permanent components of income the explanatory
³ Although Holmes’s argument appears sound in theory, from an empirical point of
view it seems inconclusive. Thus, with respect to our calculations in table 1, when $C_p$ is
defined as $kY_p$ (this makes $C_p$ as close to being its true value as possible), we find that
$r_{Y_t C_p^*} = .774$ and $r_{Y_t C_p} =$ -Wl. (These values are somewhat higher than when the
moving average of past consumption is used as a measure of permanent consumption.) In
this case, therefore, the existence of a high $r_{Y_t C_p}$ appears quite consistent with the near
validity of a strict version of the PIH for Canada. In further view of this result, one does
not know how far one must alter, if at all, the conclusions arrived at from figures in table 2.








TABLE 3

Marginal Propensities to Consume out of Permanent
and Transitory Income for Canada (1944-66)

                                                          R

C* =  27.42 + .970 $Y_p$ + .324 $Y_t$                     .99
     [.365]   [66 SOq]     [1.866]

C  = 330.99 + .859 $Y_p$ + .316 $Y_t$                     .99
     [5.84]   [64.881]     [2.598]

Note: Figures in brackets are the t-values. The C* is measured
consumption including durable goods, and C is measured consumption
excluding durable goods.


variables. Estimates were made for the following equations:

$$C^* = \lambda_0 + \lambda_1 Y_p + \lambda_2 Y_t, \tag{5}$$

$$C = a_0 + a_1 Y_p + a_2 Y_t, \tag{6}$$

where $C^*$ is aggregate measured consumption including durable goods
and $C$ is measured consumption excluding durable goods. The series of
$Y_p$ used here is the same as in the previous tests for table 2. The results are
presented in table 3.

The results of table 3 appear consistent with the results from table 2.
The marginal propensity to consume out of transitory income appears
to be approximately one-third of the $MPC_p$.

One concluding observation appears in order. Although I have carefully
followed Friedman’s method for calculating the permanent value of a
variable, I do not fully agree that his methodology corresponds most
accurately to his theoretical construct of the concept of permanent
income. True, there is considerable valid psychological reasoning behind
this method,⁴ yet it appears to me that the assumption that the biggest
response occurs immediately at the beginning of the adjustment period is
rather strong.

To summarize, then, the following points emerge from the above
discussion: (a) Holmes’s results seem consistent with mine when I make
due allowance for trend and normalize weights; (b) Friedman’s PIH
does not need as much modification in its application to Canada as I had
suggested in my original paper; (c) Holmes’s criticism of my concept of
$C_p$ is based on what I consider an erroneous view of hypothesis testing;
(d) Holmes’s paper does not show adequate appreciation of the importance
of “horizon” in the PIH; (e) both Holmes’s test and my test are
based on a particular method of calculating the permanent value of a
variable which may not be a perfect method, and it is quite possible that a
better method may lead to more accurate results.

* The psychological postulates as related to Jost’s law of memory, underlying
Friedman’s theory, are briefly discussed in Laumas and Laumas (1972).



References


Friedman, M. A Theory of the Consumption Function. Princeton, N.J.: Princeton Univ.
Press, 1957.

Holbrook, R. S. “The Three-Year Horizon: An Analysis of the Evidence.”
J.P.E. 75, no. 5 (October 1967): 750-54.

Holmes, J. “A Test of the Permanent-Income Hypothesis: Comment.” J.P.E. 82,
no. 1 (January-February 1974): 185-91.

Landsberger, M. “Consumer Discount Rate and the Horizon: New Evidence.”
J.P.E. 79 (November-December 1971): 1346-59.

Laumas, P. S. “A Test of the Permanent Income Hypothesis.” J.P.E. 77, no. 5
(September-October 1969): 857-61.

Laumas, P., and Laumas, G. S. “On How to Calculate Permanent Income.”
Rev. Income and Wealth, ser. 18, no. 4 (December 1972): 435-38.

Tobin, J., and Swan, C. “Money and Permanent Income: Some Empirical
Tests.” American Econ. Assoc. Papers and Proc. (May 1969), pp. 285-95.

Wright, C. “Estimating Permanent Income: A Note.” J.P.E. 77, no. 5
(September-October 1969): 845-50.



"Preferences, Separability, and the 
Patinkin Model": A Comment 

Eitan Berglas and Assaf Razin 
Tel-Aviv University


I 

It has been shown that the introduction of real balances into the utility
function, which is the main feature of the Patinkin model, generally
destroys the Slutsky results. More specifically, the derivative of the
compensated demand of any good with respect to its own price is not
necessarily negative.¹

In his paper, Lloyd shows that the “work by Baumol, Tobin, Patinkin
and others implies that the utility function is separable in a specific way”
which we shall reproduce later.² Lloyd also shows that if this separability
assumption holds, the Slutsky results can be reinstated with some
modifications. He then proposes an empirical test of the separability
assumption.

Pearce (1958; 1964, chap. 3), in his analysis of demand and savings,
has made extensive use of the separability assumption to develop the
Slutsky results for this case. However, his analysis did not consider any
monetary model; in particular, he did not refer to the Patinkin model.

This note shows that it is possible to apply the Pearce analysis to the
Patinkin model. Such an application yields a modified version of the
Slutsky results. The modified version and its derivation are much simpler
than Lloyd’s. Furthermore, this derivation can be subjected to a simple
empirical test, while the empirical test of the Lloyd formulation is shown
to be impractical.

II

Given Lloyd’s separability assumption, the utility function can be written³

$$u = U[\phi(x_1, \ldots, x_n), M, p], \qquad (1)$$

¹ For a discussion of this problem and for references to earlier contributions, see Patinkin
(1965, pp. 403-10).

² Lloyd (1971, p. 642).

³ We exclude bonds from the analysis, since bond interest rates are constant throughout;
thus the relative price of bonds in terms of money is constant, and the two can be regarded
as a composite good. The extension of the analysis to the case where interest rates vary is
straightforward.




where $x_1, \ldots, x_n$ are amounts of current commodities, $p$ is the price
index, and $M$ is the nominal quantity of money. $M$ and $p$ are introduced
separately in the utility function. Obviously, this includes the
form $M/p$, which Lloyd uses, as a special case.

It is useful to write the demand function for current commodities as
$x_i = x_i(p_1, \ldots, p_n, E)$, for $i = 1, \ldots, n$, where $E$ is the total expenditure
on current commodities. Differentiating totally, we get

$$\frac{dx_i}{dp_j} = \frac{\partial x_i}{\partial p_j} + \frac{\partial x_i}{\partial E}\,\frac{dE}{dp_j}. \qquad (2)$$

Partitioning $\partial x_i/\partial p_j$, we get

$$\frac{\partial x_i}{\partial p_j} = S_{ij} - x_j\,\frac{\partial x_i}{\partial E}. \qquad (3)$$


Given the budget constraint $E + M = Y + L$, where $Y$ is income and $L$
is the initial quantity of money (both are given to the consumer), and
using equations (2) and (3), we get⁴

$$\frac{dx_i}{dp_j} = S_{ij} - \frac{\partial x_i/\partial Y}{1 - \partial M/\partial Y}\left(x_j + \frac{\partial M}{\partial p_j}\right). \qquad (4)$$

By the separability assumption, the consumer can be regarded as though
he is solving a two-stage maximization problem: (a) given prices and
$L + Y$, he determines $M$; (b) subject to $E = L - M + Y$ and prices,
he maximizes $\phi$. The terms $S_{ij}$ are the compensated demand derivatives
holding $\phi$ constant. These terms, therefore, have all the known sign and
symmetry properties.
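The step from equations (2) and (3) to equation (4) can be spelled out as follows. This is a reconstruction offered as a sketch, not the authors' own display: it assumes the standard Slutsky partition for (3) and uses only the budget constraint $E + M = Y + L$ with $Y$ and $L$ given.

```latex
\begin{align*}
E = Y + L - M
  \;&\Longrightarrow\;
  \frac{dE}{dp_j} = -\frac{\partial M}{\partial p_j},
  \qquad
  \frac{\partial E}{\partial Y} = 1 - \frac{\partial M}{\partial Y}, \\[4pt]
\frac{dx_i}{dp_j}
  &= \frac{\partial x_i}{\partial p_j}
     + \frac{\partial x_i}{\partial E}\,\frac{dE}{dp_j}
   = S_{ij} - x_j\,\frac{\partial x_i}{\partial E}
     - \frac{\partial x_i}{\partial E}\,\frac{\partial M}{\partial p_j}, \\[4pt]
\frac{\partial x_i}{\partial E}
  &= \frac{\partial x_i/\partial Y}{\partial E/\partial Y}
   = \frac{\partial x_i/\partial Y}{1 - \partial M/\partial Y}
  \;\Longrightarrow\;
  \frac{dx_i}{dp_j}
   = S_{ij} - \frac{\partial x_i/\partial Y}{1 - \partial M/\partial Y}
     \left(x_j + \frac{\partial M}{\partial p_j}\right).
\end{align*}
```

The last line contains only derivatives with respect to prices and income, which is why the expression can be confronted with observable magnitudes.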


III

Equation (14) in Lloyd’s paper has the following disadvantages when
compared with equation (4) above: (a) with Lloyd’s approach, it is
necessary to know the price index function, for which no satisfactory
specification exists;⁵ (b) equation (14) includes a variable Q which is not
observable. (This variable is solved from eq. [14] of a particular pair of
commodities.) Lloyd claims that his equation (14) makes the results of
traditional demand theory open to empirical falsification. Given the
difficulties we have noted, however, such a claim is at best questionable.
Equation (4) of this paper, which depends only on observable magnitudes,
can be used for this purpose. An even simpler test of the properties of S_ij
and the separability assumption can be carried out by using equation (3).

We cannot close without noting that what we have done is implied by 

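⁴ Note that $\partial x_i/\partial E = (\partial x_i/\partial Y)/(\partial E/\partial Y)$ and $\partial E/\partial Y = 1 - \partial M/\partial Y$.

⁵ For a discussion of these difficulties, see Patinkin (1965, pp. 405-6).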





Pearce (1958; 1964, chap. 3). It is true that Pearce does not refer to the
Patinkin model. However, this note shows that the Patinkin model is a
special case of Pearce’s model. Using Pearce’s approach in a monetary
model makes it possible to derive a modified version of the traditional
demand properties in a way which can be tested empirically.⁶


References

Berglas, E., and Razin, A. “A Note on the Balance of Payments and the Terms
of Trade in Relation to Financial Controls.” Rev. Econ. Studies 39 (October
1972): 511-13.

Lloyd, C. “Preferences, Separability, and the Patinkin Model.” J.P.E. 79 (May/
June 1971): 642-51.

Patinkin, D. Money, Interest, and Prices. 2d ed. New York: Harper & Row, 1965.

Pearce, I. F. “Demand Analysis and the Savings Function.” Econ. Rec. 34 (April
1958): 52-66.

Pearce, I. F. A Contribution to Demand Analysis. London: Oxford Univ. Press, 1964.


⁶ The usefulness of this approach can be seen from its application to the analysis of the
effects of devaluation on the balance of payments in Berglas and Razin (1972).



A Paradoxon of the Market Mechanism 

Gerhard O. Orosel 
University of Vienna


1. Introduction and Results 

The market mechanism can be used to construct a social welfare function
in such a way that equilibrium points are preferred to other Pareto-optimal
points, which in turn are preferred to non-Pareto-optimal points.
K. J. Arrow’s condition of positive association (Arrow 1963) is not
applicable to this social welfare function for formal reasons, but
similar conditions of the same philosophical content can be formulated.
It can be shown by examples that the market mechanism of a barter
economy violates these conditions; that is, changes in individual preferences
may lead to exactly opposing changes in social preferences.


2. The Market Mechanism as a Social Welfare Function 

The market in a barter economy can be looked at as a mechanism which
assigns to each given set of utility functions for individuals i, i = 1, 2, ..., n,
and initial endowments of each individual i with each commodity j,
j = 1, 2, ..., m, at least one matrix (a_ij), where a_ij gives the amount of
commodity j individual i gets in equilibrium (cf. Debreu 1959 or Arrow
and Hahn 1971). In order to make things easier, there is no production
included in this model.¹

First a social welfare function, induced by the market mechanism, must
be constructed. For a social welfare function, we need a complete preordering
of the alternatives. Alternatives are now all attainable (feasible
or possible) allocations of the commodities, where attainable (feasible or
possible) means (1) that for each commodity the sum of the individual
quantities of this commodity is constant and equal to the initial endowment
of the society with this commodity and (2) that no individual gets
less of every commodity in an attainable alternative than in its initial


I am indebted to K. J. Arrow for very valuable discussion and to an unknown referee
for most helpful critique.

¹ As Rader (1964) has shown, any model of an economy involving production and
exchange can be represented by an equivalent model involving only exchange, if the
conditions for an efficient price system are fulfilled.



endowment.² This follows from the absence of production. Hence,
alternatives are matrices (x_ij), where x_ij is the amount of commodity j
individual i gets in alternative x; Σ_i x_ij must equal the fixed amount of
commodity j in the whole society for all j’s, and no initial endowment for
an individual i is strictly greater than the corresponding vector (x_i1,
x_i2, ..., x_im). The market mechanism does not give a complete preordering
of all those alternatives; it only singles out certain alternatives,
namely, equilibrium points. Hence, every ordering that distinguishes
equilibrium points from nonequilibrium allocations is compatible with the
market mechanism.

We will distinguish three sets. Denote by Q the set of equilibrium
points, by P the set of Pareto-optimal points not in Q, and by S the set of
attainable allocations not in P and not in Q, that is, the set of all
non-Pareto-optimal allocations.³

From this definition it follows that the union of Q, P, and S is the set of
all alternatives A: Q ∪ P ∪ S = A. Hence, if all elements of the sets Q,
P, and S are comparable by a relation, all attainable alternatives are
comparable by that relation. We will now define such a relation. The
relation W is a preference relation which gives the following ordering of
alternatives: each element of Q is strictly preferred to each element of both
P and S, each element of P is strictly preferred to each element of S, and
elements of the same set are indifferent.⁴

It is easy to see that the relation W does not violate the requirements
put on a preference relation: reflexivity, transitivity, and completeness.
Hence, we have defined a preference relation compatible with the market
mechanism. We will now take this relation W as a social preference
pattern and look at the market mechanism as a social welfare function,
which gives us, in cooperation with the relation W, a preference ordering
of all alternatives, that is, of all attainable allocations.
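The relation W amounts to ranking the three sets, and its three properties can be checked mechanically on any finite list of classified alternatives. The sketch below is only an illustration: the alternative names and their assignment to Q, P, or S are hypothetical, and the function names are invented here.

```python
from itertools import product

# Rank of each set under W: equilibria (Q) above other Pareto-optimal
# points (P), which are above non-Pareto-optimal allocations (S).
RANK = {"Q": 2, "P": 1, "S": 0}


def weakly_preferred(class_a, class_b):
    """True if an alternative in class_a is at least as good as one in class_b."""
    return RANK[class_a] >= RANK[class_b]


def is_complete_preordering(classes):
    """Check reflexivity, completeness, and transitivity of W on a finite set."""
    names = list(classes)
    reflexive = all(weakly_preferred(classes[a], classes[a]) for a in names)
    complete = all(
        weakly_preferred(classes[a], classes[b])
        or weakly_preferred(classes[b], classes[a])
        for a, b in product(names, repeat=2)
    )
    transitive = all(
        weakly_preferred(classes[a], classes[c])
        for a, b, c in product(names, repeat=3)
        if weakly_preferred(classes[a], classes[b])
        and weakly_preferred(classes[b], classes[c])
    )
    return reflexive and complete and transitive


# Hypothetical classification of four attainable allocations.
alternatives = {"x": "Q", "y": "P", "v": "P", "z": "S"}
print(is_complete_preordering(alternatives))
```

Because W is induced by a numerical rank, the three properties hold for any classification; the check is a reminder of why no further argument is needed.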


3. Positive Association 

It is a precondition of Arrow’s condition 2 (Arrow 1963), the condition of
positive association of social and individual values, that it is possible that
² Part (2) is implied if local nonsatiation (Arrow and Hahn 1971) is assumed for each
individual, and it assures that the set of alternatives changes if the initial endowments
change. Otherwise the social preference ordering could change without any change in
individual preferences, because a change of the initial endowments will in general change
the competitive equilibria. Part (2) reflects that a different distribution of the society’s
endowments among the individuals implies a different set of alternatives. The necessity
of part (2) has become obvious to the author because of a critique by an unknown referee.

³ It is not really necessary to distinguish between the sets P and S in this context, but
it assures the Pareto-optimality of the welfare function defined in the following.

⁴ The case a preferred to b is denoted by a ≥ b; a strictly preferred to b means a ≥ b
and not b ≥ a and is denoted by a > b; a indifferent to b means a ≥ b and b ≥ a and
is denoted by a ~ b; a subscript k refers to individual k.





preferences concerning solely pairs of alternatives including a fixed
alternative x can change with preferences concerning all other alternatives
remaining constant. It can be proved rigorously that this precondition
cannot be fulfilled in our case.

If there are two commodity vectors x^i and y^i and an individual i which
used to prefer y^i to x^i and now prefers x^i to y^i, then there must exist a
commodity vector z^i sufficiently close to x^i, such that the individual prefers
y^i to z^i if it prefers y^i to x^i and prefers z^i to y^i if it prefers x^i to y^i. Since
external effects are excluded, each individual orders allocations by
ordering his commodity vectors of each allocation.

Obviously we have to alter Arrow’s condition 2. We will state condition
2', a condition of similar content, and show that it is violated by our
welfare function.

Condition 2'

i) For given individual preferences, the society strictly prefers x to y. If
individual preferences change in such a way that some individuals, who
used to prefer y strictly to x, now prefer x strictly to y, with all other
individuals having unchanged preferences, the society must still prefer
x to y, at least weakly, in its new preference pattern.
ii) For a given set of individual preference patterns, the society is
indifferent between x and y. Individual preferences now change such
that some individuals, who used to prefer y to x, now prefer x to y,
and for every alternative z different from y which those individuals
ordered z ≥ y, they now order z > y. All other individuals do not
change their preferences. Then the new social preferences, belonging
to the altered set of individual preferences, must not prefer y to x;
hence, it must order x ≥ y.

Perhaps an explanation of part (ii) will be useful. Contrary to (i),
we start with x and y indifferent to each other for the society. But for y
in individual preferences we demand more than in (i): if it falls in
comparison to x, it must fall relative to every possible other alternative z;
that is, there must not exist a z which falls in comparison to y.

It will be proved by examples that both (i) and (ii) are violated by the
welfare function defined above.

4. The Counterexamples

Let x be an equilibrium point for a given set of utility functions I and given
endowments. Let y be an attainable allocation which is not an equilibrium
point at utility functions I. We will show that it is possible that x ceases
to be an equilibrium point and y becomes one, if preferences of some


individuals change from

$$y >_k x \quad \text{to} \quad x >_k y$$

with preferences of all other individuals remaining the same. Using the
social preference ordering, induced by the market and defined in the
above way, this means that a rise of an alternative x in some individual
preference patterns with other individuals’ preferences remaining constant
leads to a change in the social preference from x > y to y > x;
hence, condition 2' is violated.

There are two commodities a and b and three groups of individuals.
Each group has the same size and identical utility functions. Groups 2
and 3 do not alter their preferences, whereas group 1 does change its
preferences from

$$y >_k x \quad \text{to} \quad x >_k y.$$

Because of this change y becomes an equilibrium point, which is the
paradoxon. This is shown in the graphical representation in figure 1.
First we must do the computations in order to show that no individual
violates its budget constraint and that the sum of the commodities equals
the sum of the initial endowments, which must be the case because there
is no production.

Let the initial endowments be $w_i = (4, 4)$ for all individuals, and the prices
$p^x = \binom{2}{1}$ and $p^y = \binom{1}{2}$ for x and y, respectively.
The social endowment is $\sum_i w_i = (12, 12)$ if we assume that there is one
individual in each group. Further,

$$w_i p^x = (4 \;\; 4)\binom{2}{1} = 12 \quad \text{and} \quad w_i p^y = (4 \;\; 4)\binom{1}{2} = 12.$$

                        Quantities
Equilibria    a_1    a_2    a_3    b_1    b_2    b_3
x             3.5    3.5    5      5      5      2
y             5      2      5      3.5    5      3.5

$$(1):\quad (3.5 \;\; 5)\binom{2}{1} = 7 + 5 = 12, \qquad (5 \;\; 3.5)\binom{1}{2} = 5 + 7 = 12,$$

$$(2):\quad (3.5 \;\; 5)\binom{2}{1} = 7 + 5 = 12, \qquad (2 \;\; 5)\binom{1}{2} = 2 + 10 = 12,$$

$$(3):\quad (5 \;\; 2)\binom{2}{1} = 10 + 2 = 12, \qquad (5 \;\; 3.5)\binom{1}{2} = 5 + 7 = 12,$$

$$(x):\quad \sum_i (a_i, b_i) = (3.5 + 3.5 + 5,\; 5 + 5 + 2) = (12, 12) = \sum_i w_i,$$

$$(y):\quad \sum_i (a_i, b_i) = (5 + 2 + 5,\; 3.5 + 5 + 3.5) = (12, 12) = \sum_i w_i.$$
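The budget and feasibility computations above can be repeated mechanically. The following sketch assumes the figures as they are read from the text: endowments (4, 4), prices (2, 1) at x and (1, 2) at y, one individual per group, and the quantities in the table. The function and variable names are illustrative, not the author's.

```python
endowment = (4.0, 4.0)  # (a, b) held initially by every individual
prices = {"x": (2.0, 1.0), "y": (1.0, 2.0)}  # (price of a, price of b)

# Bundles (a, b) of groups 1, 2, 3 in the two candidate allocations.
allocations = {
    "x": {1: (3.5, 5.0), 2: (3.5, 5.0), 3: (5.0, 2.0)},
    "y": {1: (5.0, 3.5), 2: (2.0, 5.0), 3: (5.0, 3.5)},
}


def value(bundle, price):
    """Market value of a bundle at the given price vector."""
    return sum(q * p for q, p in zip(bundle, price))


def check_allocation(name):
    """Return (budgets hold, totals match social endowment, condition (2) holds)."""
    price = prices[name]
    bundles = allocations[name].values()
    wealth = value(endowment, price)
    budgets_ok = all(value(b, price) == wealth for b in bundles)
    totals = tuple(sum(b[k] for b in bundles) for k in range(2))
    feasible = totals == tuple(len(allocations[name]) * e for e in endowment)
    # Condition (2): nobody receives less of every commodity than initially held.
    not_dominated = all(
        not all(q < e for q, e in zip(b, endowment)) for b in bundles
    )
    return budgets_ok, feasible, not_dominated


for name in ("x", "y"):
    print(name, check_allocation(name))
```

Every bundle costs exactly 12 at the relevant prices and each commodity sums to 12, so both x and y are attainable and affordable; whether either is an equilibrium then depends only on the tangency conditions.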








Fig. 1. Top, group 1 (individuals k). Middle, group 2 (individuals h). Bottom, group 3
(individuals l).


Hence, x and y are equilibria if the indifference curves touch the budget
lines in the allocation points of x and y, respectively. Denote by

$$\left.\frac{\partial u^i/\partial a}{\partial u^i/\partial b}\right|^{j}_{a=\alpha,\; b=\beta}, \qquad j = I, II,$$

the ratio of the marginal utilities of the commodities a and b (hence the
marginal rate of substitution) for individual i at its endowment $a = \alpha$,
$b = \beta$ and preference pattern j. Then in the example the following holds.



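For group 1 (individuals k):

$$2 = \left.\frac{\partial u^k/\partial a}{\partial u^k/\partial b}\right|^{I}_{a=3.5,\; b=5} \neq \left.\frac{\partial u^k/\partial a}{\partial u^k/\partial b}\right|^{II}_{a=3.5,\; b=5}, \qquad
\frac{1}{2} = \left.\frac{\partial u^k/\partial a}{\partial u^k/\partial b}\right|^{II}_{a=5,\; b=3.5} \neq \left.\frac{\partial u^k/\partial a}{\partial u^k/\partial b}\right|^{I}_{a=5,\; b=3.5}. \qquad (1)$$

For group 2 (individuals h):

$$2 = \left.\frac{\partial u^h/\partial a}{\partial u^h/\partial b}\right|^{j}_{a=3.5,\; b=5}, \qquad
\frac{1}{2} = \left.\frac{\partial u^h/\partial a}{\partial u^h/\partial b}\right|^{j}_{a=2,\; b=5}, \qquad j = I, II. \qquad (2)$$

For group 3 (individuals l):

$$2 = \left.\frac{\partial u^l/\partial a}{\partial u^l/\partial b}\right|^{j}_{a=5,\; b=2}, \qquad
\frac{1}{2} = \left.\frac{\partial u^l/\partial a}{\partial u^l/\partial b}\right|^{j}_{a=5,\; b=3.5}, \qquad j = I, II. \qquad (3)$$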


Since at allocation x at preferences II and at allocation y at preferences
I the marginal rate of substitution of individuals k differs from that rate
of individuals not k (hence, individuals h or l), x is not Pareto-optimal at
preferences II, and y is not Pareto-optimal at preferences I.
Therefore, there cannot exist prices which make the points x and y
equilibrium points at preferences II and I, respectively, because
Pareto-optimality is a necessary condition for an equilibrium point.

We still have to show that part (ii) of condition 2' is violated by our
welfare function. We can use the same example, leaving everything
unchanged for group 2 and group 3. Preferences of group 1 must now be such
that y is, as well as x, an equilibrium point at preferences I; hence, the
society is indifferent between x and y. If the new indifference curve
through y, implied by preference pattern II, touches the old one in y and
lies nearer to the origin in every other point, then for each z other than y
the following holds:

$$z \geq_k y$$

in preference pattern I implies

$$z >_k y$$

in preference pattern II. Obviously it is possible that two strictly convex
curves touch each other in a point. Since in the example everything
except the indifference curves of group 1 is the same as in the previous
example, only the graphical representation of group 1 is given. Obviously
part (ii) of condition 2' is violated.






The dotted curves in figure 2 are the indifference curves at preferences
II. At preferences I, $y >_k x$, and at preferences II, $x >_k y$.

$$2 = \left.\frac{\partial u^k/\partial a}{\partial u^k/\partial b}\right|^{I}_{a=3.5,\; b=5} \neq \left.\frac{\partial u^k/\partial a}{\partial u^k/\partial b}\right|^{II}_{a=3.5,\; b=5}. \qquad (4)$$

Hence, there cannot exist prices making x an equilibrium point at
preferences II, because the marginal rate of substitution between a and b is
equal to 2 for individuals not k. Therefore, x cannot be Pareto-optimal.
Hence, it cannot be an equilibrium point.
Since

$$\frac{1}{2} = \left.\frac{\partial u^k/\partial a}{\partial u^k/\partial b}\right|^{I}_{a=5,\; b=3.5} = \left.\frac{\partial u^k/\partial a}{\partial u^k/\partial b}\right|^{II}_{a=5,\; b=3.5}, \qquad (5)$$

y is still an equilibrium point, and the social welfare function orders
y > x for the preference profile II, which violates condition 2', part (ii).
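Both counterexamples turn on one test: an attainable allocation that satisfies every budget constraint is an equilibrium only if each group's marginal rate of substitution equals the price ratio. The sketch below restates the two examples in that form. The value 1.0 used for group 1 whenever its marginal rate of substitution differs from the price ratio is purely hypothetical (the text requires only inequality), and all names are illustrative.

```python
# Price ratio p_a / p_b that supports each candidate allocation (from the text).
PRICE_RATIO = {"x": 2.0, "y": 0.5}

# Groups 2 and 3 never change preferences; their marginal rates of
# substitution at x and at y equal the respective price ratios.
OTHERS_MRS = {"x": 2.0, "y": 0.5}

# Hypothetical stand-in for "differs from the price ratio".
OFF = 1.0


def is_equilibrium(alloc, group1_mrs):
    """Tangency for all groups at alloc (budget constraints already verified)."""
    ratio = PRICE_RATIO[alloc]
    return group1_mrs[alloc] == ratio and OTHERS_MRS[alloc] == ratio


def social_order(group1_mrs):
    """Ordering of x and y under W, using only membership in the set Q."""
    x_eq = is_equilibrium("x", group1_mrs)
    y_eq = is_equilibrium("y", group1_mrs)
    if x_eq and y_eq:
        return "x ~ y"
    if x_eq:
        return "x > y"
    if y_eq:
        return "y > x"
    return "undetermined"  # neither is in Q; W would then depend on P versus S


# Group 1's marginal rate of substitution at x and y under patterns I and II.
example_i = {"I": {"x": 2.0, "y": OFF}, "II": {"x": OFF, "y": 0.5}}
example_ii = {"I": {"x": 2.0, "y": 0.5}, "II": {"x": OFF, "y": 0.5}}

for label, example in (("(i)", example_i), ("(ii)", example_ii)):
    print(label, social_order(example["I"]), "->", social_order(example["II"]))
```

In both cases group 1 moves x up relative to y in its own ranking, yet the social ordering moves the other way: from x > y to y > x in the first example and from indifference to y > x in the second.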


5. Conclusions 

The results reached show that the market mechanism has unsatisfying
properties even under ideal circumstances.⁵ Those properties are immanent
in the theoretical model, no matter if this model is an acceptable
approximation of reality or not.

It seems to me that the source of the violation of condition 2' is that, for
competitive equilibria, besides the initial endowments, only the slopes of
the indifference curves are important. This means that primarily only the


⁵ The confinement to a barter economy without production is no limitation to the
validity of the results because of Rader’s theorem given in n. 1.




ranking of nearby alternatives matters, and the ordering of alternatives
not being in a marginal neighborhood of each other comes in only by the
necessity of transitivity and of the usual shape of the indifference curves.⁶


References

Arrow, K. J. Social Choice and Individual Values. Cowles Foundation Monograph 12.
2d ed. New York: Wiley, 1963.

Arrow, K. J., and Hahn, F. H. General Competitive Analysis. Mathematical
Economics Texts, vol. 6. San Francisco: Holden-Day, 1971.

Debreu, G. Theory of Value. New York: Wiley, 1959.

Rader, J. T. “Edgeworth Exchanges in General Economic Equilibrium.” Yale
Econ. Essays, vol. 4, no. 1 (1964).


⁶ Graphically two conditions are necessary and sufficient for an attainable allocation
to be a competitive equilibrium: the price line (the hyperplane in general) must (a)
touch an indifference curve and (b) pass through the point representing the initial
endowment, whereby (a) and (b) must hold for each individual. Hence, with fixed initial
endowments, only the slopes of the indifference curves at an attainable allocation x are
relevant for the question of whether x is a competitive equilibrium or not. If we compare two
situations differing only in the individual preferences (but not in the initial endowments),
this means that it is relevant how preferences in a very small neighborhood of x differ
in the two situations. Of course, given the change of the preferences in all other areas of
the positive orthant (that is, the slope of the indifference curves), the necessity of transitivity
(the indifference curves must not cut) and the necessity of the usual shape of the
indifference curves (convex to the origin) exclude certain possibilities of the slope of the
indifference curves at x. The same is true for the question of whether a point y stays or does not
stay an attainable allocation not being a competitive equilibrium if preferences change.
The influence of the two necessities of transitivity and shape, respectively, diminishes in
our example if x and y are “far from each other.” Hence, if x and y are not “too close to
each other,” it is possible to change the indifference curves (without violating the
necessities) in such a way that (a) x > y changes to y > x and (b) the slopes of the indifference
curves change accordingly. Perhaps it is not too farfetched to point out an analogy to
optimizing theory: for local optima (as opposed to global optima) only an infinitely
small neighborhood of the point in question is relevant.



Miscellany 


Drug-Law Enforcement: Should We Arrest 
Pushers or Users? 


Any society wishing to reduce the consumption of an “undesirable”
item, such as illegal drugs, has available two general strategies: reducing
demand, by harassing buyers, or reducing supply, by harassing sellers.
Law enforcement agencies in the United States, presumably with the
approval of the voting public, seem to have opted largely for the latter
strategy (“we’re not really interested in users, we want the pushers”),
and yet ex-addict William S. Burroughs (1966, p. xl) advises: “If we
wish to annihilate the junk pyramid, we must start with the bottom of the
pyramid: the Addict in the Street, and stop tilting quixotically for the
‘higher ups’ so called, all of whom are immediately replaceable” (italics
in original). This note will consider what is involved in choosing between
these two strategies and will argue that the United States has chosen to
emphasize the strategy which, for a given enforcement expenditure,
(1) probably has the smaller effect upon drug consumption, (2) seems
likely to increase crime rates by addicts, making enforcement of other
laws more difficult, and (3) seems more likely to corrupt police. That
something is wrong with our strategy seems clear enough. The Government
Accounting Office recently concluded from a year-long investigation
that traffic in heroin is too big to stop at United States borders (New
York Times, December 13, 1972, p. 31). In some ways even more disturbing,
however, was the recent announcement that about one-fifth of the heroin
and cocaine seized in New York City over the past 12 years has been
stolen from the police department (New York Times, February 1, 1973, p. 1).
The economics of drug law enforcement is similar to that of sales taxes.
As is well known, who “pays” such a tax and its effects upon price and
quantity depend upon the slopes of the demand-and-supply curves as
well as upon the content of the law. The law determines whether the buyer
or the seller shall act as the state’s “collection agent,” in the case of sales
taxes. Since punishment for violating drug laws is usually at least partly




nonpecuniary, taking the form of jail time, social disgrace, and other
inconveniences, the content of drug laws, together with police enforcement
decisions, determines upon whom these nonpecuniary costs shall
fall as well as upon whom the initial burden of pecuniary “taxes” (the
expected value of fines, legal fees, etc.) shall fall. If sale is prohibited and
purchase not (by law, or de facto, by police decisions within the constraints
of limited enforcement resources), the tax will fall initially upon sellers,
who will convert the resulting pecuniary and nonpecuniary costs into a
pecuniary increase in supply price to buyers. Likewise, buyers will
convert pecuniary and nonpecuniary inconveniences arising from police
harassment into a pecuniary decline in demand price to sellers. As is
true with sales taxes, a given shift in demand relative to supply will
result in the same effect upon consumption, regardless of whether the
shift is in demand or supply. Presumably, sales taxes are usually levied
upon sellers because of lower enforcement and collection costs; a given
amount of enforcement effort will produce more net revenue if directed at
sellers. With sales taxes, we would wish to arrange collections so as to
shift demand relative to supply as little as possible for a given tax rate.
With drug laws, on the other hand, the goal is not to collect revenue
but to discourage consumption, and we would wish to arrange laws and
enforcement procedures so as to shift demand relative to supply as much as
possible. Whether we would wish to impose the “tax” initially upon
sellers or buyers depends upon which arrangement we would expect to
result in a greater shift in demand relative to supply. This, in turn,
would seem to depend mostly upon whether buyers or sellers are more
efficient in converting the inconveniences of dealing with fines or
nonpecuniary punishments such as jail sentences, social disgrace, etc., into
pecuniary shifts in supply or demand. The more efficient the conversion,
the smaller the shift a given amount of harassment will produce. It seems
likely that sellers would have the advantage over buyers in making this
conversion, since opportunities for specialization should attract to the
drug-selling trade individuals who are peculiarly suited for it. If the law
requires that sellers occasionally cope with fines, jail sentences, and
resulting social disapproval, we should expect successful drug dealers to
be those who have the lowest supply prices for these services or are most
adept at avoiding them. Presumably, the successful practitioners of any
trade are largely distinguished by their ability to convert pecuniary and
nonpecuniary costs into small pecuniary prices; there is no obvious reason
why drug dealers should be an exception. In addition, specialization by
drug dealers makes investment in devices for reducing the impact of
enforcement (knowledge of whom to bribe to avoid arrest, which lawyers
to hire to avoid conviction, etc.) more remunerative. Finally, possibilities
for entering the drug-selling trade will tend to make seller harassment
less effective. A buyer jailed means one buyer less for the duration of the



sentence; a seller jailed may simply mean fresh opportunity for a hopeful 
entrant. 
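The sales-tax analogy in the preceding paragraph can be made concrete with a linear market. The sketch below is a textbook illustration under stated assumptions, not the author's model: inverse demand is taken as a - b*q, inverse supply as c + d*q, and enforcement is summarized by a per-unit "wedge" between what buyers give up and what sellers net. All parameter values are hypothetical.

```python
def equilibrium(a, b, c, d, wedge, levied_on):
    """Quantity and observed money price with a per-unit enforcement wedge.

    Inverse demand: a - b*q (full cost buyers will bear per unit).
    Inverse supply: c + d*q (net return sellers require per unit).
    levied_on: "sellers" or "buyers", i.e. who bears the harassment initially.
    """
    quantity = (a - c - wedge) / (b + d)
    buyer_full_cost = a - b * quantity
    seller_net_return = c + d * quantity
    # Harassing sellers pushes the money price up to the buyers' full cost;
    # harassing buyers pushes it down to the sellers' net return.
    money_price = buyer_full_cost if levied_on == "sellers" else seller_net_return
    return quantity, money_price


params = dict(a=100.0, b=1.0, c=20.0, d=1.0)  # hypothetical market
baseline = equilibrium(wedge=0.0, levied_on="sellers", **params)
on_sellers = equilibrium(wedge=20.0, levied_on="sellers", **params)
on_buyers = equilibrium(wedge=20.0, levied_on="buyers", **params)

print("no enforcement :", baseline)
print("harass sellers :", on_sellers)
print("harass buyers  :", on_buyers)
```

For the same wedge, consumption falls by the same amount either way, while the money price rises under seller harassment and falls under buyer harassment. The argument in the text concerns which side turns a given enforcement outlay into the larger wedge, which this sketch takes as given.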

Empirical evidence on the effects upon consumption of buyer harassment
versus seller harassment is extremely difficult to obtain, but the
meager evidence available seems to indicate that decreases in the effective
penalty on buyers do lead to marked increases in consumption. Wheat
(1972) found that in Boston the expected costs of arrest to a user (defined
as probability of arrest times probability of being sentenced to prison
times average prison sentence) declined sharply between 1961 and 1970,
while the estimated number of addicts in Boston increased by a factor of
about 10. During the British experiment with decriminalizing heroin
use and allowing physicians to prescribe heroin as they wished, addiction
increased so greatly that the practice was abandoned in 1968.
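The expected-cost measure described above is a simple product of three terms. The sketch below shows the calculation only; the function name is invented here and the input figures are hypothetical, not taken from the Boston study.

```python
import math


def expected_cost_of_arrest(p_arrest, p_prison_given_arrest, avg_sentence_days):
    """Expected prison days per user: P(arrest) * P(prison | arrest) * sentence."""
    for p in (p_arrest, p_prison_given_arrest):
        if not 0.0 <= p <= 1.0:
            raise ValueError("probabilities must lie between 0 and 1")
    return p_arrest * p_prison_given_arrest * avg_sentence_days


# Hypothetical "before" and "after" enforcement regimes.
earlier = expected_cost_of_arrest(0.25, 0.5, 120.0)
later = expected_cost_of_arrest(0.125, 0.25, 120.0)
print(earlier, later, later / earlier)
```

Because the terms multiply, halving both probabilities cuts the expected cost to one-quarter even if sentences are unchanged, which is how a sharp decline can arise without any change in the statutory penalty.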

A second point to consider in choosing a drug-law enforcement strategy
is the effect upon the pecuniary drug price. Clearly, reducing supply will
raise the price; reducing demand will lower it. In effect, reducing supply
imposes an income tax upon those with highly inelastic demand
(“addicts”¹). Simon Rottenberg (1968, p. 87 n.) claims that the price of a
gram of heroin rises from $0.05 in Turkey to $295 retail in Chicago.
Since much of this price increase is due to enforcement of drug laws, the
implicit income tax upon addicts resulting from seller harassment is
substantial. Adding to the troubles of these unfortunates is a perhaps
regrettable concomitant of reducing drug consumption. However, much
of the income tax imposed upon addicts by current enforcement strategy
seems to be shifted onto others by theft and other addict crimes. Estimates
of the proportion of property crimes committed by addicts range from
about one-fourth to two-thirds (Erickson 1969, p. 485; Wilson, Moore,
and Wheat 1972, p. 12). This heavy tax upon innocent bystanders has
no obvious social function and could be largely avoided by concentrating
enforcement effort upon users, reducing demand and the pecuniary
price of drugs. In effect, by shifting to such a policy we would be exchanging
the current pecuniary tax upon addicts, which they easily shift onto
innocent bystanders, for a largely nonpecuniary tax, which they could not
easily shift onto anyone except drug sellers.

Becker (1968, pp. 193-98) has argued convincingly in favor of fines for
criminal offenses whenever feasible. Indeed, given the current strategy of
concentrating on sellers, it would seem that a given amount of enforcement
cost would have more effect upon drug traffic if expenditure currently
used to maintain in prison those already arrested and convicted were
diverted to making more frequent arrests or obtaining more convictions
once arrests are made. However, fines for drug users would seem as likely
to increase property crime rates as the current seller-harassment strategy,
since they result in an increase in the pecuniary price of drugs to addicts.
Nonpecuniary punishment of users is not so easy to shift onto others and



also has the advantage of giving the community a mechanism for encouraging
users to seek rehabilitation. Some authorities have claimed that
many addicts will not seek rehabilitation without some measure of
compulsion (Wilson et al. 1972, pp. 20, 25, 27; see also statements by
Dr. Herbert D. Kleber of the Connecticut Mental Health Center reported
by the New York Times, April 27, 1973, p. 39).

Perhaps the most serious flaw of a policy of harassing drug sellers is
that it lays the basis for a cooperative relationship between drug sellers
and the police. The gains from collusion in an industry are known to be
greater the smaller the elasticity of demand, the greater the elasticity of
marginal costs, and the smaller the costs of enforcing the collusion
(Becker 1968, pp. 205-7). It is widely believed that the elasticity of
demand for hard drugs is very small (at least in the short run). The
elasticity of marginal costs is presumably very large. Thus, the gains from
collusion would be large if colluding sellers could obtain cheap means of
enforcing collusion and of preventing entry into the industry. Enforcement
policies directed against sellers furnish just that. Schelling (1967, pp. 119,
124-25) cited evidence that the Miami wire service syndicate “relied
heavily upon the police as their favorite instrument of intimidation.”
Drug laws directed against sellers make this instrument available to
colluding sellers for use against price cutters and potential entrants.
One way to view police policies of harassing sellers is as establishing a
nonpecuniary “license fee.” Those who are relatively efficient in converting
arrest and other harassment into pecuniary terms will “pay the fee” and
sell; others will be barred from the trade. The sellers who succeed have
an interest in enforcement of the law to prevent entry by others. Both
sellers and enforcement officials, however, have an interest in converting
the license fee into terms other than harassment and arrest. One possibility
is to convert it into pecuniary (and illegal) bribes for police inaction,
which raises ominous possibilities as to the types of people who will seek
police work in the long run. But colluding sellers may purchase police
inaction by furnishing information on other criminal activity or by
rendering other quite legal services to the police. In this way, the police
could avoid illegal and antisocial behavior. Indeed, they could simultaneously
serve their social mandate of reducing drug consumption and
serve the interests of the colluding sellers by preventing individual
members of the collusion from increasing their sales at the expense of other
members and preventing new entrants from competing with established
sellers.

Thus, enforcement strategy directed against drug sellers tends to
enhance a natural accord between illegal business firms and the police
based upon their common interest in civil order. After visiting Harlem
in May 1968, Harry McPherson (1972, p. 371) reported hearing of a
conversation between Rap Brown and a Harlem rackets boss. Brown
had made a strong speech. The boss told him: “I agree with a lot of what
you said. Except I don’t want any riots. I’ve got to raise $60,000 to buy
off some people downtown on a narcotics rap. I can’t do that if there’s
a riot. You start a riot and I’ll kill you.” Brown is rumored to have left
town the next day.


References 

Becker, G. S. “Crime and Punishment: An Economic Approach.” J.P.E. 76
(March-April 1968): 169-217.

Burroughs, W. S. Naked Lunch. New York: Grove, 1966.

Erickson, E. “The Social Costs of the Discovery and Suppression of the Clandestine
Distribution of Heroin.” J.P.E. 77 (July-August 1969): 484-86.

McPherson, H. A Political Education. Boston: Little, Brown, 1972.

Rottenberg, S. “The Clandestine Distribution of Heroin, Its Discovery and Suppression.”
J.P.E. 76 (January-February 1968): 78-90.

Schelling, T. C. “Economic Analysis and Organized Crime.” In President’s
Commission on Law Enforcement and Administration of Justice, Organized
Crime Task Force Reports. Washington: Government Printing Office, 1967.

Wheat, I. D., Jr. “Heroin Abuse in Boston: The Dynamics of the Heroin Market
and the Response of the Criminal Justice System.” Mimeographed. Harvard
Univ., John F. Kennedy School of Government, Public Policy Program, 1972.

Wilson, J. Q.; Moore, M. H.; and Wheat, I. D., Jr. “The Problem of Heroin.”
Public Interest 29 (Fall 1972): 8-28.

Billy J. Eatherly

Mississippi State University 



Puzzles and Problems 


Editors’ Note: This section, which is being terminated, has been edited
by Stanley Fischer, to whom the editors wish to express their gratitude.

Q725—Donald A. Walker, Indiana University of Pennsylvania
Joan Robinson presented, in The Economics of Imperfect Competition (London:
Macmillan, 1933, chap. 26), the accepted account of exploitation of
workers that occurs because of the relation between the marginal revenue
product of labor (MRP) and the wage rate in monopsonistic labor
markets, and the related matter of the magnitude of monopsony profits.
By contrast with her treatment, it is here (fig. 1) assumed that a monopsony
faces a supply-of-labor (S) curve that bends back toward the wage-rate
axis in the relevant range. It is assumed, for simplicity, that labor is
the only factor used by the firm, and that the S curve and the MRP
curve each have constant negative slopes. It is also assumed that the
average revenue product (ARP) curve cuts the supply-of-labor curve
from above. In what ways does recognition of this situation lead to
modifications of Robinson’s analysis and conclusions?

Answer.—It can be seen that Robinson’s solution is not true in general.
She considers only the case of a positively inclined S curve. Accordingly,
she identifies, first, the intersection at N of the positively inclined marginal
cost of labor (MC_L) curve and the MRP curve; second, the equilibrium
volume of employment L_0; and third, the relevant point H on the S
curve. She concludes that the workers in a monopsony are exploited
because the wage rate OW_0 is less than the marginal revenue product OM.
Under the conditions of the question, however, the MC_L curve lies below
the backward-bending S curve, so that although the firm still maximizes
profit by equating the MC_L to the MRP, the resulting wage rate OW_1,
determined by the relevant point V on the S curve, exceeds the marginal
revenue product OM. It also follows that Robinson does not present a
general treatment of the magnitude of monopsony profits. With the rising
S curve of her analysis, the wage bill is OL_0HW_0, so monopsony profits
are W_0HQA. By contrast, if the S curve is backward-bending, the wage
bill is OL_0VW_1, so profits are the smaller magnitude W_1VQA.




[Fig. 1. Vertical axis: wage rate (W)]



A comparison of the cases shows that there are two possible reasons
for monopsony profits. Some profits may be obtained by “exploitation”
of the workers. This will occur if the S curve is rising. But exploitation is
obviously not a source of profits if the S curve is backward-bending. On
the contrary, in that case the workers are paid the sum MNVW_1 in
addition to their contribution to revenue, at the expense of what would
otherwise be a part of profits. Even in the latter case, provided that
revenue exceeds cost, it is advantageous for the firm to be a monopsonist,
because the firm’s market position enables it to obtain the profits shown
by W_1VQA, as a permanent flow of income. There is, therefore, a second
reason for monopsony profits, one from which the other firm also benefits,
namely, the absence of the freedom of entry and competitive equilibrating
forces that would lead to a systematic equalization of revenue and cost.
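
The wage comparison in the answer can be checked with a small numerical sketch. The linear forms used here are assumptions made only for illustration, since the puzzle specifies constant slopes but gives no numbers: the supply of labor is taken as w = a + s*L (s > 0 for a rising S curve, s < 0 for a backward-bending one) and the marginal revenue product as MRP = c - d*L. The function name and the parameter values are hypothetical.

```python
def monopsony_equilibrium(a, s, c, d):
    """Profit-maximizing monopsony with linear curves (illustrative assumption).

    Supply of labor:  w(L)   = a + s*L   (s > 0 rising, s < 0 backward-bending)
    Marginal revenue: MRP(L) = c - d*L   (d > 0)
    The wage bill is w(L)*L, so the marginal cost of labor is
    MC_L(L) = a + 2*s*L, which lies below the S curve when s < 0.
    """
    if d + 2 * s <= 0:
        # Second-order condition: the MRP curve must cut MC_L from above.
        raise ValueError("no interior profit maximum for these slopes")
    employment = (c - a) / (d + 2 * s)       # solves MC_L = MRP
    wage = a + s * employment                # relevant point on the S curve
    mrp = c - d * employment
    gap_area = (wage - mrp) * employment     # > 0: paid in excess of MRP; < 0: "exploitation"
    return employment, wage, mrp, gap_area


# Hypothetical parameter values, chosen only to give round numbers.
rising = monopsony_equilibrium(a=6.0, s=0.5, c=10.0, d=2.0)
backward = monopsony_equilibrium(a=6.0, s=-0.5, c=10.0, d=2.0)

for label, (employment, wage, mrp, gap_area) in (
    ("rising S", rising),
    ("backward-bending S", backward),
):
    print(f"{label}: L0={employment:.2f} wage={wage:.2f} MRP={mrp:.2f} gap={gap_area:.2f}")
```

Under these assumptions the wage minus the MRP equals -s times equilibrium employment, so the wage falls short of the MRP when the S curve rises and exceeds it when the S curve bends backward. The two runs use different supply curves, so their magnitudes are not directly comparable with the areas in the figure.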



Book Reviews 


Monetary Theory: Inflation, Interest, and Growth in the World Economy. By Robert A. Mundell.

Pacific Palisades, Calif.: Goodyear Publishing Co., 1971. Pp. ix+189. $10.00.

During the 1960s, Robert Mundell wrote a series of short articles which had a
sharp impact on the development of the monetary theory of both a closed and
open economy. In the former category (represented by part 1 of this book) were
Mundell’s seminal J.P.E. papers on inflation and on growth; in the latter (represented
by part 2) were his important contributions toward the development of a
general-equilibrium analysis of the monetary aspects of international trade.

In the first four chapters of part 1 of this book (and it is on this part that this
review will concentrate) Mundell has conveniently collected the aforementioned
J.P.E. papers (somewhat modified)—and to them he has added two chapters
(5-6) designed to integrate and generalize their respective analyses. Part 2 is an
extension of some of the basic analysis that appeared in his International Economics
(1968) and consists primarily of new papers.

The point of departure of Mundell’s analysis in part 1 is Metzler’s famous
diagrammatical representation of the general-equilibrium determination of the
rate of interest and price level in a full-employment economy. With the aid of this
diagram Metzler made a fruitful distinction between the impact of a monetary
change generated by deficit financing (which is neutral in its effects) and that
generated by an open-market purchase (which, in Metzler’s analysis, is not). In
his first chapter, Mundell very insightfully demonstrates that this difference is not
inherent in the open-market operation as such, but depends on whether or not the
taxes used to finance the interest payment on government debt are capitalized by
the taxpayers.

From here Mundell goes on (in chaps. 2-4) to make some ingenious modifications
of the Metzlerian diagram whose common theme is that since the vertical
axis of this diagram represents a percentage rate per unit of time (namely, the rate
of interest), it can also be used to represent various relevant rates of change in the
economy. Here Mundell is at his best in most neatly generalizing the Metzlerian
apparatus to the case of a continuous inflationary (or deflationary) process and in
demonstrating the nonneutrality of such a process. Mundell is also at pains to
provide an economic interpretation of this conclusion in terms of the discrepancy
created by such a process between the nominal and real rates of interest—and
hence its differential impact on the demand for real money balances as contrasted
with that for commodities. In a similar way Mundell analyzes the impact of
growth on the economy in terms of the rate of change of prices that such growth
generates, given the nominal supply of money.

All of us who work on monetary theory are in Mundell’s debt for his incisive
analyses of these problems. I must, however, admit that the new essays (chaps.
5-6) in which Mundell attempts to elaborate upon this analysis and integrate it
into the theory of growth are less helpful. I find the argument here difficult to
follow, including the difficulty of seeing precisely how the earlier analysis is
integrated into these essays.

Thus, the two equilibrium conditions used to carry out the analysis in the
earlier chapters are the familiar IS-LM curves, representing, respectively, flow
equilibrium in the commodity market and stock equilibrium in the money (or
financial-assets) market. In the growth chapters, on the other hand, the two
conditions relate only to the stock equilibria—namely, those for real-money
balances and physical-capital goods, respectively—with an inadequate explanation
of the way in which the commodity flow-equilibrium condition manifests
itself in the stock-adjustment process. Again, there is an inadequate explanation of
the assumption that money is a factor of production—which plays a crucial role
in this analysis. Nor, correspondingly, is there an indication of the way the theory
could be developed under the assumption that money is only a consumption
good.

Another point that should be noted is that strictly speaking Mundell is not
dealing with a growth model, but with the theory of the accumulation of wealth
to a desired (and thenceforth stationary) level. A crucial question which then
arises in the present context is the influence on this desired level of wealth—and of
the desired level of its physical-capital component, in particular—of an inflationary
process. Unfortunately, Mundell does not deal explicitly with this question. In any
event, there must be something in the nature of his assumptions here that leads
him to the conclusion that a higher rate of inflation necessarily leads to a lower
long-run equilibrium level of real-money balances, whereas in other analyses of
this problem the sign is ambiguous (cf. Levhari and Patinkin 1968, sec. 11). This
is a question on which I would like to have had Mundell’s view.

Another major theme of part 1 of Mundell’s book is the effect of inflation in
stimulating the rate of growth. Mundell analyzes this problem under different
assumptions and to my mind does not probe deeply enough into the reasons these
differing assumptions lead to different conclusions—though a common conclusion
of all these cases is that the maximum contribution of inflation to growth is reached
at relatively low (by Latin American standards) rates of inflation, and that this
contribution is at best “small.” Thus, under certain assumptions (chap. 6)
Mundell concludes that this optimum rate is roughly 10 percent, which increases
the growth rate “only” by about 0.5 percent.

What I find missing in this discussion, however, is an indication of what constitutes
a “large” increase in the rate of growth. After all, the analysis here
assumes a full-employment economy with a constant technology whose sole source
of more rapid growth is a more rapid accumulation of capital. Under such restrictive
assumptions it is hard to see how anything but an extremely drastic policy
could significantly affect the growth rate.

As a case in point, consider Mundell’s analysis on pages 38 through 39. The
model described here is a Harrod-Domar growth model (with a capital-output
ratio of 2), supplemented by the quantity equation. Under certain assumptions
about the other relevant economic magnitudes (namely, the reserve ratio of banks
and the elasticity of the demand for money with respect to the rate of change of
prices), Mundell shows that the growth rates generated by different rates of
inflation are described by the first two columns of table 1. Now, let me first note
that these calculations are made on the assumption that the banks’ reserve ratio
is 30 percent, which in Mundell’s model means that though every additional
dollar generates inflationary pressures, only 30 percent of them generate capital
accumulation and hence growth. Thus, if we assume as an alternative (as Mundell
does indeed assume in other contexts) that all of the expanded money supply is
used initially to finance investment (which is analytically equivalent to assuming a
reserve ratio of 100 percent), then the relevant rates of inflation are given by
column (3) of table 1, which are most moderate as compared with those of
column (2). I offer these alternative calculations not because they are necessarily
more realistic than those based on Mundell’s assumption, but only to illustrate
the sensitivity of the calculations to changes in the assumptions.

TABLE 1
Determinants of the Growth Rate

                  Annual Inflation Rate (Percentage)
Annual Growth     ----------------------------------    Savings Ratio or
Rate              Reserve          Reserve              Rate of Income
(Percentage)      Ratio = 30%      Ratio = 100%         Tax
(1)               (2)              (3)                  (4)

0.00              0.0              0.0                  0.0
0.25              5.7              1.3                  0.5
0.50              14.25            2.8                  1.0
0.75              28.5             4.4                  1.5
1.00              57.0             6.25                 2.0
1.25              142.5            8.3                  2.5
1.50              ∞                10.7                 3.0
5.00              ...              ∞                    10.0

But what is far more important for my present purposes is to note that if we
delete the monetary part of Mundell’s model, we are left with a Harrod-Domar
model whose rate of growth (under Mundell’s assumption of a capital-output ratio
of 2) is half the savings ratio. In column (4) of the table are accordingly presented
the growth rates associated with various savings ratios. Alternatively, if in the
spirit of the Mundell model we assume that the private sector consumes all of its
disposable income, and that investment in the economy is carried out entirely by
government—but this time financed by an income tax—then column (4) also
represents the income-tax rates necessary to generate different rates of growth.
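
The arithmetic behind column (4) can be sketched in a few lines. The sketch assumes only the Harrod-Domar relation stated above, that the growth rate equals the savings ratio divided by the capital-output ratio (here 2); the function name is illustrative and does not come from Mundell’s text.

```python
def harrod_domar_growth(savings_ratio_pct, capital_output_ratio=2.0):
    """Growth rate (percent per year) implied by a savings ratio (percent).

    Harrod-Domar relation: growth = savings ratio / capital-output ratio.
    """
    return savings_ratio_pct / capital_output_ratio


# Savings ratios (or, equivalently, income-tax rates) of column (4), in percent.
savings_ratios = [0.0, 0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 10.0]
growth_rates = [harrod_domar_growth(s) for s in savings_ratios]
print(growth_rates)  # [0.0, 0.25, 0.5, 0.75, 1.0, 1.25, 1.5, 5.0]
```

Each savings ratio maps to half its value, which matches the growth rates of column (1) of table 1.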

And then we can ask the question: Which represents a “more drastic” policy—
one which roughly doubles the rate of inflation in order to increase the rate of
growth from 0.50 to 1.00 percent per year (as is shown by col. [3]), or one which—
in order to accomplish the same objective—doubles the rate of income taxation
(as shown by col. [4])? The criteria by which one would answer such a question
are not clear to me.

As indicated at the beginning of this review, I have concentrated on part 1 of
Mundell’s book. Many readers, however, will be primarily interested in part 2,
in which Mundell—again with great ingenuity—develops a graphical general-equilibrium
analysis of the monetary aspects of international trade. Here he
emphasizes the view that balance-of-payments adjustments (particularly those of
a small country) should be analyzed in terms of the movement of international
prices relative to domestic ones, and not primarily in terms of the movements of
import prices relative to export prices. He also elegantly demonstrates—far more
rigorously than has hitherto been done—the somewhat neglected truth that in a
gold-standard world the money supply of any given country is endogenously
determined. Correspondingly, he draws the implications of this fact for the money
supply of a country which carries out a devaluation in a world of fixed exchange
rates.

More generally, Mundell bases his analysis of the international system
on the supply of—and demand for—international monetary reserves by the various
countries of the world. He then uses this analytical framework to explain some of
the dramatic changes that have taken place in this system since 19M.

These few words certainly do not do justice to the depth and ingenuity of
Mundell’s analysis in part 2. Let me then simply conclude by saying that Mundell’s
book must and will be carefully studied by every economist working in the fields of
monetary and international-trade theory. Mundell’s terse and elliptic style of
writing does not make for easy reading, but the reader who overcomes these
difficulties will find himself greatly rewarded by the additional insights into
difficult problems that he thereby gains.


References 

Levhari, David, and Patinkin, Don. “The Role of Money in a Simple Growth
Model.” A.E.R. 58 (September 1968): 718-53.

Mundell, Robert A. International Economics. New York: Macmillan, 1968.

Don Patinkin 

The Hebrew University of Jerusalem


Theory of General Economic Equilibrium. By Trout Rader.

New York and London: Academic Press, 1972. Pp. xx+362. $14.50.

This is a companion volume to the author’s Theory of Microeconomics, using the
distinctive notation and terminology of the earlier work as well as many of the
results derived there. Taken together, these two volumes represent a truly impressive
attempt to examine and extend the current state of knowledge in a number
of major areas of modern analytical economics, with detailed consideration of the
most important results in each area studied. The broad range of coverage of the
volume under review is indicated by a partial listing of contents: intertemporal
efficiency criteria, turnpike theorems, microgrowth models, existence and optimality
of the competitive equilibrium, bargaining games, stability and comparative
statics analysis of the competitive equilibrium, and applications to trade
theory and macroeconomics. In essentially all of the areas covered, known results
are reformulated, extended, and generalized, and new proofs are constructed. It
should be emphasized that this book is not so much a textbook as it is a treatise
summarizing a variety of Rader’s recent research, much of it published here for
the first time. If judged solely as a textbook, there are some serious flaws in this
volume, but these are precisely the flaws one would expect in a creative work; in
particular, the book tends to be unbalanced in the direction of work in which
Rader has been engaged, and even there one finds occasional slips and omissions.

This is an exceedingly formidable book from a mathematical point of view,
especially in the first chapters dealing with infinite horizons. Because of the
organization of the book, the mathematical requirements are not so stringent in
the middle and latter stages, but many readers may simply give up when faced
with the problems of getting through chapter 1 (exercise 5, p. 13: prove the
Bolzano-Weierstrass theorem). The book is also not completely self-contained;
unless the reader first studies Theory of Microeconomics quite thoroughly, reading
this volume tends to be frustrating, since so many of the concepts with which the
reader is assumed to be familiar (directionally dense preferences, interindustry
production, no free production in the limit, etc.) are discussed in detail mainly in
the earlier volume. A glossary of terms and a recapitulation of proofs from the
earlier volume used in this volume would be a great help. The method of approach
tends to be one of formulating a general and rather abstract framework, following
which the more conventional specialized models are examined. While this has
some obvious advantages from a formal point of view, it leads to the choice of a
notation that tends to be somewhat cumbersome when considering the simpler
cases, and makes it more difficult to utilize the intuition one has gained from
considering the conventional cases.

It really requires an economist with Trout Rader’s range of interests to adequately
judge the contribution in total of this volume; for the rest of us, we can
at most attempt to evaluate specialized aspects of the book. For example, I tried
to look in some detail at chapter 5 (Macrostatics) and found several slips, all of
them, however, easily corrected. Theorem 11 (p. 210) is incorrect because the
sum of diagonal elements is not constrained to be negative. Problem (iii), page 245,
cannot be solved unless, for example, one specifies that the coefficient matrix is
Metzler-Mosak, while in problem (i), page 248, the matrix is quasinegative definite
(rather than indefinite), and the condition should hold wherever evaluated. The
best-known result concerning “sometimes stable” matrices—page 205 (DA is
stable for some diagonal D with positive diagonal elements), that is, that A is
sometimes stable if A contains a nested sequence of principal minors, the ith of
sign (-1)^i, i = 1, ..., n—is not mentioned. On page 211, “market laws have
been used with substantive conclusions only for the gross substitutability case”
neglects, for example, the case where the nonnumeraire goods obey the Morishima
conditions. In contrast to these minor flaws, there is an elegant treatment of
the derivation of the Hicksian laws and a particularly straightforward analysis of
factor-price and consumption configurations in international trade also contained
in chapter 5.

The most interesting aspect of this book to me is the work on pairwise optimality
and the general attempt to incorporate the more general framework of n-person
cooperative games into economic theory, following the work of Scarf, Debreu, and
others. The sections on stabilization of markets and price and price-asset adjustment
mechanisms also represent important areas for further research.

In summary, this book, if used as a text in an advanced theory seminar, should
be used in conjunction with Theory of Microeconomics; it is accessible only to those
with an extensive background in mathematics, and it occasionally contains
erroneous or ambiguous results. But I think it would be ideal for such a seminar,
and despite its faults, it represents an impressive accomplishment.


James Quirk

California Institute of Technology


Technology and American Economic Growth. By Nathan Rosenberg.

New York: Harper & Row, 1972. Pp. 211.

This book is really a long essay, or perhaps a series of shorter essays. As such, it
does not present the results of new historical research. Rather, it integrates
economic analysis with the results that have been obtained from existing historical
studies (some of them done by Rosenberg himself) on the role of technology in the
development of the American economy. The result of this integration is a comprehensive
survey both of the causes, especially the economic causes, of technological
change and of the effects of technological change on the American economy.

Rosenberg has done an excellent job in handling the task he set himself. He has
achieved remarkable insight into the nature of the processes he describes and he is
able to communicate his ideas clearly and concisely. I was particularly impressed
by the chapter (chap. 2) which analyzes the influence of economic factors on the
development and application of new technology and by the two chapters (chaps.
3 and 4) which describe American experience in the nineteenth century. As is
natural, given Rosenberg’s own research interests, the discussion of the role
played by the American machine tool industry is especially outstanding. The
chapter on the twentieth century (chap. 5) is also good, but I did not feel that
it added as much to my understanding of events as did the earlier chapters.

In reading this book I must confess to feelings of both admiration and frustration.
Certainly I wish I had written it. Moreover I feel that I should have been
able to write it, all the while realizing, however, that such an attempt on my part
would have lacked much, if not most, of the considerable elegance and originality
displayed by Rosenberg.

In my judgment, almost any student (or professor) of economics or history, not
to mention economic history, will profit from reading this book. More important,
the benefit will almost certainly vastly exceed the opportunity cost of the relatively
small amount of time required to read it. Indeed, my only important criticism of
the book is a direct result of my concern that it be widely read, especially by
persons without extensive training in economics. Right at the start (on p. 4), there
are two diagrams utilizing isoquants and price lines (without, however, explaining
or even defining these concepts). I think this is unfortunate, because the diagrams
are not needed by economists and they are more likely to confuse than to help
readers innocent of economics. What is even worse, by giving a false impression of
the level of competence in economic theory needed to benefit from the rest of the
book, the diagrams may well frighten off potential readers who, in fact, have much
to gain from it.

My other complaint about this excellent volume is directed at the publisher.
There is no bibliography. A book as rich in historical examples and references to
the literature as this one is needs a bibliography. The convenience to the reader,
or at least some readers, would have been well worth the extra cost involved.

Lars G. Sandberg

Ohio State University


Planning for Economic Growth in the Soviet Union, 1918-1932. By Eugene Zaleski.
Translated from the French and edited by Marie-Christine MacAndrew and
G. Warren Nutter.

Chapel Hill: University of North Carolina Press, 1971. Pp. xxxviii+425. $15.75.

This is one of the most important Western studies of Soviet economic planning.
The heart of Zaleski’s volume is a pioneer comparison of plans and revisions of
plans with performance during the period of the First Five-Year Plan (1928-32).
It embraces five-year plans, annual plans, and—for selected periods—quarterly
and even monthly plans, and considers the progress and failure of Soviet planning
both chronologically and analytically.





Zaleski shows that, in a certain sense, planning for economic growth took place
successfully in the USSR in the period described. Even during the years of overambitious
planning, 1930 and 1931, when targets were “dreams rather than
plans,” “all these false notions, more or less shared by the authorities, nevertheless
enabled the plan to be turned into a powerful instrument of growth.” This was
because continuous pressure from a vast bureaucratic planning machine replaced
the market in moving the economy in the direction of the aspirations of the
authorities.

However, Zaleski convincingly shows that while the periodic business cycles
characteristic of market economies were eliminated, they were replaced by new
forms of economic fluctuation. Whereas in market economies fluctuations tend to
be transmitted rather uniformly, but with a lag, in the Soviet economy the fluctuation
spreads “according to the strengths and weaknesses of the dams that the
planning sector puts up in each sector.” In prerevolutionary Russia all parts of
industry were affected by the business cycle, but in the single period 1930-34
eight types of fluctuation can be distinguished in different industries: very rapid
growth (e.g., tractors), rapid growth with shock (machine tools), continuous
growth (electricity), interrupted growth (steel), stationary state (cotton fabrics),
stationary state followed by shock (cement), violent shock (sugar), and decline
(meat and fat). All this is illustrated with a quite extraordinary wealth of
statistical data, meticulously collected.

Some of Zaleski’s conclusions may, however, need revision. He frequently compares
growth rates in 1922/3-1927/8 with those in 1928/9-1932 and does not
always take sufficient account of the unused capacity of the earlier period which
was crucial, even in 1927/8, to the higher growth rates then achieved. He concentrates
his discussion of successes and failures in planning on performance in
terms of growth rates, even though he rightly points out that the overriding
criteria of success for the Soviet authorities were “the execution of large-scale
projects, the achievement of social transformation and the build up slowly
and sometimes empirically of a particular planning system.” How far was the
retardation in growth in 1930-33 due to deficiencies in planning, as Zaleski
implies, and how far to the lack of spare capacity, coupled with the fact that these
were years in which the new plants to be brought into operation during the
Second Five-Year Plan were not yet completed? Zaleski’s forthcoming volume,
dealing with Soviet planning and growth after 1932, may help us to weigh these
different factors.

The present volume is a translation of the French edition published in 1962,
with some minor revisions (including a critique of Feldman’s path-breaking but
highly unrealistic growth model), and with a useful index and bibliography
added. A fuller paginated table of contents would have helped the reader to use
this complicated volume. When the next volume appears in French, it would be
tempting to urge the very competent translator and editor to acquire temporarily
some of the “shock” and “storming” characteristics possessed by Soviet planning
in the 1930s, rather than keep us waiting 9 years for the English version, if it
were not for the strong danger that the high quality of the present edition would
not be maintained.


R. W. Davies

Centre for Russian and East European Studies
University of Birmingham



ECONOMICS IS A MAHER 

ECONOMICS, Ninth Edition 

Paul A. Samuelson, Massachusetts Institute of Technology. 1973,
917 pages, $11.50.

Changes in American life and economics and in attitudes towards the economic
system provide the motif for the revised ninth edition. Professor Samuelson
recognizes that to many of the younger generation, Gross National Product is
really Gross National Pollution. Polluting, dehumanizing growth, merely for
the sake of material growth, is something no one wants. Therefore in the ninth
edition, Professor Samuelson has not abandoned the national income approach
or the concept of GNP, but he has shown how the Gross National Product must
now be supplemented by a life-quality indicator that he calls Net Economic
Welfare, N.E.W. This concept is adapted from research done by William Nordhaus
and James Tobin of Yale, and their estimates are the basis for the first
figure in the text and for the discussion of National Income Accounting in
Chapter Ten. The new Chapter 43, Winds of Change: The Evolution of Economic
Doctrine, offers perhaps for the first time in an elementary economics
course a section devoted to the history of the subject and its relationship to the
history of ideas. The problems of the quality of life are covered in Chapter 40,
and the cost-push inflation or the problems of the modern mixed economy are
thoroughly discussed in Chapter 41.


BASIC ECONOMIC PRINCIPLES, Third Edition 

Robert A. Lynn, Kansas State University. 1974, 416 pages (tent.), 
$9.95 (tent.). Instructor's Manual. 

As o result of a survey conducted among users of the previous edition, Profeuor 
Lynn's third edition of this text hos been tailored to meet the needs of its of 
readers. Briefer and simpler than before, the text covers o brood spectrum of 
macro- and microeconomics topics, using a public policy oriented approach. It 
now contains increosed coverage and updoting of such topics as communist 
economics, dollar devaluation, and direct government regulation of the 
economy. 


McGRAW-HILL BOOK COMPANY 



Private Savings: Ultrarationality,
Aggregation, and “Denison’s Law”


Paul A. David

Harvard University and Stanford University

John L. Scadding

Stanford University


The article examines the behavior of the ratio of gross private saving
to gross national product (the gross private savings rate, or GPSR)
for the United States during the period 1898-1969. It emerges that the
GPSR defined to include consumer durables expenditures and the
imputed return on durables has no trend over the period and that its
year-to-year variation is extremely small except for the World War I,
World War II, and Great Depression years. This stability prevailed in
the face of pronounced shifts in the composition of private savings from
personal saving to durables expenditure and corporate saving, and a
redistribution of income from the private to public sectors. The notion of
ultrarationality, in which households subsume corporate and government
spending and saving in their budget decisions, is used to explain the
stability of the GPSR despite the important changes in its composition.
Leading implications of this stability for theory and policy are explored:
one important conclusion is that, under full employment, fiscal policy
can neither affect aggregate expenditure nor alter the ratio of investment
to consumption.


I 

The most profound and lasting impact of the Keynesian revolution on 
economic thought has been to leave macroeconomics with a strong 


The proximate origins of this paper lie in collaborative research undertaken by the
authors in the summer of 1969. This effort is built upon work previously carried out in
connection with the Stanford Project on Economic Growth in the United States. We
gratefully acknowledge the Social Science Research Council’s financial support of this
work, as well as the continuing advice and encouragement we received from Moses
Abramovitz. Warren Sanderson and Fred Nold furnished able assistance in the
preparation and statistical analysis of the data. In preparing the present version of this
paper for publication we have had the benefit of comments received from Edward Denison,
Robert J. Gordon, Joseph Pechman, Eytan Sheshinski, and an anonymous referee on an
earlier much longer draft entitled “What You Always Wanted to Know about ‘Denison’s
Law’ But Were Afraid to Ask.”




predisposition to emphasize demand over supply conditions. Nowhere
is this more evident than in studies of the determinants of aggregate
savings. Much of the evidence on savings behavior is not direct but,
rather, implicit in the results of the many studies of the consumption
demand. This means that not only is savings treated as a residual but also
its study is confined as a rule to the savings flows deriving from decisions
about the disposition of personal income, a curious restriction given that
the corporate and personal contributions to aggregate private savings
in the United States have been roughly equal over the past 70 years.

In the exposition of simple income determination theory, it is convenient
to treat all savings as done by the household sector and to regard all
investment as taking place in the firm or corporate sector, that is, to
neglect consumer durables expenditure and corporate savings. Alternatively,
this simplicity can be achieved by treating expenditure on durables
as part of investment demand and aggregating corporate and personal
saving by defining the saving function in terms of private saving and (in
the absence of government) total income rather than in terms of personal
saving and personal disposable income alone.

The direction of research on the demand for consumer durables has
already made quite respectable the first type of aggregation required by
this alternative approach. Formally, the theory of consumer durables
expenditure is a part of modern investment theory even though in
empirical applications it has been thought essential to distinguish among
the different categories of investment demand. But the notion that
corporate and personal saving can profitably be treated within a unified
conceptual framework, let alone usefully aggregated for empirical
purposes, has made little headway among economists at large; here
theory and practice are in accord, and disaggregation has been the order
of the day.

This resistance seems curious on at least two counts. First, it is possible
to refer to a simple theoretical construction which supports the aggregation
of the various savings flows.¹ The model in question views households
as “ultrarational” in the sense of regarding the corporate sector as an
extension of themselves, an instrument of their private interests; the
distinction between corporate and household sectors therefore is a legal
but not an economic one. One intriguing implication of this view, which
is so at odds with the tradition of Berle and Means, is that corporate and
personal saving are perfect substitutes. Hence, a savings function defined
in terms of total private savings should turn out to be more stable than
separate savings functions for the personal and corporate components;
in other words, aggregation is desirable.

¹ Cf. Grunfeld and Griliches (1960) for development of the (parallel) argument that
interdependence of firms’ investment plans justifies aggregation across enterprises within
the corporate sector.




The formal construction of a transsectoral savings model of this sort is
due to Bailey (1962, 1972). Although in Bailey’s work the focus is on the
characterization of “rationality” in the relations between the private and
public sectors, clearly the corporate and personal sectors could be
substituted without any change other than an appropriate relabeling of
variables.² As we shall see, the Bailey formulation is a better description
of relations within the private sector than it is of connections between
the private and public sectors.

The second ground for surprise at the slight attention paid the question
of aggregating savings is that economists have had before them for some
time intriguing empirical evidence that (gross) private savings is a very
stable function of GNP, and that it has in fact been much more stable
than either corporate or personal saving considered separately. More
than a decade has now passed since the publication of Denison’s 1958
article calling attention to the remarkable stability of the ratio of gross
private saving to gross national product in the United States during the
postwar years 1948-56. Over that period the average level of the gross
private savings rate (GPSR), as this ratio is called, just about equaled
the GPSR for 1929, the last date before 1948 that Denison thought was
comparable in the sense of being a full-employment, peacetime year.

These two propositions (that the year-to-year variability in the GPSR
was very small and that at least since 1929 there was no trend in the rate)
were dubbed “Denison’s law” by Hickman (1966), who appears to have
been alone in responding to Denison’s (1958, p. 267) invitation to future
students of the American economy to inquire whether this empirical
regularity would persist beyond the period he had been able to observe.

² Much has been made in the literature of including imputed income on consumer
durables in estimates of consumption and disposable income. But little attention has
been paid to the question of whether corporate income and saving should be included in
household budget constraints. There are a few exceptions. Friedman (1957, pp I1&-2S)
explicitly raises this question twice, and Friend and Schor (1959), in their investigation
of the distribution of saving by income class, impute corporate saving to the household
sector. But they are quick to point out that the procedure produces a departure “from
any normal concept of individual’s savings” (p. 237, italics added). Another proposed
modification of disposable income is the addition of capital gains. One argument for
doing so is that capital gains reflect transfers from the corporate to the personal sector
and hence “should be included at least in personal income and saving” (Bhatia 1971,
p. 364). A second reason is to capture changes in the stock of wealth when wealth is one
of the arguments in the consumption function (Arak and Spiro 1971). Of course, it is
possible to regard wealth, or at least a part of it, as a proxy for corporate income and
saving, but typically it is included because it summarizes future disposable income. In
other words, the aggregation implied in using wealth is over time rather than across
sectors.

The purpose of this paper is to promote wider recognition of the reign
of Denison’s law by presenting both a theoretical framework for its
interpretation and the results of more extensive empirical tests of its
validity. Denison’s findings for the United States and our own are
compatible with a picture like that in figure 1. The ray OS through the origin
depicts the observed stability in the (average) GPSR. The curve TT
describes the dependence of tax revenue net of transfers, T, as a positive
function of output. The sum of the two curves, shown by the heavy line
S + T, gives the amount of “leakages” from the system as a function of
GNP. To close the system and determine the equilibrium level of income,
we add the schedule of “injections” shown by the curves EE. The quantity
E is the sum of gross private domestic investment, net foreign investment,
and government purchases of goods and services.

Fig. 1

The intersection of S + T and EE determines the level of GNP.
Variations in EE, illustrated in figure 1, change the level of output, and
the resulting levels of GPS and GNP trace out points on OS. The level
of consumption expenditure can be shown by adding the usual 45-degree
line (not shown) through the origin: the vertical distance between it and
S + T gives the amount of consumption spending. Note that with a
stable GPSR, a change in the average tax rate is completely offset, for a
given level of income, by a change in the average propensity to consume
out of GNP in the opposite direction. This means that unless the tax
schedule is stable, consumption is not a stable function of GNP. Nor is it
a stable function of disposable income. The marginal propensity to
consume is not independent of whether the source of the change in
disposable income is a change in output or a change in the tax rate. But
the loss is inconsequential, since, as figure 1 illustrates, a stable
consumption function is not necessary for the simple Keynesian analysis of
income determination. The simplicity of income determination when the
GPSR is stable can be more easily seen if E and T are combined as in
figure 2a. Figure 2a shows clearly that the tax and expenditure multipliers
are the same and that the common multiplier is just the inverse of the
GPSR.

Fig. 2

Figure 2a is not a perfect translation of figure 1. In the former we
have drawn the E - T curve with a positive slope, which means that the
corresponding EE curve must have a slope greater than that of the tax
schedule, TT. Hence, in figure 2a the marginal propensity to spend
exclusive of consumption spending is greater than the marginal tax rate,
whereas in figure 1 this is not the case and its representation (in a diagram
like figure 2a) would call for a downward-sloping E - T curve with
respect to GNP.
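The multiplier statement can be illustrated with a minimal numerical sketch. It assumes, purely for illustration, that both injections E and net taxes T are autonomous amounts that do not respond to income, so that equilibrium requires GPSR x GNP = E - T. The function name and every number below are hypothetical and do not come from the article.

```python
def equilibrium_gnp(gpsr, injections, net_taxes):
    """GNP at which leakages S + T equal injections E, with S = gpsr * GNP.

    Assumption for this sketch only: E and T are autonomous levels,
    so the condition gpsr * GNP + T = E solves to GNP = (E - T) / gpsr.
    """
    if gpsr <= 0:
        raise ValueError("gpsr must be positive")
    return (injections - net_taxes) / gpsr


gpsr = 0.15  # illustrative savings rate, not an estimate from the text

base = equilibrium_gnp(gpsr, injections=400.0, net_taxes=250.0)
more_spending = equilibrium_gnp(gpsr, injections=401.0, net_taxes=250.0)
lower_taxes = equilibrium_gnp(gpsr, injections=400.0, net_taxes=249.0)

# Effect on GNP of one extra unit of spending and of one unit less tax.
expenditure_multiplier = more_spending - base
tax_multiplier = lower_taxes - base

print(expenditure_multiplier, tax_multiplier, 1.0 / gpsr)
```

Under these assumptions a unit rise in E and a unit cut in T move GNP by the same amount, 1/GPSR. If E - T itself rises with GNP, as drawn in figure 2a, the response to a shift of the schedule would be larger than this sketch suggests.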

We draw the E - T curve with a positive slope in figure 2b only to
make the case for a stable GPS function no stronger than necessary.
Thus, if E - T is positively related to GNP, an alternative explanation for
the observed stability of the GPSR is that E - T is the more stable
function of output and that the average propensity to spend minus the
average tax rate is constant. Figure 2b illustrates this alternative,
representing E - T by the ray through the origin; the constant GPSR is
traced out by an unstable GPS function, denoted by SS, which wanders
up and down the E - T ray. Note that figure 2b is not simply a relabeling
of the schedules in figure 2a, because stability considerations require the
slope of the SS function to exceed that of the E - T ray.

We regard this second explanation for the observed stability of the
GPSR as the weaker of the two for the following reasons. First, it begs
the question of why the amalgam of different types of expenditure and
taxes should have a unitary elasticity with respect to GNP. Second, it
entirely ignores the autonomous components of E, that is, those unrelated
to GNP, whereas it is more conventional to treat much (if not all) of E
as autonomous, in which case E can hardly be a stable function of GNP.
Finally, the marginal propensity to spend out of GNP must be fairly
large if the E - T curve is to have a positive slope. For example, at
current (third quarter, 1971) rates, the aggregate average tax rate is
about .31 while the average propensity to consume out of GNP is .47.
Assuming that average and marginal propensities are the same, the total
propensity to spend out of GNP would have to approach an implausible
.80 if the response of E - T to GNP is to be positive.
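The arithmetic behind the .80 figure can be retraced in a short sketch. It rests on one reading of the argument: E - T slopes upward only if the marginal propensity to spend on E exceeds the marginal tax rate, and total spending out of GNP is consumption plus E. The variable names are illustrative; the two rates are the ones quoted in the text.

```python
# Rates quoted in the text for the third quarter of 1971.
average_tax_rate = 0.31
average_propensity_to_consume = 0.47

# Assumption stated in the text: average and marginal propensities coincide.
marginal_tax_rate = average_tax_rate
marginal_propensity_to_consume = average_propensity_to_consume

# For E - T to rise with GNP, spending on E per extra dollar of GNP
# must exceed the marginal tax rate.
min_nonconsumption_propensity = marginal_tax_rate

# Total propensity to spend = consumption + non-consumption spending.
min_total_propensity_to_spend = (
    marginal_propensity_to_consume + min_nonconsumption_propensity
)

print(round(min_total_propensity_to_spend, 2))
```

The threshold of .78 is the lower bound that the text rounds to "approach .80".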







The next two sections of this paper demonstrate the validity of Denison's
law for a much longer time span than in the earlier papers by Denison
and Hickman. On this point we are the beneficiaries of two developments
which were not accessible to Denison: (1) the generation of extra data
points just by the passage of time since Denison wrote and (2) the fruits
of historical research in estimating national accounts aggregates for the
years before 1929. We use this wealth of additional data to show that
with some slight modification Denison’s propositions regarding the secular
constancy in the GPSR, the remarkable degree of year-to-year stability
in that rate, and the far greater instability in the component personal and
corporate saving rates are all borne out by the accumulated evidence
of over 70 years.

Section III advances some explanations for our findings by making use
of the notion of ultrarationality mentioned earlier, and some further
implications of this style of explanation are considered briefly in the
concluding section, IV. We should make it clear at this point that no
pretense is made to provide a complete explanation, and our discussion
based on the idea of ultrarationality is meant primarily to be provocative.
Our aim in this paper must be to confront economists with the fact of a
stable GPSR, because the implications of that fact for both policy and
accepted theory are too important to go on being ignored. Denison
originally challenged the profession to see whether his findings would
stand up to additional evidence. We maintain that in essence they do,
and we leave in place of that challenge another one: to provide a
satisfactory explanation for the why of Denison’s law.

II

The regression equation for GPS presented by Denison for the period
1929-40/1948-56 is reported in table 1. The ΔGNP* variable is the
difference between the last “high employment” year GNP and the
current GNP. A “high-employment” year was designated by Denison
as one in which the unemployment rate remained below 6 percent.
Plainly, the intention was to differentiate the mass unemployment
experience of the thirties from the other peacetime years since 1929. As it
turns out, Denison might equally well have taken the previous peak value
of GNP, inclusive of the level attained in the current year. In empirical
experiments we have determined that in Denison’s period it makes
essentially no difference which definition is chosen.
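The "previous peak" reading of ΔGNP* can be sketched as a running maximum. This is a minimal illustration of the definition given above (peak GNP to date, inclusive of the current year, minus current GNP); the function name and the sample series are hypothetical and are not data from the article.

```python
def peak_gap(gnp):
    """Return, for each year, the running peak of GNP minus current GNP.

    The peak includes the current year's level, so the gap is zero
    whenever GNP sets a new high and positive otherwise.
    """
    gaps = []
    peak = float("-inf")
    for value in gnp:
        peak = max(peak, value)  # previous peak, inclusive of this year
        gaps.append(peak - value)
    return gaps


# Hypothetical GNP series with a slump in the third and fourth years.
sample_gnp = [100, 104, 98, 101, 110]
sample_gap = peak_gap(sample_gnp)
print(sample_gap)
```

On this definition the variable is never negative, and it vanishes in every year in which output reaches a new peak.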

That being the case, we are free to interpret ΔGNP* as the difference
between the previous peak and current levels of aggregate income, and
on such a reading the formulation of Denison’s law in table 1 bears an
obvious correspondence to the models of consumption behavior made
familiar by Duesenberry (1949), Modigliani and Brumberg (1954),



TABLE 1

Relation for Gross Private Saving Estimated for 1898-1916/1921-40/1948-64 and Subperiods

[Table body illegible in the source.]





and Ando and Modigliani (1963). Alternatively, the previous peak GNP
and ΔGNP* may be read as the respective GNP counterparts of
Friedman’s (1957) concepts of a permanent component and a transitory
component of personal income. Note that on the latter interpretation the
estimates in the first equation of table 1 imply that the marginal
propensity to save out of aggregate transitory income is greater than the
corresponding propensity with respect to aggregate permanent income,
a result quite in keeping with Friedman’s findings regarding the direct
savings behavior of the household sector. One further virtue of making
this switch to a “peak income” definition is that, in investigating the
applicability of Denison’s law to U.S. experience prior to 1929, it obviates
the need to depend on the comparatively less reliable estimates of the
rate of unemployment.

The effect of the Commerce Department’s 1965 revision of the national
accounts was to raise the mean level of the GPSR for 1948-56 rather
more markedly than the mean for the ensuing subperiod (1957-64)
examined by Hickman; the revision for the first is from .1480 to .1543,
while the revision for the second is only from .1527 to .1557. The result
is that the stability of the GPSR within the 1948-64 interval appears still
more striking in the revised data. A reestimation of our version of Denison’s
equation for the period 1929-40/1948-56 using the revised Commerce
Department figures is reported in table 1. The results there show that the
only effect of the revisions of the national accounts has been to shift the
coefficient attached to GNP, which is the estimate of the GPSR, upward
from .1463 to .1544. Denison’s specification implied a constant average
GPSR for ΔGNP* = 0. This was tested by estimating a constant term,
which turned out to be statistically insignificant at the 95 percent
confidence level.

The fit of the Denison relation remains spectacularly good, and,
although the high R² should be taken with a grain of salt in view of the
serial correlation present in GPS and GNP, the closeness of the fit is
equally manifested by the low (.043) ratio of the standard error of estimate
to the mean of the dependent variable.
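A minimal sketch of the two quantities discussed here: a regression of GPS on GNP and ΔGNP* without a constant term, and the ratio of the standard error of estimate to the mean of GPS. The no-constant form follows the specification described above; the function names, the degrees-of-freedom convention (n - 2), and the sample numbers are assumptions made for illustration and are not the authors' procedure or data.

```python
import math


def fit_no_intercept(gps, gnp, gap):
    """Least-squares fit of gps = a * gnp + b * gap, with no constant term.

    Solves the two normal equations directly by Cramer's rule.
    """
    sxx = sum(x * x for x in gnp)
    szz = sum(z * z for z in gap)
    sxz = sum(x * z for x, z in zip(gnp, gap))
    sxy = sum(x * y for x, y in zip(gnp, gps))
    szy = sum(z * y for z, y in zip(gap, gps))
    det = sxx * szz - sxz * sxz
    if det == 0:
        raise ValueError("regressors are collinear")
    a = (sxy * szz - szy * sxz) / det
    b = (szy * sxx - sxy * sxz) / det
    return a, b


def see_to_mean_ratio(gps, gnp, gap, a, b):
    """Standard error of estimate divided by the mean of the dependent variable."""
    n = len(gps)
    residuals = [y - a * x - b * z for y, x, z in zip(gps, gnp, gap)]
    see = math.sqrt(sum(r * r for r in residuals) / (n - 2))
    return see / (sum(gps) / n)


# Hypothetical series built to lie exactly on gps = 0.15 * gnp - 0.30 * gap.
sample_gnp = [100.0, 104.0, 98.0, 101.0, 110.0]
sample_gap = [0.0, 0.0, 6.0, 3.0, 0.0]
sample_gps = [0.15 * x - 0.30 * z for x, z in zip(sample_gnp, sample_gap)]

a_hat, b_hat = fit_no_intercept(sample_gps, sample_gnp, sample_gap)
ratio = see_to_mean_ratio(sample_gps, sample_gnp, sample_gap, a_hat, b_hat)
print(a_hat, b_hat, ratio)
```

Because the sample series is constructed without noise, the ratio is essentially zero; with real data the ratio measures the typical proportional error in predicted GPS, which is the sense in which the text reports .043.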

The latter statistic is also indicative of the extremely small year-to-
year variability of the GPSR around its estimated “full employment”
value. Note that the ratio for the regression for 1948-64 alone is even
smaller, .034. Proportional “errors” of this magnitude appear trivial
indeed when ranged alongside the comparative size of the contemporaneous
statistical discrepancy in the national income accounts. The
standard deviation of the latter as a fraction of the mean of gross private
savings was .037 for the years 1929/1948-56 (addendum in table 1).

The temporal scope of the empirical regularity isolated by Denison
may now quickly be shown to be rather wider than the 1929-40/1948-56
results reported in the first two regressions of table 1. For, as the addendum




of table 1 discloses, that regression relationship performs quite respectably
if asked to predict the volume of “full employment” GPS for the period
after 1956 and for the years immediately antecedent to 1929.

The ratios of the standard errors of the forecasts to the average levels
of GPS for the forecast periods are shown in the addendum to table 1.
The ratio for the forecast period 1957-64 studied by Hickman is about the
same as the ratio of the standard error of the regression to mean GPS
for 1948-56: the forecast ratio is .032, the estimation ratio is .034. The
ratio of the standard deviation of the statistical discrepancy to mean
GPS, however, is much smaller than the forecast ratio in this prediction
period. Because it is also smaller than its counterpart in the 1929/1948-56
period, the fall relative to the forecast ratio must be attributed primarily
to the Commerce Department’s increased accuracy rather than to any
notable deterioration in the predictive performance of the Denison
equation.

The results for the extended forecast period 1957-69 suggest that there
may have been some deterioration in the year-to-year stability of the
GPSR from 1965 to 1969. For the extended period the forecast ratio
is .054, over one percentage point greater than the estimation ratio for
the 1929/1948-56 period. This deterioration is the result of the historically
high savings rates for 1966 and 1967; one has to go back to 1925 to find a
GPSR which is higher. Whether these sorts of extremes will recur remains
an open question. Certainly the longer-run stability of the GPSR seems
unaffected by them; the average GPSR over the 1961-69 cycle is the
same as the average for the 1954-57 cycle.

The forecast ratios are reported for the period including the 1920s
as well. Only the “high employment” years for the twenties, that is,
1925, 1926, and 1928, are used. Not surprisingly, adding only 3 years
does not change things very much; the forecast ratio is raised from .054
to .058. For those years separately, however, the forecast ratio is .119;
by comparison the ratio for the statistical discrepancy is .073.³

While there is evidence of greater year-to-year variability in the GPSR
before the Second World War, there is no evidence of any longer-run
instability. In table 1 are reported the results of estimating the Denison
relation for the entire period 1921-40/1948-64 and for the pre- and
post-World War II subperiods. A Chow test for the equality of coefficients
between subperiods was performed; the test statistic was considerably
below the 95 percent confidence limit, leading to the acceptance of the
null hypothesis of no difference in the coefficients across subperiods. If

³ For the years prior to 1929 we have estimated GPS data from both the income side
and the expenditure side using the work of Goldsmith (1955) and Kendrick (1961). The
difference between the two sets was used to calculate the statistical discrepancy. A
regression of one series on the other excluding the years 1917-20 yielded an insignificant
intercept term and a slope coefficient of .98.





saving in favor of consumer durables, a conclusion in accord with the
view that the 1920s marked a watershed in the widening of the range of
consumer durables offered and their acceptance and consequent adoption
by the public.

When restated in terms of plausible, more comprehensive savings and
income variables, the “Denison relationship” can therefore be said to
have persisted for an impressively long span of time. Indeed, there are
indications that the long-run level of the GPSRA has remained unchanged
during the past 100 years.*

III

It is important to understand that the regularity in the GPSR which we
have been at some pains to document in the last section is not the result
of constant sectoral savings propensities, of a stable distribution of output
between the public and private sectors, and, within the latter, of a stable
division of income between the corporate and household sectors. If the
historical experience of the U.S. economy could be portrayed as a
movement along a balanced growth path of that sort, our findings of a stable
GPSR would be neither surprising nor very interesting.

The facts, however, suggest a different picture. The extreme stability
of the GPSR (and the GPSRA) masks two rather important trends.
First, there has been a shift in the composition of private saving away
from personal saving to corporate saving and expenditure on consumer
durables. Second, the total of private saving has been almost totally
insensitive to the share of output absorbed by the public sector, given the
level of output. The first trend, in the context of a stable private saving
rate, implies that wealth holders, for whatever reasons, behaved as if the
components of saving were perfect substitutes. The second also implies
a high degree of substitutability, in this case between private consumption
and taxes and between private investment and government dissaving.
Each of these results has implications which are far-reaching, intriguing,
and important in terms of their application to public policy and economic
theory.

Cycle averages of the underlying components for the corporate and
personal savings rates’ contributions to the GPSR are listed in tables 3
and 4, respectively. The trends in these components, summarized by
pre-World War I, interwar, and post-World War II averages, reveal

* Although reliable annual estimates of output and saving are not available for the
period before 1898, it is possible to construct decade averages of the GPSR and GPSRA
for the period 1869-1908 using the work of Gallman and Davis (1968) and David and
Abramovitz (1969). The estimates of the GPSR range from .173 to .191. For the GPSRA
the range is from .216 to .233. The estimates for GPSR and GPSRA in the last decade,
1899-1908, are, respectively, .187 and .233. These compare with the estimated coefficients
on GNP for the 1898-1916 period of .1635 and .2326, respectively.





that there has been a redistribution of private saving away from personal
saving narrowly defined and in favor of corporate saving and consumer
durables expenditures. For example, the personal saving rate out of
disposable personal income (DPI) fell by about 25 percent from the pre-
World War I period to the period after World War II, while the corporate
saving rate rose by over 20 percent. The fall in the personal rate is nearly
two percentage points; over the same period the ratio of consumer durables
expenditures to DPI increased by five percentage points.

The comparison of the personal saving rate and the rate of durables
expenditures, however, is sensitive to the choice of base period. Thus,
using the 1898-1916 period as a base suggests a shift in favor of durables;
the fall in personal saving to DPI from this period to the post-World War
II period is from .0804 to .0627, while CED/DPI (not shown) rises from
.0657 to .0970. But the changes in the rates from the 1921-29 period
to the present are both positive: a 1.37 percentage point change for the
conventional savings rate and 2.19 points for the durables expenditures
rate. Obviously, the important shift in the composition of personal saving
(including consumer durables) comes at the end of World War I.*
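The comparisons above can be retraced from the period averages in tables 3 and 4. The sketch below uses the personal saving to DPI column of table 4 and, as an assumption about which series "the corporate saving rate" refers to, the ratio of gross corporate saving to after-tax profit from table 3. Function and variable names are illustrative.

```python
# Period averages copied from table 4 (personal saving to DPI) and
# table 3 (gross corporate saving to gross corporate profit after tax).
PERSONAL_SAVING_TO_DPI = {"1898-1916": 0.0804, "1921-29": 0.0490, "1949-69": 0.0627}
CORPORATE_SAVING_TO_PROFIT = {"1898-1916": 0.6626, "1921-29": 0.7110, "1949-69": 0.8073}


def point_change(old, new):
    """Change between two rates, in percentage points."""
    return (new - old) * 100.0


def percent_change(old, new):
    """Proportional change between two rates, in percent of the old rate."""
    return (new - old) / old * 100.0


# Pre-World War I base versus post-World War II.
personal_points_prewar_base = point_change(
    PERSONAL_SAVING_TO_DPI["1898-1916"], PERSONAL_SAVING_TO_DPI["1949-69"]
)
personal_percent_prewar_base = percent_change(
    PERSONAL_SAVING_TO_DPI["1898-1916"], PERSONAL_SAVING_TO_DPI["1949-69"]
)
corporate_percent_prewar_base = percent_change(
    CORPORATE_SAVING_TO_PROFIT["1898-1916"], CORPORATE_SAVING_TO_PROFIT["1949-69"]
)

# 1920s base versus post-World War II.
personal_points_twenties_base = point_change(
    PERSONAL_SAVING_TO_DPI["1921-29"], PERSONAL_SAVING_TO_DPI["1949-69"]
)

print(personal_points_prewar_base, personal_percent_prewar_base)
print(corporate_percent_prewar_base, personal_points_twenties_base)
```

With the prewar base the personal rate falls by about 1.8 points (roughly 22 percent) and the corporate ratio rises by about 22 percent; with the 1920s base the personal rate rises by 1.37 points, which is the sign reversal the paragraph describes.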

One explanation for these large shifts is the response of savers to the
differential tax treatment of income from different forms of investment.
Thus, the secular rise in the corporate retention rate together with the
decline in the narrowly defined personal saving rate suggest that the
personal and corporate income tax schedules, along with the treatment
of capital gains, the tax shelter features of depreciation provisions in the
corporation income tax laws, and the like, have combined to effectively
subsidize corporate saving at the relative expense of personal saving.†

While the high rate of substitution of corporate for personal saving 
shown in tables 3 and 4 may prompt some suspicion that the foregoing 

* Thus, we ally ourselves with Oshima (1961) and Juster and Lipsey (1967) in finding
a rise in share of durables spending in income after World War I. On the other hand, we
find no evidence of a rising trend in the importance of durables thereafter, and in this
respect we agree with Vatter and Thompson (1966) and Vatter (1967). In support of
this last proposition, note that the augmented personal saving rate (PSR) is about half a
percentage point higher in the post-World War II period than in the twenties and about
a third of a percentage point higher than in the years before 1917. Neither of these
differences is significant. Moreover, the PSR increased by a small amount between the
twenties and the 1948-64 period, implying that the rise in the ratio of CED to GNP was
very marginal. Indeed, across the three cycles following 1948, the decline of the augmented
PSR, when the conventional rate fell by only half as much, supports Vatter and
Thompson's point that the high spending on durables after World War II reflected backlogged
replacement demand.

† On the other hand, the contrast between (1) the divergent behavior of the consumer
durables expenditures rate and PSR out of DPI prior to 1929, when tax rates did not
change very much, and (2) the parallel movement of these rates after 1929 as both
corporate and personal tax rates rose, suggests that the shift in favor of durables during
the 1920s was primarily a response to a change in the supply price of durables rather than
to the effect of the heavier taxation of other income upon the relative after-tax rate of
return on this class of investments.



TABLE 3

Components of Gross Corporate Saving Rate Out of GNP: Cycle Averages,
1898-1916/1921-29/1949-69

                                     Gross Corporate     Gross Corporate
                                     Saving to           Profit after Tax     Gross Corporate
                  Gross Corporate    Gross Corporate     to Gross Corporate   Profit before
Cycle             Saving to GNP      Profit after Tax    Profit before Tax    Tax to GNP

1898-99*              .1182              .7863               1.0000              .1503
1900-3                .1192              .6932               1.0000              .1725
1904-7                .1041              .6169               1.0000              .1688
1908-10               .1023              .6101               0.9967              .1681
1911-13               .0964              .5943               0.9940              .1632
1914-16†              .1221              .6748               0.9888              .1822
1921-23               .1023              .7107               0.9231              .1564
1924-26               .1064              .7407               0.9138              .1573
1927-29               .1047              .6815               0.9183              .1673
1949-53               .1041              .7947               0.6933              .1875
1954-57               .1123              .8101               0.7325              .1893
1958-60               .1136              .8126               0.7536              .1857
1961-69               .1147              .8118               0.7608              .1858

Period averages
1898-1916             .1104              .6626               0.9966              .1675
1921-29               .1045              .7110               0.9184              .1603
1949-69               .1112              .8073               0.7351              .1870

* Complete cycle 1897-99
† Complete cycle 1914-18


TABLE 4

Components of Personal Saving Rate Out of GNP: Cycle Averages,
1898-1916/1921-29/1949-69

                                                                         Personal Saving
                                                                         Plus Consumer
                               Personal       Disposable                 Expenditure on
                               Saving to      Personal                   Durables to GNP
                  Personal     Disposable     Income to      Personal    Plus Gross Rental
                  Saving       Personal       Personal       Income      Flow on
Cycle             to GNP       Income         Income         to GNP      Durables

1898-99            .0716        .0877          .9667          .8249        .1211
1900-1903          .0771        .0941          .9863          .8288        .1283
1904-7             .0783        .0926          .9894          .8414        .1330
1908-10            .0685        .0816          .9885          .8462        .1212
1911-13            .0528        .0642          .9886          .8319        .1113
1914-16            .0522        .0624          .9871          .8429        .1134
1921-23            .0368        .0449          .9716          .8399        .1096
1924-26            .0461        .0571          .9743          .8293        .1258
1927-29            .0364        .0451          .9709          .8286        .1169
1949-53            .0476        .0676          .8915          .7928        .1337
1954-57            .0452        .0647          .8830          .7917        .1289
1958-60            .0410        .0583          .8784          .7988        .1197
1961-69            .0414        .0601          .8689          .7925        .1234

Period averages
1898-1916          .0668        .0804          .9877          .8360        .1214
1921-29            .0398        .0490          .9722          .8326        .1174
1949-69            .0438        .0627          .8804          .7940        .1264



cannot have been the sole influences at work, this is not nearly so strong
an objection as it appears to be at first glance. When one considers the
high degree of concentration of ownership in corporate debt and equity in
the hands of those occupying the upper income brackets, the differences
which tax laws tended to create in marginal after-tax rates of return could
quite plausibly have provided the impetus for the marked portfolio shift
observed.8

Whatever the reason for the shift in composition (and we return to
the question of the role of taxes later), the intriguing point remains that
the shift left the GPSR relatively undisturbed. In other words, wealth
holders behaved as if corporate and personal saving were near-perfect
substitutes in the portfolio.

Bailey (1972) argues that such behavior is consistent with households
exhibiting a degree of ultrarationality in their dealings with the corporate
sector which he calls “perfect foresight.” Thus, households with perfect
foresight perceive that a switch in corporate financing from retained
earnings to debt issue, for example, does not affect their permanent
incomes. Their consumption therefore does not change; all that occurs is a
switch from corporate to personal saving, with no change in the total.

Certainly the behavior of corporate and personal saving is consistent
with such a paradigm. The question remains of whether it can be used
to explain the two other examples of perfect substitution suggested by the
data and mentioned earlier, namely, between consumption and taxes and
between investment and the government deficit.

These types of substitution are implied by the remarkable year-to-year
stability in the GPSR despite swings in the amount of output absorbed by
the government in taxes and debt issue, and the lack of a trend in the
GPSR in the face of an inexorable rise in the size of the government sector.
The stability of the GPSR despite changes in disposable income implies
that consumption and taxes in effect are traded off one for one. Its
invariance to changes in the size of the government deficit suggests that
private debt and public debt are close substitutes in private portfolios.

The application of Bailey’s specification of ultrarationality to relations 
between the public and the government implies the same sort of 

8 Brittain’s (1966) study of corporate dividend policy, for example, found that one of
the major determinants of the desired payout rate (which the period averages in table 3
presumably come close to measuring) was the tax shelter offered to the highest decile of
personal income tax payers by the combination of lower corporation income tax and
capital gains tax rates. Thus, a rise in the personal income tax rate for that class of
taxpayers would lower the desired payout ratio, i.e., increase corporate saving. The other
major determinant of corporate saving that Brittain identified was the depreciation
shelter provision of the corporation tax laws; thus, liberalized depreciation provisions
tended to raise the retention rate out of net profits after tax. But this second influence
may be allowed for in redefining the base, as Brittain and we have done, in terms of
cash flow, i.e., gross profits after tax. It is also possible that the depreciation shelter
provisions contributed to the rise in the ratio of gross corporate profits before tax to
GNP which is observed after World War II.





TABLE 5

Stability of Gross Private and Gross National Saving Rates, 1898-1971

Columns (for each rate):
(1) Mean
(2) Mean of Absolute Deviations of Annual Rates from Mean
(3) Mean of Absolute Deviations as Percentage of Mean

                   Gross Private Saving Rate     Gross National Saving Rate
Period              (1)      (2)      (3)         (1)      (2)      (3)

1898-1928          16.77     2.79    16.64       16.92     2.82    16.67
1929/1946-64       15.46     0.41     2.65       15.51     1.29     8.31
1946-64            15.50     0.40     2.57       15.49     1.33     8.59
1948-71            15.62     0.46     3.07       15.46     1.17     7.57


Modigliani-Miller theorem applied to government finance that we saw
for private finance. A switch from taxes to Treasury debt issue should
leave consumption unaffected, while private saving varies directly, dollar
for dollar, with the government deficit. On this construction of
ultrarationality, private saving and public saving are perfect substitutes and
national saving is the stable aggregate given the level of output. Yet that
is at variance with the facts, which suggest that public saving and private
investment are complements and that gross private saving is the more
stable of the two aggregates in its relation to gross income.

We have adduced evidence that the GPSR is highly stable, both from
year to year and over the long run. Some evidence on whether it has been
more stable than the national saving rate is shown in table 5. Table 5
contains period means for both the national and the private savings
rates as well as the mean absolute deviations of annual rates from their
period means. For the period before 1929 the two rates are equally
stable or unstable from year to year as measured by the ratio of their mean
absolute deviations to their means. But in the period after World War II,
the national rate is clearly the more unstable; its variability is from
two and one-half to four times that of the GPSR. Over the long run the
national rate appears marginally less steady, having been slightly larger
than the GPSR before 1929 and somewhat smaller since then.
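
The stability measure used in table 5 can be written out in a few lines. The sketch below is illustrative rather than a reconstruction of the original computation: the function names are hypothetical, and the only data used are the percentage figures printed in the last column of each panel of the table.

```python
def mad_percent_of_mean(rates):
    """Return (mean, mean absolute deviation, MAD as a percentage of the mean)."""
    n = len(rates)
    mean = sum(rates) / n
    mad = sum(abs(r - mean) for r in rates) / n
    return mean, mad, 100.0 * mad / mean


# Column (3) of each panel of table 5: (private, national), in percent.
TABLE_5_MAD_PCT = {
    "1929/1946-64": (2.65, 8.31),
    "1946-64": (2.57, 8.59),
    "1948-71": (3.07, 7.57),
}


def relative_variability(table):
    """Variability of the national rate as a multiple of that of the GPSR."""
    return {
        period: national / private
        for period, (private, national) in table.items()
    }


if __name__ == "__main__":
    # Hypothetical annual saving rates, used only to exercise the function.
    print(mad_percent_of_mean([15.0, 16.0, 17.0]))
    for period, ratio in relative_variability(TABLE_5_MAD_PCT).items():
        print(f"{period:>13}: national / private = {ratio:.2f}")
```

Dividing the mean absolute deviation by the mean makes the measure unit-free, so that rates with different period means can be compared directly.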

The notion of complete, ultrarational household portfolio management
need not be abandoned, however, if a modification in Bailey’s model is
accepted. Suppose, to return to the analogy with pure private sector
ultrarationality, that there is a shift in the type of firm expenditure
coincident with the change in the method of financing. To be concrete,
imagine that dividends are increased by reducing expenditure on
nonmonetary forms of executive compensation, such as free vacations, expense
accounts, and the like, which are treated as consumption items by their
recipients, while the new debt issue is used to finance capital expenditure.
Hence, there is a shift in the time profile of income coincident with a
change in the manner of financing. If households prefer the initial
distribution of consumption, they will use the additional dividends to
replace the consumption formerly provided by firms rather than spend
them on the new debt issue. At the same time, because greater provision
for future income is now being made in the corporate sector, households
will reduce their own investment outlays. In other words, households
absorb the new debt by reducing expenditure on internally financed
investment; corporate investment and household investment are perfect
substitutes in this example.

Similarly, at the national level, if the private sector regards tax-financed
government expenditures as belonging to the category of consumption,
and bond-financed public outlays as investment, the marginal propensity
to consume out of tax reductions is unity, and private saving rather than
national saving is the stable aggregate. Bailey’s dismissal of such behavior,
which is exactly what we observe, as inconsistent with total rationality,
stems from his implicit assumption that the uses to which public funds are
put yield services that are perfect substitutes in private utility functions.
Given that assumption and perfect foresight, national saving logically is
the stable aggregate. But if consumption and investment (saving) are not
perfect substitutes in household plans, and if the purposes of public
expenditures are believed to be intimately tied to their mode of financing,
then a stable private saving function is perfectly consistent with
rationality.9
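
The contrast between the two notions of ultrarationality can be made concrete with a toy accounting exercise. The sketch below is a minimal illustration under stated assumptions, not a model taken from the text: the linear consumption rules, the parameter values, and all function names are hypothetical. Output and government spending are held fixed while a given amount of financing is switched from taxes to debt.

```python
from dataclasses import dataclass


@dataclass
class Accounts:
    consumption: float
    private_saving: float   # Y - T - C
    national_saving: float  # private saving plus the budget surplus (T - G)


def _accounts(y, g, taxes, consumption):
    private = y - taxes - consumption
    return Accounts(consumption, private, private + (taxes - g))


def bailey_households(y, g, taxes, mpc=0.8):
    """Consumption ignores how G is financed (hypothetical rule C = mpc * (Y - G))."""
    return _accounts(y, g, taxes, mpc * (y - g))


def stable_gpsr_households(y, g, taxes, a=0.0, b=0.85):
    """Consumption plus taxes is a stable function of output: C + T = a + b * Y."""
    return _accounts(y, g, taxes, a + b * y - taxes)


if __name__ == "__main__":
    y, g = 1000.0, 200.0
    for rule in (bailey_households, stable_gpsr_households):
        balanced = rule(y, g, taxes=200.0)   # fully tax financed
        deficit = rule(y, g, taxes=190.0)    # 10 switched to debt issue
        print(rule.__name__)
        print("  change in consumption    ", deficit.consumption - balanced.consumption)
        print("  change in private saving ", deficit.private_saving - balanced.private_saving)
        print("  change in national saving", deficit.national_saving - balanced.national_saving)
```

Under the first rule the tax cut is saved in full and national saving is the invariant; under the second it is consumed in full and gross private saving is the invariant, which is the pattern the data are said to show.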

In recognition of imperfect knowledge, Bailey offers pragmatic criteria
for the optimal financing mix for government expenditure. The argument
is roughly as follows. Where the benefits and incidence of government
expenditure are localized and certain, the cost of financing them should
be distributed in the same way, which leads to a presumption favoring the
tax financing of such types of expenditures. By contrast, where the benefits
are highly diffused or their incidence is uncertain, a bias is created in
favor of debt financing, inasmuch as the costs of debt service similarly will
be widely diffused and relatively uncertain. If we interpret Bailey’s
dichotomy between certain and uncertain as a distinction between present
and future consumption benefits, households conforming to our standard
of ultrarationality will anticipate (or see to it) that their government in
effect follows his recommendations: finance public investment with bonds,
and pay for present benefits (consumption) with taxes.


9 Bailey (1962, p. 73), in an earlier discussion of ultrarationality, writes one version of
the consumption function as

$$\frac{C + G_c}{P} = \frac{a}{P} + b\,\frac{Y}{P}. \tag{1}$$

All items are in nominal terms, with $P$ being the deflator, $C$ the private consumption
expenditure, $G_c$ the consumption element in government expenditure, and $Y$ the GNP.
If we identify $G_c$ with $T$, total taxes, the GPS function by (1) is

$$\frac{S}{P} = \frac{Y}{P} - \frac{C + T}{P} = \frac{Y}{P} - \frac{C + G_c}{P} = -\frac{a}{P} + (1 - b)\,\frac{Y}{P}.$$

Hence this variant of ultrarationality implies that GPS is a stable function of GNP and
that at least the marginal GPSR is highly stable.




This last prescription allows us to confront a problem which we have
avoided up to now: our version of ultrarationality is at odds with the
explanation of the shift in the composition of private saving as an attempt
to minimize the burden of taxes. If households are ultrarational in the
way we have described, and are indifferent as to whether consumption
and investment are provided through the public or private sectors, why
should the amount of taxes (and, for that matter, debt issue) be of any
concern? Indeed, rather than indulge in tax avoidance, an ultrarational
sector should act to equalize gross-of-tax rates of return.

Tax avoidance can be reconciled with ultrarationality, however, if one
assumes that the government does not follow the optimal finance mix
policy and, instead, finances part of its investment program through taxes.
In such circumstances, tax avoidance can be interpreted as the private
sector’s attempt to shift the mix of financing toward the optimum directly,
without engaging the cumbersome political machinery of the Congress.
Of course, if we are allowed thus to suppose that the political process is
not wholly responsive, there is no assurance that legislative allocations will
reflect private demands for public goods; the systematic avoidance of
taxes could then be explained simply as the (private) attempt to rectify
what is perceived as the overproduction of public consumption services.
Yet this is an answer too much in the spirit of Alexander’s approach to
knots. We set out to see whether the phenomena of tax avoidance could
be accounted for in a model which envisaged the corporate and public
sector savings plans being integrated with and, at least for the long run,
being under the control of an ultrarational household sector. To depict
the household sector as locked in conflict with an unresponsive
government is to deny that the issue is worth bothering with, rather than to
resolve it.

Perhaps a more satisfactory, and certainly a more sophisticated,
interpretation is one that recognizes the practical problems of matching the
distribution of benefits of government expenditures with the incidence of
the costs, and the consequent incidental redistributive effects of
government fiscal operations. Bailey’s own discussion acknowledges that there
is likely to be some conflict within the private sector between those who
want publicly provided goods and those who must pay for them. And,
as we have previously remarked, the appearance of (rational) integrated
personal and corporate savings decisions designed to minimize the tax
burden might well arise from the behavior of the wealthy few who
effectively “own” the corporate sector. From the vantage point of these
households, the amount of public goods provided may indeed be too
large. The fact that the private sector is not composed of identical
households is surely an important element of the story, but tax avoidance may
be more than a symptom of nonoptimal, unintentionally redistributive
modes of financing which concentrate the cost incidence of programs
having widely diffused benefits. It could also be a manifestation of the
familiar “free rider,” or “revelation of demand for public goods,”
problem: everyone may wish to see some publicly provided goods
substituted for private consumer spending, and yet be reluctant to disclose
their willingness to bear the necessary share of the incremental taxes.

IV 

The particular conception of ultrarational household behavior that has
been advanced here to account for the apparent absence of any significant
long-run effects of tax changes on GPS, given the level of GNP, turns out
to carry several rather strong fiscal policy implications. These deserve to
be spelled out in greater detail and contrasted with the weaker, but
nonetheless significant, implications that flow more immediately from the
existence of the Denisonian empirical regularities which we have sought to
document and rationalize.

Unlike Bailey’s specification of perfectly foresighted behavior, our
brand of ultrarationality implies that an extra dollar of government
deficit will displace a dollar of private investment expenditure because
households autonomously treat deficits as public investment and regard
the public and private sectors’ investment projects as interchangeable.
We shall refer to this displacement effect as the “ex ante crowding out”
of private investment.10 When output is at the full-employment ceiling,
however, it is difficult to distinguish the foregoing phenomenon (at
least as far as the impact on private investment is concerned) from what
may be called “ex post crowding out.” In the latter case, an output
constraint, a stable demand function for money, no change in monetary
policy, and a stable GPSR imply that an increase in the government
deficit must displace an equal amount of private expenditure through
a rise in interest rates, no matter how the households view public debt
instruments and public investment projects.

The difference between the two forms of crowding out is illustrated by
the loanable-funds diagrams shown in figure 3a and b. The vertical line
$SI_0$ in each is the amount of GPS, and hence the supply of loanable funds,
for an initial equilibrium level of output of $Y_0$. The fact of a stable GPSR
is reflected in the complete interest inelasticity of saving.11 Schedule II,
the marginal efficiency of investment curve, represents the demand
for funds for investment. Initially there is no government borrowing,
and equilibrium occurs with the interest rate at $r_0$ and the level of

10 Cf. Friedman and Heller (1969, pp. 54-56) for a discussion of hypotheses about the
“crowding out” effects of government bond issues.

11 Wright (1969) presents evidence for a positive substitution interest elasticity of
aggregate savings. But the estimates, while statistically significant, are not large; the highest
estimate is 0.27. And Wright admits that the effect of interest rate changes on income
makes the interest rate elasticity of the saving rate smaller. The high degree of stability
of the GPSR that we have shown suggests that the interest elasticity of the savings rate is,
for all practical purposes, zero.







[Fig. 3, panels a and b: loanable-funds diagrams]


investment at $OI_0$. Now suppose the government borrows $\Delta G$. The ex ante
case is shown in figure 3a, where the private sector treats the government
deficit as part of investment and incorporates it into the investment
schedule II. The demand for private investment therefore declines by
$\Delta G$ to $I'I'$ and the rate of interest is unchanged.

The alternative form is illustrated by figure 3b, in which the addition
of government borrowing of $\Delta G$ raises the total demand for loanable funds
by that amount to $I_G I_G$. If there is some elasticity in the demand for or
supply of money, the resulting rise in $r$ generates an excess supply of real
balances, part of which is attracted into the loanable-funds market.
The supply of loanable funds can thus differ in the short run from the
funds supplied out of GPS, but in the full equilibrium where the demand
and supply of the stock of money are perfectly matched, the supply of
loanable funds is identical with GPS. It is clear then, in terms of figure 3b,
that the new equilibrium depends on what happens to $SI_0$ when the public
sector goes into deficit.

If $Y_0$ is less than full-employment output, the increase in expenditure
financed out of idle balances (or increases in the stock of money caused by
a rise in the money multiplier) raises the level of aggregate output and
hence shifts the $SI_0$ curve to the right. The total of government and
private borrowing is larger than before, and therefore the government
deficit does not completely displace private investment.

Alternatively, if $Y_0$ is full-employment output, $SI_0$ is the fixed,
full-employment level of saving. An increase in government borrowing in
this situation drives up the rate of interest to the point where private
investment declines by the increase in the deficit. As is illustrated in
figure 3b, in the final equilibrium the rate of interest has become $r_1$
and private investment has declined by $I'I_0 = \Delta G$. This is the case of
ex post crowding out.
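
The two loanable-funds cases can be traced through with a small numerical sketch. Everything specific in it is an assumption made for illustration: the linear form of schedule II, the parameter values `alpha` and `beta`, the function names, and the `induced_saving` argument, which stands in for the rightward shift of the saving line when output is below full employment. Saving is taken to be completely interest inelastic, as in the text.

```python
def mei_rate(investment, alpha=0.20, beta=0.001):
    """Interest rate at which `investment` is demanded along schedule II."""
    return alpha - beta * investment


def ex_ante(saving, deficit, alpha=0.20, beta=0.001):
    """Households count the deficit as investment (figure 3a).

    The private schedule shifts left by the deficit, so total demand for
    funds is unchanged and the market clears at the original rate.
    """
    private = saving - deficit
    rate = mei_rate(private + deficit, alpha, beta)
    return private, rate


def ex_post(saving, deficit, alpha=0.20, beta=0.001, induced_saving=0.0):
    """Government borrowing is added to an unchanged private schedule (figure 3b).

    With induced_saving = 0 (full employment) the rate must rise until
    private investment has fallen by the whole deficit.
    """
    private = saving + induced_saving - deficit
    rate = mei_rate(private, alpha, beta)
    return private, rate


if __name__ == "__main__":
    s0, dg = 100.0, 10.0
    print("initial        ", (s0, mei_rate(s0)))
    print("ex ante        ", ex_ante(s0, dg))
    print("ex post (full) ", ex_post(s0, dg))
    print("ex post (slack)", ex_post(s0, dg, induced_saving=4.0))
```

In both full-employment cases private investment falls by the full amount of the deficit; only the interest rate tells them apart. With slack, part of the borrowing is met out of additional saving and displacement is incomplete.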

In terms of the impact on the level of investment, it is impossible to
distinguish between the two types of crowding out. However, note that
the ex post example assumed a given disposable income. The stable
GPSR in that case can be rationalized in terms of a stable propensity to
consume out of disposable income. But the empirical evidence suggests
that, in the case where the increase in the government deficit was at the
expense of taxes, for example, the level of private saving still would not be
affected. In other words, the ex post crowding out can explain why private
investment and government dissaving behave like perfect substitutes,
but it is silent on why taxes and consumption should do so.

One’s view of the relative effectiveness of fiscal and monetary policy
for macroeconomic stabilization, and the appropriateness of econometric
exercises designed to test such views, turns out to depend in a fairly
crucial way on whether crowding out can be taken to be of the ex ante,
rather than the ex post, form. Given that taxes displace private
consumption expenditures, if private debt issues are displaced ex ante by
government bonds, there is no way that changes in the level of public expenditures
can affect aggregate demand and output (or prices); fiscal policy would be
useless for stabilization purposes. Nor would there be any additional
“kick” of a larger multiplier attached to debt-financed government
expenditure.

To establish the presence of the ex ante form of crowding out, however,
it is necessary to concentrate on the evidence provided by episodes in
which there is sufficient slack in the economy to preclude the existence
of ex post crowding out. Even this empirical stratagem is not totally
dependable, for, if the money demand and money supply functions are
extremely interest inelastic, considerable ex post crowding out may take
place even in conditions of less than full employment. Still, some
econometric indications of the importance of monetary policy may be obtained
by discarding observations relating to full-employment states, as in the
latter it is difficult to distinguish between the effects of interest inelasticity
of the demand and supply functions for money and the effects produced
by a stable GPSR and the constraint on output.

The absence of pronounced depressions, and the small year-to-year
variation of the GPSR during the postwar era, does rather suggest that
the disagreements which the monetarist-fiscalist debate has recently
stirred up regarding the proper interpretation of the empirical record for
the United States in this period may amount, after all, to little more than
a squabble over residuals.12 The only way that controversy is likely to

12 The article that started the whole debate is, of course, the now classic piece by
Friedman and Meiselman (1963). The evidence of a stable GPSR suggests, however,
that their tests using consumption as the dependent variable are biased against the
autonomous expenditure theory. The reason is that a stable GPSR implies that
consumption plus taxes is the stable function of GNP, and hence of autonomous expenditure.
Or to put it another way, the Friedman-Meiselman tests using consumption pick up not
only any instability in the multiplier but the instability in the private consumption
function as well.




be resolved is through investigations of the role played by monetary and
fiscal policy in fairly severe disequilibrium states. This is a line of approach
that has been pursued in recent time-series studies of monetary policy
during the Great Depression (e.g., Brunner and Meltzer 1968). But
there is also room for international cross-section analysis of the workings
of monetary and fiscal policy in less developed countries, where
disequilibrium takes the form of being “unlike” the advanced industrialized
economies. The analogy drawn between the disequilibrium states of the
industrially advanced countries and the prevailing situation in less
developed countries is not so strained as it might first appear.
Leijonhufvud’s (1968) work on the role of money in a general equilibrium
context has suggested that a disequilibrium state can usefully be modeled
as a situation in which information and transactions costs are at very
high levels, and this seems a quite natural characterization of chronic
(rather than transitory) conditions in the markets of many less developed
countries.

Of course, we are not confined to observing quantities alone, and the
behavior of interest rates and prices when fiscal policy is altered, even
in conditions of full employment, should provide information which
permits ex ante crowding out to be distinguished from its ex post twin.
Thus, if a movement toward fiscal ease leaves prices and interest rates
unaffected, this may be read as indicating that crowding out in the ex
ante form had occurred. Unfortunately, students of the postwar U.S.
economy will find it difficult to isolate such episodes of “pure” fiscal
policy changes. The evident commitment of the Federal Reserve to the
task of stabilizing interest rates “in support of” Treasury debt operations
results in fiscal policy shifts being accompanied by alterations of the money
supply. But the latter’s impact on prices and nominal interest rates (after
these have adjusted to inflationary expectation) runs in the same direction
as the price changes that would materialize in the course of ex post
crowding out, thereby muddying an otherwise clear test.

When one turns to the role of fiscal policy in the context of long-run
growth, it may seem that the distinction between the two forms of
crowding out with which we have been concerned is really unimportant. In
neoclassical growth theory, the supply constraint on output is assumed to
be binding, in which case the constancy of the GPSR is enough to
guarantee the same results with respect to the effect of government deficits or
surpluses on private investment as ex ante crowding out. Yet it does
matter. When households act in accord with our definition of
ultrarationality, the optimal mix of tax and debt financing for public
expenditure (which the government either adheres to or finds that the private
sector imposes by means of tax avoidance) introduces an additional
constraint. This leaves fiscal policy no scope to affect the accumulation
of capital, just as in the short run the combination of ex ante crowding
out and Denison’s law robs fiscal policy of any leverage over the state of
aggregate effective demand. In the long run, fiscal measures can achieve
an alteration of the distribution of activity between the private and public
sectors, but, by hypothesis, that shift will not disturb the growth path of
the economy. It would seem that the stress that optimum growth theory
(e.g., Foley and Sidrauski 1971) has put on the role of fiscal policy as a
control instrument is in need of some reconsideration. When the economic
relations between the household sector and its government are those which
our standard of ultrarationality implies, optimal fiscal policy begins to
appear just as neutral as money in long-run growth.

The stability of the GPSR in the face of substantial financial innovation
over the long span of time surveyed here might also suggest that the role
of financial intermediation in American economic growth has been
considerably less important than it has lately become fashionable to
maintain. But the temptation to jump to this revisionist conclusion should
be resisted, for it unwarrantedly neglects the significant effects of financial
intermediaries in facilitating the shifts in the composition of savings which,
over the long run, have probably been essential in staving off a decline
in the real rate of return on “capital.”13 The growth in mortgage
financing of housing expenditures, for example, freed household internally
generated funds for consumer durable purchases. These funds in turn
were supplanted by the growth of installment credit, which has allowed
internal finance to be applied to other forms of capital accumulation,
perhaps most notably now to education.14

Whatever the reason for the crowding out of private investment
(“lack of perfect foresight” as Bailey would have it, “ultrarationality”
as we have defined that, or just the conjunction of full employment and
the stable GPSR required by Denison’s law), the empirical findings
reviewed here offer a simple and direct answer to the important public
finance question: What rate of discount should be applied in evaluating
public investment projects and, more generally, to government borrowing
for any purposes? The historical evidence of a highly stable GPSR
suggests that the interest elasticity of saving is very low and may be
approximated by zero. That being the case, as Harberger (1971) has
shown and as one can plainly see from figure 3, we can say that the

13 Cf. Abramovitz and David (1973a, 1973b) on the capital-deepening bias of
technological change and the changing composition of capital formation over the course of
the past and present century of growth in the United States.

14 In his study of the demand for money Chow (1966) found that permanent income,
which presumably includes the return to human capital, was a better explanatory variable
than nonhuman wealth alone. However, the relationship between financial intermediation
and the growth of human capital mentioned above raises the question of whether
something other than a demand relationship is being measured. For the narrow definition, i.e.,
including only commercial bank demand deposits, this may not be important. But it is
not clear that studies of the demand for more inclusive definitions of money are not
contaminated by some simultaneity bias of this sort.




opportunity cost of the resources borrowed by the government ($\Delta G$)
is unambiguously represented by the area under the marginal efficiency
of investment schedule II in the interval $\Delta G$ between $I'$ and $I_0$.
Since the appropriate discount rate is given therefore by the social marginal
productivity of investment in the private sector, it would seem that the
awkward problems of measuring the social rate of time preference can be
justifiably finessed. Indeed, if households really were ultrarational in the
way we have described, the marginal rate of time preference would be
equated to the gross-of-tax social rate of return on private sector
investments.
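
The area rule for the opportunity cost of government borrowing can be illustrated numerically. The sketch below assumes, purely for illustration, a linear marginal efficiency of investment schedule with hypothetical parameters `alpha` and `beta`; the function names are likewise illustrative. The area is read as the annual return forgone on the displaced private investment.

```python
def mei_rate(investment, alpha=0.20, beta=0.001):
    """Marginal (gross-of-tax) rate of return at a given level of investment."""
    return alpha - beta * investment


def opportunity_cost(i_initial, borrowing, alpha=0.20, beta=0.001):
    """Area under schedule II between I' = I0 - borrowing and I0.

    For the linear schedule alpha - beta * I the integral has a closed form.
    """
    i_displaced = i_initial - borrowing
    return alpha * borrowing - 0.5 * beta * (i_initial ** 2 - i_displaced ** 2)


def implied_discount_rate(i_initial, borrowing, alpha=0.20, beta=0.001):
    """Forgone return per dollar borrowed by the government."""
    return opportunity_cost(i_initial, borrowing, alpha, beta) / borrowing


if __name__ == "__main__":
    i0, dg = 100.0, 10.0
    print("rate at I0      ", mei_rate(i0))
    print("rate at I'      ", mei_rate(i0 - dg))
    print("opportunity cost", opportunity_cost(i0, dg))
    print("discount rate   ", implied_discount_rate(i0, dg))
```

With a linear schedule the implied discount rate is simply the average of the marginal rates of return at the two ends of the interval; for small amounts of borrowing it approaches the marginal rate at the initial level of investment.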


References 

Abramovitz, Moses, and David, Paul A. “Economic Growth in America:
Historical Parables and Realities.” De Econ. 121 (May/June 1973): 251-72. (a)

---. “Reinterpreting Economic Growth: Parables and Realities of the
American Experience.” A.E.R. 63 (May 1973): 428-39. (b)

Ando, Albert, and Modigliani, Franco. “The ‘Life Cycle’ Hypothesis of Saving:
Aggregate Implications and Tests.” A.E.R. 53 (March 1963): 55-84.

Arak, M. V., and Spiro, Alan. “The Relationship between Permanent Income
and Measured Variables.” J.P.E. 79 (May/June 1971): 652-60.

Bailey, Martin J. National Income and the Price Level. New York: McGraw-Hill,
1962.

---. “The Optimal Full Employment Surplus.” J.P.E. 80 (July/August 1972):
649-61.

Bhatia, Kul B. “Accrued Capital Gains, Personal Income and Saving in the
United States, 1948-1964.” Rev. Income and Wealth, ser. 16, no. 4 (December
1971), pp. 363-78.

Brittain, John A. Corporate Dividend Policy. Washington: Brookings Inst., 1966.

Brunner, Karl, and Meltzer, Allan. “Liquidity Traps for Money, Bank Credit
and Interest Rates.” J.P.E. 76 (January/February 1968): 1-37.

Chow, Gregory C. “On the Long-Run and Short-Run Demand for Money.”
J.P.E. (April 1966): 111-31.

David, Paul A., and Abramovitz, Moses. “Economic Growth in the United States
Savings in the Nineteenth Century.” Unpublished. Stanford Center Res.
Econ. Growth, August 1969.

Denison, Edward F. “A Note on Private Saving.” Rev. Econ. and Statis. 40 (August
1958): 261-67.

Duesenberry, James S. Income, Savings, and the Theory of Consumer Behavior.
Cambridge, Mass.: Harvard Univ. Press, 1949.

Foley, Duncan, and Sidrauski, Miguel. Monetary and Fiscal Policy in a Growing
Economy. New York: Macmillan, 1971.

Friedman, Milton. A Theory of the Consumption Function. Princeton, N.J.: Princeton
Univ. Press, 1957.

Friedman, Milton, and Heller, Walter W. Monetary vs. Fiscal Policy. New York:
Norton, 1969.

Friedman, Milton, and Meiselman, David. “The Relative Stability of Monetary
Velocity and the Investment Multiplier in the United States, 1897-1958.”
In Stabilization Policies. New York: Prentice-Hall, 1963.



Friend, Irwin, and Schor, Stanley. “Who Saves?” Rev. Econ. and Statis. 41 (May
1959): 213-48.

Gallman, Robert E., and Davis, Lance E. “The Share of Savings and Investment
in Gross National Product during the 19th Century, United States of America.”
Memo no. 63. Unpublished. Stanford Center Res. Econ. Growth, July 1968.

Goldsmith, Raymond W. A Study of Saving in the United States. Princeton, N.J.:
Princeton Univ. Press, 1953.

Grunfeld, Yehuda, and Griliches, Zvi. “Is Aggregation Necessarily Bad?”
Rev. Econ. and Statis. 42 (February 1960): 1-13.

Harberger, Arnold C. “On Measuring the Social Opportunity Cost of Public
Funds.” Inst. Defense Analysis Econ. Papers (May 1971), pp. 1-45.

Hickman, Bert G. “Investment Demand in the Sixties.” Reprint no. 120.
Washington: Brookings Inst., 1966.

Juster, F. Thomas. Household Capital Formation and Financing, 1897-1962. Princeton,
N.J.: Princeton Univ. Press (for Nat. Bur. Econ. Res.), 1966.

Juster, F. Thomas, and Lipsey, Robert E. “A Note on Consumer Asset Formation
in the United States.” Econ. J. 77 (December 1967): 834-47.

Kendrick, John W. Productivity Trends in the U.S. Princeton, N.J.: Princeton Univ.
Press (for Nat. Bur. Econ. Res.), 1961.

Leijonhufvud, Axel W. B. On Keynesian Economics and the Economics of Keynes.
New York: Oxford Univ. Press, 1968.

Modigliani, Franco, and Brumberg, Richard. “Utility Analysis and the
Consumption Function: An Interpretation of Cross-Section Data.” In
Post-Keynesian Economics, edited by K. K. Kurihara. New Brunswick, N.J.: Rutgers
Univ. Press, 1954.

Oshima, Harry. “Consumer Asset Formation and the Future of Capitalism.”
Econ. J. 71 (March 1961): 20-35.

Vatter, Harold G. “Consumer Asset Formation in the United States: A
Rejoinder.” Econ. J. 77 (December 1967): 848-55.

Vatter, Harold G., and Thompson, Robert L. “Consumer Asset Formation and
Economic Growth: The United States Case.” Econ. J. 76 (June 1966): 312-27.

Wright, Colin. “Saving and the Rate of Interest.” In The Taxation of Income
from Capital, edited by Arnold C. Harberger and Martin J. Bailey. Washington:
Brookings Inst., 1969.




The Cost of School Time, Foregone 
Earnings, and Human Capital Formation 


Donald O. Parsons

Ohio State University


A simple educational investment model is used to demonstrate that, if
students are subject to borrowing constraints, foregone earnings are not
identical to schooling time costs, since students will sacrifice leisure as
well as earnings. Direct measurement of schooling hours and work hours
of young males reveals that at the high school level the bulk of school
hours result from foregone leisure. A review of the foregone-earnings
measures used in a number of major human capital studies is undertaken
to determine the approximate bias in human capital formation and
rate-of-return estimates resulting from this source.


This paper deals with theoretical and empirical aspects of the proper
measurement of school time costs. Using foregone-earnings measures,
most researchers have concluded that schooling time costs are the bulk of
all school-related human capital investments. Schultz (1960, p. 580)
estimated such costs as about 60 percent of all costs in 1956, while
Becker's (1964, p. 75) estimate is 74 percent for private college education
costs. The major point developed below is that indirect measures of the
cost of school time such as "foregone earnings" seriously underestimate
school time costs.

As a practical matter, there are three major components of schooling
costs: direct expenditures on tuition, books, transportation, etc.; the
alternative value of the hours devoted to the schooling activity; and the
reduced earnings on any hours the student continues to work due to
part-time work wage penalties. In most of what follows, the emphasis will
be on the estimation of schooling hours, although the size of the part-time
work differential will also be estimated.

This research was supported in part by the Manpower Administration, U.S. Department
of Labor, and the Ohio State University Computer Center. The comments and
criticisms of Belton Fleisher, T. W. Schultz, and an anonymous referee are gratefully
acknowledged. Research assistance was provided by Donald Smith, Jill Campbell, and
Howard Atkins.



Virtually all major human capital studies have used earnings differentials
of youth out of and in school as an indirect measure of school time
costs, since earnings data are considerably more accessible than direct
time allocation data.¹ Implicit in this indirect measure of schooling time
costs is the proposition that the difference in hours worked between the
two groups is equal to schooling hours or, from the individual's viewpoint,
that the student will reduce his work hours by a quantity identical to his
schooling hours.

This work/schooling assumption is untenable, however, as it appears
that students as a group devote substantially more time to market
activities (work plus schooling) than do nonstudents of the same age and
level of schooling completed. To the extent that students are generally
more capable at market activities, of course, one might expect them to
devote more time to market activities throughout their lifetimes. This
tendency is not particularly strong at later points in the life cycle,² but
other factors may augment the effect for students in school. As will be
developed at greater length in Section I, borrowing difficulties during
the educational investment period induce students to sacrifice leisure as
well as work hours to attend school, since consumption cannot be financed
by future earnings.

In Section II, some evidence is presented on the extent to which
schooling hours are drawn from work hours and from leisure hours. For
high-school-age youth, for whom the direct schooling hours data are
richest, evidence indicates that substantial work time and leisure time
are both sacrificed for school hours on average. Evidence on the wage
rate differential of youth in and out of school is also presented. The
principal data source is the National Longitudinal Surveys, a 5-year
panel survey of (among others) about 5,000 male youth 14-24 years of
age in the first survey.

A comparison of methods of calculating school time costs used in
several important human capital studies is presented in Section III. Data
problems have not even allowed the uniform measurement of the indirect

¹ See Section III below for a more detailed description of measurement methods in a
number of major human capital studies.

² Becker (1964, p. 169) was aware of the problem that work hour differences between
schooling groups may affect the interpretation of differences in earning streams and
rate-of-return estimates. He presented evidence which suggested that among employed
workers there is no strong relationship between hours of work and education level.
Lindsay (1971) considers this same issue at greater length. A regression run on data
obtained from the National Longitudinal Surveys found the following for men age 45-59:

work hours = 2,022 + 13.7 years of school,   R² = .015.
                     (4.6)

This suggests that a worker who has completed 4 years of college will work on average
55 hours a year, or something less than 3 percent, more than one who has completed only
high school.





measure of school time, foregone earnings, so the actual methods and
assumptions used are applied to a uniform data base to determine the
degree of bias of each from directly measured school hours. With some
exceptions, the foregone-earnings technique led to significant underestimation
of the investment of time in schooling activities of males,
particularly at the high school level. The study which best estimates
foregone earnings most seriously underestimates school time costs.

I. A Theoretical Model of the Schooling-Work-Leisure Decision

Schooling choice models in a perfect capital market are well established
in the human capital literature. Ben-Porath (1967) presented an elegant
multiperiod schooling-work model, while Becker (1967a) developed a
similar model incorporating leisure choice as well. If one assumes that an
individual in school will undertake both leisure and work in that particular
period and that assorted interperiod interactions are not large, there is
some justification for assuming that schooling activities will substitute
only for work hours, as the foregone-earnings approach requires. Under
such a set of assumptions, the constant marginal value of work hours,
namely, the wage rate, makes work hours a buffer, with the value of
school hours and leisure hours being marginally equated to the wage rate
in equilibrium.

A shift up in the schooling returns schedule in a given period will
disturb the equality of the wage rate and discounted marginal returns to
schooling, inducing more schooling. Ignoring indirect effects, the equality
of the constant wage rate and value of leisure time will be unaffected,
suggesting that leisure hours will remain unchanged and work hours
only will be reduced. Obviously a shift up in schooling productivity is
likely to change long-run wealth and future-period wage rates which are
themselves likely to affect demand for leisure, but these may be second-order
effects.

One difficulty recognized by both Ben-Porath and Becker is that there
will be a tendency to acquire education early in the life cycle, since the
finiteness of life ensures that the stream of returns to a given educational
investment will be larger the earlier the investment is undertaken. In
fact, without the assumption of sufficient diminishing schooling productivity
within periods as well as cumulatively, a complete specialization
of the individual in schooling and leisure would be the typical case. As
Becker (1967a, p. 14) has noted, in such cases the wage rate is no longer
a useful index of the value of time³ and, moreover, leisure hours would
be strictly and negatively dependent on school time productivity.

³ This is rather obvious, since the individual finds his working wage sufficiently
unattractive that he chooses not to work. Gronau (1971) considers this question at great
length in the context of valuing the time of housewives.





Individuals who find schooling more productive will tend to reduce their
consumption of leisure, ceteris paribus. Even in traditional educational
investment models, then, foregone earnings will underestimate schooling
time costs if a youth does not work part-time.

Students, however, do typically undertake part-time work. A sample
of about 5,000 males age 15 to 25 in 1967 reveals that the age-work
hours profile for full-time students (where full-time student status was
self-declared) is approximately linear, annual work hours rising from
under 500 hours at age 15 to almost 900 hours at age 23 (see table 1 for
details). As mentioned above, Ben-Porath and Becker suggest that the
reason for part-time work by students is that the efficiency of schooling
investments decreases within time intervals as well as cumulatively
across periods. Since increased work hours are also likely to affect
adversely the efficiency of a fixed number of school hours, the predicted
substitution of work hours for school hours in this situation is not obvious.

An alternative explanation of the existence of part-time work among
students is that students frequently face rather severe capital constraints.⁴
The assumption that students can borrow and lend at some set interest
rate is intuitively unappealing. It seems more plausible to assume that
individuals have great difficulty in borrowing to finance human capital
investments and instead are forced to self-finance their schooling by
working part-time.⁵ Below I construct a schooling model to take account
of this borrowing difficulty in a simple way and examine the effects of
this modification on the schooling, work, and leisure choices.
To formalize the discussion, consider an n-period model in which
lifetime utility is a function of the stream of leisure hours and consumption
over the n periods, so

\[ U = U(L_1, L_2, \ldots, L_n, X_1, X_2, \ldots, X_n), \tag{1} \]

where $L_t$ is leisure hours in the tth period and $X_t$ is consumption of a
composite market good in the tth period. Assume that past schooling
investments increase the tth period wage rate ($w_t$) in the following way:

\[ w_t = f(S_t)\, w_0, \tag{2} \]

where

\[ S_t = \sum_{j=1}^{t-1} s_j, \]

⁴ Becker (1967a), Stafford (1968), and Gustman (1969) consider various aspects of the
borrowing difficulties of youth. The complex problems of family support levels are
considered in a separate paper.

⁵ Empirical tests of these two different hypotheses are not hard to construct. The
capital constraint model would suggest, for instance, that summer school would be
attended by students from generally wealthier families. The secular increase in average
daily attendance of school youth in the United States, a rise from 99 to 159 days between
1900 and 1957 for students 5-15 (see Schultz 1962, p. 97), could be explained by the
general rise in affluence under the capital constraint hypothesis, while the intraperiod
efficiency hypothesis would presumably require some technology argument.





with $s_j$ the time units of schooling undertaken in the jth period and $w_0$
the initial-period wage rate. Presumably $f' > 0$ and $f'' < 0$.

For simplicity, consider the polar case where no borrowing is assumed:
all consumption and other expenditures undertaken by students must
be financed by past or present earnings or assets. In its most general
form, the capital constraint can be represented by a set of n inequalities
that ensure that the individual will never be a net borrower, namely, the
following:

\[ A_0 + \sum_{j=1}^{t} \delta^{j} \left[ w_j (N - L_j - s_j) - P X_j - c\, s_j \right] \ge 0, \tag{3} \]

for $t = 1, 2, \ldots, n$,

where $\delta = 1/(1 + r)$ and r should be interpreted strictly as the lending
rate, $A_0$ is the individual's initial asset level, N is the total hours available
per period, c is the level of direct schooling costs (assumed proportional
to school hours), and P is the price level of market goods.
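
The no-borrowing constraints in (3) can be checked numerically for any candidate plan. The sketch below is only an illustration and not part of the original analysis: the function names `asset_path` and `never_borrows`, the two-period plans, and every number in them are hypothetical, and the code assumes that the period-j cash flow is discounted by delta raised to the power j.

```python
def asset_path(a0, wages, leisure, schooling, consumption,
               hours, price, unit_cost, r):
    """Cumulative discounted net asset position after each period.

    Assumption: the cash flow of period j is discounted by delta**j,
    with delta = 1 / (1 + r) and r the lending rate.
    """
    delta = 1.0 / (1.0 + r)
    position = a0
    path = []
    periods = zip(wages, leisure, schooling, consumption)
    for j, (w, l, s, x) in enumerate(periods, start=1):
        # Earnings on work hours, minus goods purchases and direct school costs.
        cash_flow = w * (hours - l - s) - price * x - unit_cost * s
        position += delta ** j * cash_flow
        path.append(position)
    return path


def never_borrows(path):
    """True when every cumulative position is nonnegative, as (3) requires."""
    return all(a >= 0 for a in path)


# Hypothetical two-period plans: school in period 1, full-time work in period 2.
common = dict(a0=0.0, wages=[1.0, 2.0], leisure=[40, 50], schooling=[30, 0],
              hours=100, price=1.0, unit_cost=0.25, r=0.0)
frugal = asset_path(consumption=[20, 60], **common)
lavish = asset_path(consumption=[40, 60], **common)

print(frugal, never_borrows(frugal))
print(lavish, never_borrows(lavish))
```

In this toy case both plans satisfy the lifetime wealth constraint, since the final position is positive, but the second would require borrowing in the schooling period, which is exactly what the set of period-by-period inequalities rules out.
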

The objective of the individual is to maximize expression (1), his
utility function, subject to the capital constraints in the n periods
represented by the inequalities in (3). The Lagrangian appropriate to
this case is

\[ Y = U(L_1, L_2, \ldots, X_n)
   + \sum_{t=1}^{n} \lambda_t \left\{ A_0 + \sum_{j=1}^{t} \delta^{j} \left[ w_j (N - L_j - s_j) - P X_j - c\, s_j \right] - A_t \right\}, \tag{4} \]

where the $\lambda$'s are Lagrangian multipliers and the $A_t$'s are nonnegative
slack variables introduced to ensure equality of inequality set (3).
Intuitively, $\lambda_t$ represents the marginal utility of a given wealth level in a
particular period. In a model with borrowing and lending at a fixed
rate, only $\lambda_n$ would be greater than zero, since lifetime wealth and not
the particular time stream of wealth position would be relevant. The
quantity $A_t$ represents net nonhuman assets at the end of the tth period
and is constrained in this problem to be nonnegative because borrowing
is not allowed.

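When both leisure and market goods are consumed in all periods, as
would typically be the case, the Kuhn-Tucker conditions for a maximum
utility level are the following set of equalities and inequalities for each
period:

\[ \frac{\partial Y}{\partial X_t} = \frac{\partial U}{\partial X_t} - P\, \delta^{t} \sum_{i=t}^{n} \lambda_i = 0, \tag{5} \]

\[ \frac{\partial Y}{\partial L_t} = \frac{\partial U}{\partial L_t} - w_t\, \delta^{t} \sum_{i=t}^{n} \lambda_i = 0, \tag{6} \]

\[ \frac{\partial Y}{\partial s_t} = \sum_{i=t}^{n} \lambda_i \left[ -\delta^{t} (w_t + c) + \sum_{j=t+1}^{i} \delta^{j}\, \frac{\partial w_j}{\partial s_t}\, (N - L_j - s_j) \right] \le 0, \tag{7} \]

\[ \frac{\partial Y}{\partial \lambda_t} = A_0 + \sum_{j=1}^{t} \delta^{j} \left[ w_j (N - L_j - s_j) - P X_j - c\, s_j \right] - A_t = 0, \tag{8} \]

\[ \frac{\partial Y}{\partial A_t} = -\lambda_t \le 0, \tag{9} \]

where the inequalities in (7) and (9) hold only if the related control
variable is optimally zero. The expression (9) suggests that if the optimal
strategy has the individual end the period with positive physical assets,
$\lambda_t = 0$ or, equivalently, the constraint in that period can be ignored.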

Interpreting the results, equations (5) and (6) suggest that an individual
making a purchase or consuming leisure (thereby foregoing income)
must weight the cost of the activity by the reduced options in all future
periods in which the capital constraint is effective. Consumption today
has implications for consumption in specific later periods as well as over
the lifetime.

The schooling expression is simply a modification of the traditional rule
that schooling should not be undertaken whenever the net discounted
flow of incremental income is less than the sum of school-related costs.
The only difference is in the definition of discounting, since borrowing is
not possible. In periods where borrowing would have been undertaken,
the individual is forced to self-finance expenditures and the value of
earnings exceeds the lending discount factor. Rearranging expression (7)
in terms of cash flows by periods, one has

\[ -\left( \sum_{i=t}^{n} \lambda_i \right) \delta^{t} (w_t + c)
   + \sum_{j=t+1}^{n} \left( \sum_{i=j}^{n} \lambda_i \right) \delta^{j}\, \frac{\partial w_j}{\partial s_t}\, (N - L_j - s_j) \le 0. \tag{10} \]

Occurring first, schooling costs will always be weighted more heavily
than future returns if capital difficulties exist, leading to the intuitively
obvious result that borrowing difficulties reduce the profitability of
schooling in a given period. An individual may "drop out" of school to
earn money sufficient to eliminate the capital constraint but will still
generally find less schooling profitable than his counterpart who, because
of initial wealth or borrowing possibilities, invested in schooling earlier
in the life cycle.

The main justification for developing the capital constraint model
here is to acquire some notion of the substitutability of leisure and
schooling hours. The model suggests that schooling time will come from
leisure activities as well as work hours, whether work hours which would
be undertaken are estimated from the behavior of comparable youth
out of school or from the same youth after schooling is completed. That
leisure time in equilibrium will be more valuable to a youth in school
than a comparable youth out of school (assumed to be subject only to the
lifetime wealth constraint) can be seen by the ratio of marginal utilities of
leisure of students to those of nonstudents:

\[ \frac{U_{L}^{\text{student}}}{U_{L}^{\text{nonstudent}}} = \frac{\sum_{i=t}^{n} \lambda_i}{\lambda_n} = 1 + \sum_{i=t}^{n-1} \frac{\lambda_i}{\lambda_n} > 1 \tag{11} \]

if at least some of the $\lambda_i$ are positive. Obviously, the full wealth positions
of the student and nonstudent are not likely to be identical, so relationship
(11) is, at best, a tendency, but one which raises some doubts about the
validity of any simple application of the foregone-earnings concept.

Longitudinal data which would allow one to obtain data on work
behavior during and after schooling for the same individual would not
necessarily lead to a more satisfactory measure of schooling hours. It is
easy to demonstrate that, if one ignores the complicating factor of direct
school costs,

\[ \frac{\partial U / \partial L_t}{\partial U / \partial L_{t+1}} > 1 \]

if a capital constraint is effective in period t. In fact, the time path of the
marginal utility of leisure will fall over the lifetime in this simple model,
suggesting that, if the taste for leisure does not vary between periods,
leisure will increase over the life cycle. Clearly, non-investment-related
variations in wage rates over the life cycle, due, say, to maturation or
physical depreciation, will modify this simple time structure. The
possibility remains, however, that schooling hours may come from leisure
hours as well as work hours, suggesting that foregone earnings will
systematically underestimate school time costs.


II. The Value of School Time: Some Empirical Evidence

The schooling model developed in the last section casts some doubt on
the assumption that lies behind the use of foregone earnings as a proxy
for value of school time, namely, that an additional hour of school implies
one less hour of work. Direct estimates of schooling time and work time
are developed below which will allow estimation of the percentage of
schooling time that is the result of foregone leisure. Wage rate functions
are also estimated separately for youth in school and out to measure the
wage rate penalty that students, typically part-time workers, must bear.
This is a second dimension of schooling activity costs.




Foregone Work and Leisure Hours

Both intuition and the work hours data in the sample of male youth
examined here indicate that youth in school work substantially fewer
hours than do otherwise comparable youth out of school. Annual work
hour estimates for male youth are presented in table 1 for out-of-school
youth, full-time students, and part-time students. The work hour-age
profiles were estimated as functions quadratic in age. Work hours are
presented as an age profile because of the strong dependence of work
hours on age, presumably resulting from a combination of family support
trends, changing social pressures with age, and growing levels of maturity.
The interesting question, of course, is the nature of the trade-off between
work and schooling, not simply whether they are interdependent. To
what extent are school hours derived from leisure hours?

To answer that question, a measure of schooling hours is needed. Such
information is more readily available for high school activity than for
college or university study. Schooling hours are composed of two major
components, classroom hours and homework time. The Coleman report
(Coleman 1966) presents data obtained from principals on days in session,
length of academic day, and attendance rates for high school seniors
(and certain lower classes as well) from which one can derive a measure
of annual hours spent in class per student. The principals reported the
figures presented in table 2 for the schools attended by white children.⁶
The deviations from the mean are proportionately quite small for all but
daily hours. Average annual hours in class can be estimated then as

annual school class hours = annual days × daily attendance rate
                            × hours per day
                          = (179.4)(0.943)(5.8) = 981.1,

or approximately 1,000 hours.
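
As a quick check on the arithmetic, the product of the three table 2 means can be reproduced in a few lines. This is a minimal sketch; the variable names are chosen here for illustration, and the attendance percentage is entered as a proportion.

```python
# Means reported by principals (table 2).
annual_days = 179.4        # days in session per year
attendance_rate = 0.943    # average daily attendance, as a proportion
hours_per_day = 5.8        # length of the academic day in hours

# Annual class hours per student: days attended times hours per day.
class_hours = annual_days * attendance_rate * hours_per_day
print(f"annual class hours: about {class_hours:.0f}")
```

The product lands within a fraction of an hour of the 981.1 reported in the text, so rounding to roughly 1,000 hours is harmless for what follows.
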

The National Longitudinal Survey asks a question of all high school
students and youth immediately out of high school on hours spent per
week on homework, the second major component of schooling hours.
The results are tabulated below in table 3. Assuming mean values for each
interval and a mean of 22 for the open-ended class, mean weekly homework
hours of 7.9 can be computed. For the approximately 40 school
weeks per year, these figures would suggest annual homework hours
of 316. Combining class hours of 981 with estimated homework hours of
316, we can derive an estimate of 1,297, or approximately 1,300 hours

⁶ These figures are largely confirmed by other sources. The average legal minimum
hours of school for those states (34) which have such laws are 3.86 per day in 1970. In
23 states, the legally required days per year are precisely 160 days (see Education
Commission of the States 1970). Average daily membership and average daily attendance for
all public elementary and secondary schools are also available (National Education
Association 1970). From these, the rate of absence can be calculated for the same period
as 6.4 percent.





TABLE 1

Annual Work Hours of Male Youth, 1967*

        In School Full-Time   In School Part-Time   Out of School
Age             (1)                   (2)                (3)
14              282                     3                494
15              402                   366                768
16              507                   694              1,018
17              599                   986              1,246
18              677                 1,242              1,450
19              741                 1,463              1,631
20              791                 1,648              1,789
21              827                 1,797              1,923
22              850                 1,911              2,035
23              858                 1,990              2,123
24              852                 2,032              2,187

* Data are from the National Longitudinal Surveys. Annual work hours were derived
from the following:

work hours = -2,863 + 322.4 age -  6.98 age²,   R² = .069,   (1)
             (4.59)   (4.74)      (illegible)

work hours = -8,822 + 879.7 age - 17.81 age²,   R² = .390,   (2)
             (3.19)   (3.15)      (2.64)

work hours = -5,785 + 611.3 age - 11.63 age²,   R² = .197,   (3)
             (5.77)   (6.?7)      (4.97)

where absolute values of t-ratios are in parentheses.


TABLE 2

Figures Used in Calculation of Average Class Hours by High School Students*

                                      Mean    Standard Deviation
Annual days in session               179.4          4.03
Average daily attendance (%)          94.3          2.38
Length of academic day (in hours)      5.8          1.12

* Relevant to white students. Reported by principals (Coleman 1966, 2 79).


TABLE 3

Weekly Homework Hours in High School, Male Youth, 1966

                                        Hours
                   0     1-4     5-9    10-14   15-19   20 and More
Frequency (%)     2.9    26.7    36.9    23.2     7.2       2.9

Source: National Longitudinal Surveys.


for an estimate of total schooling hours for high school youth. For lack
of data this figure will also be used as an estimate of college hours, although
any conclusions drawn for college age youth must be considered extremely
tentative.
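
The homework calculation can be retraced from the grouped frequencies in table 3. The sketch below is illustrative only: it assumes that the first, unlabeled frequency (2.9 percent) refers to youth reporting zero hours, takes interval midpoints as the "mean values" (2.5, 7, 12, and 17), uses 22 for the open-ended class as stated in the text, and normalizes by the sum of the reported frequencies. The variable names are hypothetical.

```python
# (assumed mean hours in class, frequency in percent) from table 3.
# Assumption: the first frequency belongs to a zero-hours class.
homework_distribution = [
    (0.0, 2.9),
    (2.5, 26.7),   # 1-4 hours
    (7.0, 36.9),   # 5-9 hours
    (12.0, 23.2),  # 10-14 hours
    (17.0, 7.2),   # 15-19 hours
    (22.0, 2.9),   # 20 and more; mean of 22 taken from the text
]

total_share = sum(share for _, share in homework_distribution)
mean_weekly = sum(hours * share for hours, share in homework_distribution) / total_share

school_weeks = 40          # approximate school weeks per year (from the text)
class_hours = 981          # annual class hours derived from table 2

annual_homework = round(round(mean_weekly, 1) * school_weeks)
total_school_hours = class_hours + annual_homework

print(round(mean_weekly, 1), annual_homework, total_school_hours)
```

Under these assumptions the grouped data give a weekly mean that rounds to 7.9 hours, which in turn yields the 316 annual homework hours and the 1,297 total used in the text.
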

⁷ One single college survey of homework hours suggested that college students spend
considerably fewer hours at schooling activities than do high school youth (see Marwardt
and Sikkink 1970). Measurement of the investment component of social interaction (bull
sessions, etc.) would be necessary before any firm conclusions could be drawn, however,
on this issue.













TABLE 4

Schooling Hours and Work Hours

        Average School Hours   Foregone Work Hours*   Work Hours Share†
Age             (1)                    (2)                   (3)
14             1,300                   212                  16.3
15             1,300                   366                  28.2
16             1,300                   511                  39.3
17             1,300                   647                  49.7
18             1,300                   773                  59.4
19             1,300                   890                  68.4
20             1,300                   998                  76.7
21             1,300                 1,096                  84.3
22             1,300                 1,185                  91.2

* From table 1, column 3 minus column 1.
† Column 2 divided by column 1 times 100.


It is now possible to estimate what share of schooling hours comes
from reductions in work hours. In table 4, column 1, estimated schooling
hours are reported, while in column 2 the difference between work hours
of nonstudents and work hours of students, or "foregone work hours," is
computed. Column 3 then provides an estimate of the percentage of total
school hours which are foregone work hours. Quite clearly, for the younger
age levels (where the schooling time estimates are most relevant) the bulk
of schooling hours are due to sacrificed leisure rather than sacrificed work
hours, or, conversely, the bulk of school time released when a young male
drops out of high school is directed (voluntarily or otherwise) into leisure
time. If wage rates for youth in school and out-of-school youth were
comparable and the most simplistic measure of foregone earnings were
used as the measure of schooling hours, these estimates would be radically
wrong. Wage rates, however, are not equal for youth in and out of school
(even with equal levels of schooling and age), and, moreover, because
of data limitations on earnings, most foregone-earnings measures are not
so straightforward. Foregone-earnings measures used in a number of
important human capital studies are considered in detail in Section III
to determine the likely bias they contain. It turns out that in most cases
the bias is considerably reduced because of inaccurate assumptions about
the magnitude of student earnings. Before considering this matter, I
consider first the nature of the wage rate process.
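
The construction of table 4 is simple enough to retrace by hand or in code. The following sketch, with hypothetical names, recomputes columns 2 and 3 for a few ages from the table 1 entries, taking the 1,300-hour schooling estimate as given.

```python
SCHOOL_HOURS = 1300  # direct estimate of annual schooling hours (table 4, column 1)

# age: (work hours of full-time students, work hours of out-of-school youth),
# taken from table 1, columns 1 and 3.
work_hours = {
    14: (282, 494),
    15: (402, 768),
    17: (599, 1246),
    18: (677, 1450),
}


def foregone_share(age):
    """Return (foregone work hours, percent of school hours drawn from work)."""
    full_time, out_of_school = work_hours[age]
    foregone = out_of_school - full_time
    return foregone, 100.0 * foregone / SCHOOL_HOURS


for age in sorted(work_hours):
    foregone, share = foregone_share(age)
    # The remainder of school hours is, by construction, foregone leisure.
    print(age, foregone, f"{share:.1f}% from work, {100 - share:.1f}% from leisure")
```

At age 14, for example, only about one-sixth of school hours is matched by a reduction in work hours, which is the sense in which the bulk of high school time comes out of leisure.
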


Unit Value of Time⁸

Allocating hours from work to school activity creates a secondary problem
in that students face considerably lower wage rates for their typically
part-time work than they would face if they were seeking full-time

⁸ For a more detailed analysis of wage rate functions for a subset of the individuals
considered here, see Parsons (1972).








TABLE 5

Hourly Wage Rate Regressions of Male Youth, 15-25 in 1967*

                            Log School
Group          Constant       Years       Log Age      R²     Sample Size
Nonstudents     -3.692        0.678        0.920      .186        768
                (9.39)       (8.96)       (6.91)
Students        -2.658        1.205         ...       .116      1,341
               (12.23)      (13.31)

Source: National Longitudinal Surveys.

* Dependent variable is the logarithm of hourly wage rates. Calculated hourly wage rates
greater than $7.50 are deleted. The absolute values of the t-ratios are in parentheses.


TABLE 6

Hourly Wage Rate Possibilities In and Out of School for Male Youth, 1967*

Schooling Level (Age)    Wage Rate In School    Wage Rate Out of School
      9 (14)                   $0.99                    $1.25
     10 (15)                    1.12                     1.43
     11 (16)                    1.26                     1.62
     12 (17)                    1.40                     1.82
     13 (18)                    1.54                     2.03
     14 (19)                    1.69                     2.24
     15 (20)                    1.83                     2.46
     16 (21)                    1.98                     2.69
     17 (22)                    2.13                     2.92

* Calculated from estimates in table 5.


employment. This results presumably from the lower productivity
part-time workers have to firms as well as the reduced incentive the part-time
worker has to search out alternative employers and higher wages.
The precise size of this wage rate differential is a matter of some interest,
since it will affect direct measures of foregone earnings. Part of the
earnings foregone will be due to reduced wage rates of students rather
than reduced hours.

Wage rates as a logarithmic function of years in school and age are
presented in table 5 for male nonstudents and students. The age variable
was not introduced in the student wage rate regression, since school year
and age are extremely highly correlated for youth still in school. Assuming
the youth followed the normal course of completing each grade in 1 year,
so age = schooling years + 5, one can use the information in table 5
to estimate the youth's wage rate options in and out of school at various
points in the schooling process. The average wage rate options in and
out of school at various years of schooling are given in table 6. In percentage
terms, wage rates of in-school youth vary from about 79 percent of
what they could have earned had they decided to work full-time at age 14
to about 73 percent at age 22. This penalty forms another incentive to
specialize in schooling and allocate work time to the postschooling period.
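
The step from the table 5 coefficients to the table 6 wage options can be sketched as follows. The function names are hypothetical, and the code assumes natural logarithms throughout, an assumption that appears consistent with the published figures.

```python
import math


def wage_in_school(school_years):
    """Predicted hourly wage of a student (table 5, student regression)."""
    return math.exp(-2.658 + 1.205 * math.log(school_years))


def wage_out_of_school(school_years):
    """Predicted hourly wage of a nonstudent (table 5, nonstudent regression).

    Uses the normal-progress assumption in the text: age = schooling years + 5.
    """
    age = school_years + 5
    return math.exp(-3.692
                    + 0.678 * math.log(school_years)
                    + 0.920 * math.log(age))


for school_years in (9, 17):
    w_in = wage_in_school(school_years)
    w_out = wage_out_of_school(school_years)
    print(school_years, round(w_in, 2), round(w_out, 2),
          f"{100 * w_in / w_out:.0f}% of the out-of-school wage")
```

For 9 and 17 years of schooling this reproduces, to the cent, the first and last rows of table 6 and the 79 and 73 percent ratios cited in the text.
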












III. Estimates of Foregone Earnings in Selected
Human Capital Studies

A number of important human capital studies have estimated foregone
earnings at different levels of schooling. Because of data problems,
however, most have not derived their estimates from direct measurement
of differences in work hours between nonstudents and students. The
closest any study has come to that is the corresponding measurement of
differences in earnings, presented in a recent paper by Solmon (1971).
Ironically, Solmon's estimates, while more precise measures of earnings
differences, are more seriously biased estimates of value of school time
than any of the prior studies operating with more limited data. The main
purpose of this section is to compare methods of estimating the value of
school time used in a number of human capital studies with a single data
set. This should give us an idea of the accuracy of the methods used
relative to the directly measured 1,300 hours of school.

Consider the following alternative methods of estimating the value of
school time.

Method 1. The Actual Difference in Earnings of Students and Nonstudents

As mentioned above, this approach is used by Solmon (1971) and is
purported to be a refinement of early estimates by Schultz (1960).
Restricted to the hours allocation aspect, this approach involves computing
the difference in work hours between students and nonstudents.
Estimates of this measure appear in table 4, column 2.

Method 2. Three-fourths Earnings of Nonstudents

This measure was used in the classic rate-of-return studies by Becker (1964)
and Hanoch (1967), where the particular fraction was derived from
two other studies of student-nonstudent earnings differentials (see Becker
1964, p. 170). Although really only another way of measuring the same
quantity measured more directly by method 1, practically it yields
different results, since full-time students typically work more than one-fourth
as much as nonstudents.

Method 2a. The Full Earnings of Nonstudents

Justified purely on the grounds of simplicity, this measure is used by
Hansen (1963) in deriving the value of school time.⁹ If students should

⁹ To quote Hansen (1963, p. 130): "For example, at age eighteen the opportunity cost
for the person undertaking four years of college is the income that the high school graduate
would obtain from ages eighteen to twenty-one. This procedure made it unnecessary to
rely upon indirectly estimated opportunity cost figures."





TABLE 7

Weekly Work Hours, Male Youth, 1966*

         All Youth    Nonstudent Youth
Age         (1)             (2)
14         19.6            31.8
15         24.2            33.7
16         28.4            35.5
17         32.1            37.2
18         35.3            38.7
19         38.0            40.1
20         40.2            41.3
21         41.9            42.4
22         43.1            43.4

* Data are from the National Longitudinal Surveys. The age profiles
were derived from the following estimated relationships:

hours per week = -97.6 + 11.83 age - 0.25 age²,   R² = .237,   (1)
                (11.1)   (12.5)     (10.0)

hours per week = -9.47 + 3.90 age - 0.07 age²,   R² = .075.   (2)
                 (0.6)   (2.7)      (2.0)

accidentally spend as much time in school as nonstudents work, this
method will yield accurate results. Table 4 gives some indication of the
value of this approach as an estimate of schooling time.


Method 3. 40 Times Weekly Pay of Working Youth

This measure, used by Schultz (1960), is conceptually partway toward
the direct measurement of school hours, since it standardizes for the
number of weeks in the school year. If youth attend school at the same
rate per week that they would otherwise work, and students do not work
at all and therefore are not included among working youth, this approach
would accurately measure schooling time costs. In table 7, the average
number of hours per week worked by male youth is presented in
column 1 and should be compared with a standard of 32.5 hours per
week (1,300/40).


Method 3a. 40 Times Weekly Pay of Nonstudents

This measure, used by Blitz (1962), is a derivative of Schultz’s technique.
If one assumes that during the school year one attends school at the rate
he would otherwise work, then the work efforts of nonstudents are the
proper measure of schooling. Looking at the work efforts of all youth
(in and out of school), as Schultz was forced to do with his data, should
lead to downward-biased estimates of school time. In table 7, average

¹⁰ Schultz (1960, p. 574) admits that ideally his measure should subtract the school
year earnings of students. This would be a more accurate measure of earnings differences
but a less accurate measure of school time costs, as I show below.





TABLE 8

Age Profiles of Various Estimates of School Hours

         Direct     Method    Method    Method    Method    Method
Age      Measure      1         2         2a        3         3a

14        1,300       212       370       494       784     1,272
15        1,300       366       576       768       968     1,348
16        1,300       511       764     1,018     1,136     1,420
17        1,300       647       935     1,246     1,284     1,488
18        1,300       773     1,088     1,450     1,412     1,548
19        1,300       890     1,223     1,631     1,520     1,604
20        1,300       998     1,341     1,789     1,608     1,652
21        1,300     1,096     1,443     1,923     1,676     1,696
22        1,300     1,185     1,526     2,035     1,724     1,736



TABLE 9

            Percentage Deviation from
             1,300-Hour School Year

Method      High School      College      Study in Which Used

1               -67            -28        Solmon (1971)
2               -49             -2        Becker (1964) and Hanoch (1967)
2a              -32             31        Hansen (1963)
3               -20             20        Schultz (1960)
3a                6             25        Blitz (1962)




weekly hours of nonstudents are reported in column 2. A quick
comparison of the two series suggests major differences only at the lower
age groups, where students form a substantial component of all working
youth.
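The arithmetic behind methods 3 and 3a can be sketched in a few lines. The snippet below is only an illustration, not the author's procedure: it assumes the weekly hours transcribed from table 7 and a 40-week school year, and the function names `method_3` and `method_3a` are hypothetical labels introduced here.

```python
# Weekly work hours of male youth in 1966, transcribed from table 7.
# Each entry maps age -> (all youth, nonstudent youth).
WEEKLY_HOURS = {
    14: (19.6, 31.8),
    15: (24.2, 33.7),
    16: (28.4, 35.5),
    17: (32.1, 37.2),
    18: (35.3, 38.7),
    19: (38.0, 40.1),
    20: (40.2, 41.3),
    21: (41.9, 42.4),
    22: (43.1, 43.4),
}

SCHOOL_WEEKS = 40  # weeks in the school year assumed by methods 3 and 3a


def method_3(age):
    """Annual hours implied by 40 weeks at the weekly hours of all youth."""
    return round(SCHOOL_WEEKS * WEEKLY_HOURS[age][0])


def method_3a(age):
    """Annual hours implied by 40 weeks at the weekly hours of nonstudents."""
    return round(SCHOOL_WEEKS * WEEKLY_HOURS[age][1])


# Print the two implied annual-hours profiles side by side.
for age in sorted(WEEKLY_HOURS):
    print(age, method_3(age), method_3a(age))
```

Under these assumptions the two profiles differ most at ages 14 to 16 and nearly coincide by age 22.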

Comparisons

In table 8, the estimated schooling hours-age profiles of the various
indirect techniques are compared with the measured 1,300-hour standard.
Recall that the direct measures are much firmer for high-school-age youth.
The biases caused by the different techniques can best be summarized,
perhaps, by measuring the deviation of each of the estimates from the
1,300-hour annual level for a full high school (ages 14-17) and college
(ages 18-21) education. These are presented in table 9. The results indicate
that the actual measurement of foregone earnings provides the least
useful method of estimating high school time costs, underestimating such
costs by almost 70 percent. The Becker-Hanoch assumption at this level
also substantially underestimates schooling time, yielding estimates
biased downward by about 50 percent. The original technique suggested by
Schultz and modified by Blitz provides the best estimates at this level.
At the college level, the estimates are generally more accurate, with the
Becker-Hanoch approach yielding estimates remarkably close to 1,300
hours per year. Three of the studies overestimate schooling costs at this















level, and substantially so. However, again bear in mind that the
1,300-hour school activity level for college youth is merely an extrapolation of
the measured values for high school students.
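The percentage deviations summarized in table 9 can be reproduced from the age profiles in table 8. The sketch below assumes that each deviation is the simple average of the annual estimates over the four high school ages (14-17) or the four college ages (18-21), measured against 1,300 hours and rounded to the nearest percent. The dictionary layout and the function name `pct_deviation` are illustrative choices.

```python
AGES = list(range(14, 23))
STANDARD_HOURS = 1300  # directly measured annual school hours

# Annual school-hour estimates by method, ages 14-22, transcribed from table 8.
TABLE_8 = {
    "1":  [212, 366, 511, 647, 773, 890, 998, 1096, 1185],
    "2":  [370, 576, 764, 935, 1088, 1223, 1341, 1443, 1526],
    "2a": [494, 768, 1018, 1246, 1450, 1631, 1789, 1923, 2035],
    "3":  [784, 968, 1136, 1284, 1412, 1520, 1608, 1676, 1724],
    "3a": [1272, 1348, 1420, 1488, 1548, 1604, 1652, 1696, 1736],
}


def pct_deviation(method, ages):
    """Percentage deviation of the mean estimate over `ages` from 1,300 hours."""
    values = [TABLE_8[method][AGES.index(age)] for age in ages]
    mean_hours = sum(values) / len(values)
    return round(100 * (mean_hours - STANDARD_HOURS) / STANDARD_HOURS)


HIGH_SCHOOL = range(14, 18)
COLLEGE = range(18, 22)

# One row per method: high school deviation, then college deviation.
for method in TABLE_8:
    print(method, pct_deviation(method, HIGH_SCHOOL), pct_deviation(method, COLLEGE))
```

Under that averaging assumption the computed values agree with the entries of table 9 for all five methods.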

In reviewing the results for high school students, it seems at first
ironic that the accuracy of the estimates is roughly inversely related
to the time sequence of their appearance. This process, however, is not entirely
a chance happening, since we are witnessing the increasingly accurate
measurement of the wrong variable. Differences in work hours between
students and nonstudents are not the same as the hours of schooling
undertaken by students. This paper has presented theoretical reasons
why that is so, as well as directly documenting it from measured school
hours. If anything, at the younger age levels, foregone leisure is much
more appropriate as a label for schooling time costs than is foregone
earnings.


References 

Becker, Gary S. Human Capital. New York: Columbia Univ. Press (for Nat. Bur.
Econ. Res.), 1964.

---. “The Allocation of Time and Goods over Time.” Unpublished
manuscript, 1967. (a)

---. Human Capital and the Personal Distribution of Income: An Analytical Approach.
W. S. Woytinsky Lecture no. 1. Ann Arbor: Inst. Public Admin. and Dept.
Econ., Univ. Michigan, 1967. (b)

Ben-Porath, Yoram. “The Production of Human Capital and the Life Cycle of
Earnings.” J.P.E. 75 (August 1967): 352-65.

Blitz, Rudolph C. “The Nation’s Educational Outlay.” In Economics of Higher
Education, edited by Selma J. Mushkin. Washington: Government Printing
Office, 1962.

Coleman, James S., et al. Equality of Educational Opportunity. Vols. 1 and 2.
Washington: Government Printing Office, 1966.

Educational Commission of the States. “State Action Extending the School
Year.” Compact 4 (December 1970): 38-44.

Gronau, Reuben. “The Intrafamily Allocation of Time: The Value of the
Housewives’ Time.” Mimeographed, November 1971.

Gustman, A. L. “Human Capital, Capital Theory, and Occupational Earnings.”
Unpublished Ph.D. dissertation, Univ. Michigan, 1969.

Hanoch, Giora. “An Economic Analysis of Earnings and Schooling.” J. Human
Resources 2 (Summer 1967): 310-29.

Hansen, W. Lee. “Total and Private Rates of Return to Investment in Schooling.”
J.P.E. 71 (April 1963): 128-40.

Lindsay, C. M. “Measuring Human Capital Returns.” J.P.E. 79 (December
1971): 1195-1215.

Marwardt, Frederick, and Sikkink, Donald E. “Student Preparation Time.”
Improving College and Univ. Teaching 16 (Autumn 1970): 306-9.

National Education Association, Research Division. NEA Research Bulletin
(March 1970), p. 27.

Parsons, Donald O. “Intergenerational Wealth Transfers and the Educational
Decisions of Male Youth.” Mimeographed, 1972.

Schultz, Theodore W. “Capital Formation by Education.” J.P.E. 68 (December
1960): 571-83.

---. “Rise in the Capital Stock Represented by Education in the United
States, 1900-57.” In Economics of Higher Education, edited by Selma J. Mushkin.
Washington: Government Printing Office, 1962.

Solmon, Lewis C. “Capital Formation by Expenditures on Education in 1960.”
J.P.E. 79 (December 1971): 1412-17.

Stafford, Frank. “The Income and Consumption of Graduate Students.”
Unpublished Ph.D. dissertation, Univ. Chicago, 1968.



Advertising and Profitability:
A Reappraisal


Harry Bloch

University of Manitoba


Previous empirical studies of the relationship between profitability,
market structure, and advertising intensity have consistently found a
strong positive relationship between profit rates and advertising intensity.
The profit rates used in these studies are based on treating advertising
expenditures as a current expense. Much evidence exists that advertising
has a long-lasting effect on sales, so that the treatment of advertising as
a current expense is inappropriate. In this paper, corrected profit rates
are calculated by treating advertising expenditures as an investment in
a capital asset. The corrected profit rates are then used in a multivariate
regression analysis of the relationship between profitability, market
structure, and advertising intensity. No significant relationship between
corrected profit rates and advertising intensity is found in the regressions.
This finding suggests that the positive relationships between uncorrected
profit rates and advertising intensity in previous studies are due solely to
the inappropriate expensing of advertising.


Beginning with a study by Comanor and Wilson (1967), there have been
a number of studies using multivariate regression analysis to investigate
the relationship between profit performance, advertising intensity, and
market structure. The general method of the studies is to use profit rates
equal to reported profit divided by reported net worth as the dependent
variable and measures of advertising intensity and market structure as
independent variables. The conclusion which is reached consistently is
that there is a strong positive relationship between profit rates and
advertising intensity.

The measures of profit rate used in the studies are subject to error due
to inappropriate accounting treatment of advertising in the calculation
of reported profit and net worth. Advertising expenditures are treated

I am indebted to Lester Telser, Marc Nerlove, Dale Orr, Maurice Levi, and Leonard
Weiss for constructive criticism and advice.



as a current expense, even though they are generally believed to have a
long-lasting impact on sales and should therefore be treated as an
investment. The presence of errors in the measures of profit rate used can cause
the results of regressions of the above type to be misleading.

Weiss (1969) finds that adjusting the profit rates of a sample of
industries to correct for the expensing of advertising does not much change the
strong positive relationship between profit rates and advertising intensity
(pp. 428-29). However, there is evidence that his adjustments are
inaccurate. We argue below that the questionable accuracy of the
adjustments is due to aggregation problems encountered in the use of
industry data. We find that when corrections for the expensing of
advertising are made to the profit rates of a sample of firms, the strong
positive relationship between profit rates and advertising intensity
disappears. This finding supports the view that the strong positive
relationships found in previously reported studies are due merely to the
expensing of advertising.


I. The Effects of Expensing Advertising

Studies which demonstrate that advertising has a long-lasting effect on
sales have been done by Nerlove and Waugh (1961), Telser (1962),
Palda (1964), Lambin (1970), and Peles (1971). Yet normal accounting
practice is to treat advertising expenditures as a current expense in the
calculation of reported profit and net worth. The assets on the balance
sheets of firms exclude the value of undepreciated advertising
expenditures, so that the reported net worth of the firms is understated. Also,
when firms have current expenditures on advertising which differ from
the amount of depreciation of their advertising asset, the reported profit
of the firms is misstated.

The effect that the expensing of advertising has on a measure of profit
rate equal to reported profit divided by reported net worth can be stated
in terms of the difference between measured and “true” profit rates. The
“true” profit rate is defined for our purposes as the rate which would
prevail if advertising expenditures were treated as an investment and
capitalized over their full economic lives.¹ Understatement of reported
net worth causes the measured profit rate to be above the true profit rate.
However, this effect can be partially or totally offset by positive net

¹ The “true” profit rate which we use is not corrected for all inappropriate accounting.
In addition to advertising, expenditures on research and development, product
promotion, and on-the-job training are treated as current expenses, even though they can be
expected to have long-lasting effects. No explicit treatment is given to the inappropriate
accounting of these additional classes of expenditure because of insufficient data to
calculate the effect on profit rates. Thus, our true profit rate is still subject to error, but our
purpose is merely to remove spurious correlation between advertising and profitability
due to the inappropriate expensing of advertising expenditures.




investment in advertising, which causes the measured rate to be below
the true rate.²

The general method of studies which have investigated the relationship
between profit performance, advertising intensity, and market structure
is to calculate regressions of the following type:

$$\pi/E = a_0 + \sum_{i=1}^{n} a_i X_i + \varepsilon, \qquad (1)$$

where $\pi/E$ is the measured profit rate equal to reported profit divided by
reported net worth, the $a$’s are estimated coefficients, the $X_i$’s are measures
of advertising intensity and market structure, and $\varepsilon$ is a residual term.
The above discussion suggests that the measured profit rate used is
composed of a true profit rate component and an error term component.
The presence of an error term in the dependent variable can cause
regression coefficients to reflect inaccurately the relationships between
the independent variables and profitability.³ In particular, the coefficient
of advertising intensity is likely to be affected, because advertising
intensity can be expected to be correlated with the amount of advertising
asset and net investment in advertising. Telser (1969) has suggested that
advertising intensity may serve as a proxy for omitted intangible capital
in the form of advertising, so that the coefficient of advertising intensity in
regressions of type (1) is biased upward (pp. 122-23). A positive coefficient
of advertising intensity in regressions of type (1) can indicate a positive
relationship between advertising intensity and the error term, rather
than indicating that advertising increases profitability. To determine the
relationship between advertising and true profitability, it is necessary to
correct for the expensing of advertising in the calculation of measured
profit rates.
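The two opposing effects described in this section can be illustrated numerically. The sketch below assumes the adjustment takes the form that Section II attributes to Weiss's calculations, true rate = (profit + net investment in advertising) / (net worth + advertising asset). The firm figures are hypothetical and the function names are illustrative.

```python
def true_profit_rate(profit, net_worth, ad_asset, net_ad_investment):
    """True rate: (pi + dZ) / (E + Z), with advertising treated as an asset."""
    return (profit + net_ad_investment) / (net_worth + ad_asset)


def measured_from_true(true_rate, net_worth, ad_asset, net_ad_investment):
    """Measured rate recovered as (pi*/E*)(1 + Z/E) - dZ/E."""
    return true_rate * (1 + ad_asset / net_worth) - net_ad_investment / net_worth


# Hypothetical firm: reported profit 12, reported net worth 100,
# undepreciated advertising 25 (none of these figures come from the paper).
PROFIT, NET_WORTH, AD_ASSET = 12.0, 100.0, 25.0
measured = PROFIT / NET_WORTH

for net_ad_investment in (0.0, 3.0, 6.0):
    true_rate = true_profit_rate(PROFIT, NET_WORTH, AD_ASSET, net_ad_investment)
    rebuilt = measured_from_true(true_rate, NET_WORTH, AD_ASSET, net_ad_investment)
    # Show the true rate and the measured rate rebuilt from it for each case.
    print(net_ad_investment, round(true_rate, 3), round(rebuilt, 3))
```

With no net investment the measured rate (0.12) exceeds the true rate (0.096). With net investment of 3 the two coincide, and with net investment of 6 the measured rate falls below the true rate (0.144), which is the partial or total offset the text describes.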


II. Weiss’s Study

Weiss (1969) investigates the effect that the expensing of advertising has
on coefficients in regressions of type (1). He does this by comparing
coefficients obtained from regressions of type (1) to coefficients obtained
from regressions of the following type:

$$\pi^*/E^* = a_0' + \sum_{i=1}^{n} a_i' X_i + \varepsilon', \qquad (2)$$

² If we assume that advertising expenditures grow at a constant rate over time, we can
solve for the difference between measured and true profit rates in terms of the rate of
growth of advertising expenditures, the rate of depreciation of advertising expenditures,
and the level of the measured profit rate. A derivation of this relationship is given by
Telser (1968, p. 167) and by Weiss (1969, pp. 421-23).

³ For a discussion of the effects of errors in the measurement of the dependent variable
on the coefficients in classical least-squares regressions, see Johnston (1963, pp. 148-62)
or Goldberger (1964, pp. 285M4).





where $\pi^*/E^*$ is an estimate of the true profit rate derived from the
measured profit rate, $\pi/E$, by treating advertising expenditures as an
investment rather than a current expense, and the $X_i$’s are the same
independent variables used in the corresponding regression of type (1).
He finds the regression coefficients in (2) are close to the corresponding
coefficients in (1). The coefficients of advertising intensity are only
moderately lower in the regressions of type (2), and they are still greater
than zero at the 1 percent significance level using a one-tailed t-test.
These findings suggest that the expensing of advertising does not affect the
relationship between profit rates and advertising intensity, and that the
coefficient of advertising intensity in regressions of type (1) provides an
accurate indication of the relationship between advertising and
profitability.

In calculating estimates of true profit rates for use in regressions of
type (2), it is necessary to make assumptions regarding the manner in
which advertising expenditures depreciate over time. In studies which
have attempted to estimate the pattern of depreciation of advertising
expenditures, the estimated pattern varies substantially with the type of
product advertised and the method used in estimation.⁴ This disparate
evidence on depreciation patterns and problems of aggregation which
are explained below make it important to check the accuracy of estimated
true profit rates.

A test of the accuracy of the estimates of true profit rates used in
regressions of type (2) is possible by making use of the relationship
between measured and true profit rates. Weiss’s estimates of true profit rates
are derived from calculations of the following form:

$$\pi^*/E^* = (\pi + \Delta Z)/(E + Z),$$

where $\pi^*/E^*$ = the true profit rate for a period of time,
$\pi$ = the reported profit for the period, $\Delta Z$ = net investment in
advertising during the period, $E$ = reported net worth at the end of the
period, and $Z$ = accumulated undepreciated advertising expenditures at the
end of the period.⁵ Using this expression, the measured profit rate, $\pi/E$,
can be expressed as

$$\pi/E = (\pi^*/E^*)(1 + Z/E) - \Delta Z/E.$$

If the relationship between the $X_i$’s and true profit rates is properly
specified in regressions of type (2), the relationship between measured
profit rates and the same $X_i$’s is properly specified in regressions of the
following type:

$$\pi/E = a_0''(1 + Z/E) + \sum_{i=1}^{n} a_i''\,[X_i(1 + Z/E)] + a_{n+1}''(\Delta Z/E) + \varepsilon''. \qquad (3)$$

The dependent variable in regressions of type (3) is the same as in
regressions of type (1), but the independent variables are altered to
correct for the expensing of advertising. If altering the independent
variables in a regression of type (3) reduces errors associated with the
expensing of advertising, the corrected $R^2$ for the regression should be
higher than the corrected $R^2$ for a corresponding regression of type (1).
Thus, a comparison of the corrected $R^2$ provides a test of the accuracy
of the adjustments made in calculating estimates of the true profit rates.⁶
A further test of the accuracy of the adjustments is given by the value of
the coefficient $a_{n+1}''$ in the regressions of type (3). The assumed value of
this coefficient in Weiss’s calculation of estimates of the true profit rates
is $-1$, so that $a_{n+1}''$ should have a value of $-1$.

We calculate regressions of type (3) with the data on measured profit
rates, advertising intensity, and market structure which Weiss uses in
his regressions and with values of $Z/E$ and $\Delta Z/E$ implied by his
calculations of true profit rates. The results of these regressions are listed on
lines 5 and 6 of table 1. Also listed in table 1 are the results presented by
Weiss for the corresponding regressions of type (2), listed on lines 3 and 4,
and the corresponding regressions of type (1), listed on lines 1 and 2.
The corrected $R^2$ for each of the regressions of type (3) is below the
corrected $R^2$ for the corresponding regression of type (1), and the values
of the coefficient of $\Delta Z/E$ differ substantially from $-1$.⁷ Thus, Weiss’s
estimates of true profit rates do not pass the test of accuracy established
above.

⁴ By calculating regressions involving current and lagged market shares, Telser (1962)
estimated that the advertising expenditures of cigarette firms depreciated at a constant
annual rate varying between 15 and 20 percent (p. 498). By calculating regressions
involving current and lagged absolute sales, Palda (1964) estimated that the advertising
expenditures for Lydia Pinkham Vegetable Compound depreciated at a constant annual
rate of 33 percent (pp. 60-68). By calculating regressions of similar form to Telser’s,
Lambin (1970) estimated that the advertising expenditures for an unspecified consumer
durable depreciated at a constant annual rate between 29 and 60 percent (p. 475).

⁵ Weiss’s formula for calculating true profit rates is as follows:

,, n. + A, - (I - i) i (A'A,.t) - i>A,.,

£,+ i i^*‘A,.,

t»0

where $A_t$ is advertising expenditures in year $t$ and $\lambda$ is one minus the annual rate of
depreciation of advertising expenditures. The values of $Z$ and $\Delta Z$ implied by this formula


are as follows:

HZ = A. - - 1) Z (-1%-.) - 

Z - I 

1*0 

⁶ If measured profit rates are subject to error due to inappropriate expensing of
advertising, regressions of type (1) are improperly specified. The independent variables are
altered in regressions of type (3) to correct for this improper specification. The improper
specification will not be corrected if inaccurate values of $Z/E$ and $\Delta Z/E$ are used in
regressions of type (3). The size of the residual variances provides a test for an
improvement in specification in going from (1) to (3). If a regression of type (3) is properly
specified while the corresponding regression of type (1) is improperly specified, the
residual variance for the regression of type (3) should be smaller, according to Theil (1971,
p. 543). As the total variance is the same in corresponding regressions of types (1) and
(3), the corrected $R^2$ calculated according to the formula in n. 7 below is inversely related
to the residual variance.

⁷ The corrected $R^2$ listed on lines 1 through 4 of table 1 are not those given by Weiss
(1969, p. 429). Rather, to obtain equivalent values of $R^2$ for all the regressions in table 1,
all the $R^2$ were calculated using the following conventional formula for corrected $R^2$:
$1 - (1 - R^2)(n - 1)/(n - k - 1)$, where $n$ is the number of observations and $k$
is the number of explanatory variables.
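The conventional corrected R² quoted in n. 7 is simple to compute. The sketch below is a minimal illustration; the function name `corrected_r2` and the example inputs are assumptions made here, and only the sample size of 37 industries comes from the text.

```python
def corrected_r2(r2, n_obs, n_explanatory):
    """Corrected R-squared: 1 - (1 - R^2)(n - 1)/(n - k - 1)."""
    degrees_of_freedom = n_obs - n_explanatory - 1
    if degrees_of_freedom <= 0:
        raise ValueError("need more observations than explanatory variables plus one")
    return 1 - (1 - r2) * (n_obs - 1) / degrees_of_freedom


# Hypothetical example: an uncorrected R^2 of .50 from a regression on
# 37 observations with 4 explanatory variables.
print(corrected_r2(0.50, 37, 4))
```

Because the total variance is the same for regressions that share a dependent variable and sample, a higher corrected R² under this formula corresponds to a smaller residual variance, which is the comparison n. 6 relies on.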





The data used in calculating the regressions in table 1 consist of
observations on 37 U.S. consumer-goods industries. Weiss measures the
industry-level value of $Z$ by the sum of current and past advertising
expenditures of all firms in an industry minus the accumulated
depreciation of these expenditures. He measures the industry-level value of $\Delta Z$
by the sum of current advertising expenditures by all firms in an industry
minus current depreciation of $Z$. These procedures are equivalent to
taking the sum, for all firms in an industry, of each firm’s $Z$ and $\Delta Z$,
valued as the firm’s own advertising expenditures minus depreciation.
Values of $Z$ and $\Delta Z$ calculated in this manner tend to overstate an
industry’s advertising asset and net investment in advertising whenever
firm advertising expenditures have long-lasting negative effects on the
demand facing other firms in the industry.⁸ The limited amount of
evidence available indicates that these negative effects are substantial,
which implies that there is a serious aggregation problem in calculating
industry-level advertising assets and net investments in advertising.⁹
Improper aggregation of firm advertising expenditures could explain
the apparent inaccuracy of Weiss’s estimates of industry true profit
rates.¹⁰ In any event, the possibilities of improper aggregation add to
the difficulties of obtaining accurate estimates of true profit rates at the
industry level.


III. Results Using Firm Data

The aggregation problems encountered in obtaining accurate estimates
of true profit rates for industries can be lessened by using estimates of
true profit rates for firms.¹¹ Firm profit-rate data are used in three

⁸ When a firm’s advertising has long-lasting negative effects on its competitors, the
firm’s advertising creates a negative asset for the competitors as well as a positive asset for
the firm itself. This means that the addition to an industry’s advertising assets from each
firm’s advertising is smaller than the addition to the firm’s own advertising asset. Thus,
when the values of $Z$ and $\Delta Z$ for the firms in an industry are appropriately measured,
and when each firm’s advertising has negative effects on its competitors, values of $Z$ and
$\Delta Z$ for the industry calculated as the simple sum of the values of $Z$ and $\Delta Z$ for firms in
the industry overstate the industry’s advertising asset and net investment in advertising.

⁹ Telser (1962) supplies evidence on the negative effects of cigarette-firm advertising
on other cigarette firms (pp. 463-87). Peles (1971) supplies similar evidence for the
advertising of cigarette, beer, and automobile firms (pp. 39-43, 72-73). Finally, Lambin
(1970) supplies evidence of the negative effects of competitors’ advertising on the demand
for an unspecified consumer durable (pp. 479-80).

¹⁰ Another explanation of the apparent inaccuracy is that it is due to assuming the
wrong rate of depreciation of advertising expenditures. Attempts to improve the accuracy
of the estimates of true profit rates by using alternative assumptions about the rate of
depreciation met with no success.

¹¹ Even when a firm’s advertising has long-lasting negative effects on its competitors,
the value of the advertising asset created for the firm will usually include only the positive
effects on the firm’s own future profits. An exception is where the firm’s advertising leads
its competitors to increase their advertising, if this induced advertising has a
long-lasting negative effect on the first firm’s future profits.




studies of the relationship between profit rates, advertising intensity,
and market structure: Federal Trade Commission (1969), Imel and
Helmberger (1971), and Shepherd (1972). However, a sufficient amount
of data on advertising expenditures over time, to allow reasonable
calculations of values of $Z$ and $\Delta Z$, is available only for the firms in
the study by the Federal Trade Commission, and even so for only some
of the firms. Our investigation of the effect of inappropriate expensing
of advertising on the coefficients in regressions of type (1) is therefore
based on a subsample of the FTC firms.

In the FTC study, regressions of type (1) are calculated using data for
97 U.S. food manufacturing firms. The profit rates used in the regressions
are simple averages of annual profit rates for each of the 5 years from
1949 to 1953. Two profit rates are used: a profit rate $P_1$ which is equal to
reported net income after taxes plus interest on long-term debt divided
by reported net worth plus long-term debt, and a profit rate $P_2$ which is
equal to reported net income after taxes divided by reported net worth.
For each profit rate, linear and nonlinear regressions of type (1) in both
unweighted and weighted form are calculated, using as independent
variables advertising-to-sales ratios, concentration ratios, relative market
shares, percentage changes in industry demand, one over the log of firm
assets, and alternatively, three-digit-, four-digit-, and five-digit-firm
diversification ratios. For the purpose of investigating the effect of the
expensing of advertising on the regression coefficients, we use unweighted
linear regressions with five-digit diversification ratios.¹² The results
listed by the FTC for the regressions in this form are repeated here on
lines 1 and 2 of table 2.

The data used in the calculation of the FTC regressions are listed in
appendix C to their study. To check if the data are consistent with the
regression results listed on lines 1 and 2 of table 2, regressions of identical
form are calculated using the data. The results of these latter regressions
are listed on lines 3 and 4 of the table. While the results of corresponding
regressions of identical form differ, the strong positive relationship
between profit rates and advertising-to-sales ratios is not much affected.

The advertising-to-sales ratios used by the FTC are found by taking
averages of the advertising-to-sales ratios for all the industries in which
a firm operates. Because the implied measures of advertising expenditure
do not reflect the actual expenditures of the firms, they are not appropriate

¹² The choice of linear rather than nonlinear regressions was based on a desire to avoid
the multicollinearity which arises when several different powers of the same explanatory
variable are used in a regression. Regressions with five-digit-firm diversification ratios
were chosen because these regressions had $R^2$’s which were uniformly higher than the
$R^2$’s for the corresponding regressions with a three- or four-digit-firm diversification ratio.
Finally, unweighted regressions were chosen because heteroscedasticity was not present
in the residuals of the regressions using the 40-firm subsample on which the conclusions
of this study are based.



[TABLE 2: regression results for the FTC food manufacturing firms, lines 1-8 as cited in the text. The table body is not legible in the source.]





for calculating estimates of true profit rates for the firms. For 40 out of
the 97 firms in the FTC regression sample, data on firm advertising
expenditures in six major media were available for the period 1936
through 1950 inclusive, and for the individual years 1950, 1951, 1952,
and 1953.¹³ Our investigation of the effects of the expensing of
advertising on the coefficient of advertising intensity in regressions of type (1)
is therefore based on the 40-firm subsample. To check whether use of
the 40-firm subsample distorts the results of the investigation, regressions
in the same form as the regressions on lines 1-4 of table 2 are calculated
for both the 40-firm subsample of included firms and the 57-firm
subsample of excluded firms. The results of the 57-firm regressions are
listed on lines 5 and 6 of table 2, and the results of the 40-firm regressions
are listed on lines 7 and 8. The coefficients of the advertising-to-sales
ratio are higher in the 40-firm regressions than in either the 57-firm
regressions or the 97-firm regressions.

The data on advertising expenditures in six major media for the 40-firm
subsample are used to calculate measures of $Z$ and $\Delta Z$ by firm for each
of the years 1950 through 1953.¹⁴ These calculations assume a 5 percent
per annum rate of depreciation of advertising capital. This rate of
depreciation is chosen because it implies values of $Z$ capable of explaining
variations in the relationship between the market value and book value
of the stock of firms in the regression sample.¹⁵ The calculated measures

¹³ The data on advertising expenditures in the six media for the years 1951, 1952, and
1953 are given in the October 29, 1954 issue of Printers’ Ink, while the data on advertising
expenditures for the year 1950 and for the period 1936-50 inclusive are given in the
November 23, 1951 issue. The six media for which advertising expenditures are included
are network television, network radio, magazines, newspapers, national farm papers, and
Sunday magazine supplements.

Ihe first step in calculating measures of Z and AZ was to solve for values of r, the 
compounded ratio of adverusing expenditures in year 1 — 1 to advertising expenditures 
in year t These values were found using the following formula 

<-1*3» 1 — r 

where A, is advertising expenditures in year t Using the derived values of r, measures of 
Z and AZ were then calculated from the following formulas, where i. is one minus the 
annual rate of dcpreaation of advertising expenditures Ztno = ~ >b')]iii>so> 

Z, = l(A, + Z,.,) for t - 1951, 1952, and 1953, &Z, A, - [(1 - X)IX)]Z, for 
/ = 1950, 1951, 1952, and 1953 
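The calculation described in this note can be sketched in Python. This is only an illustration under stated assumptions: it reads r as a constant ratio of each year's advertising to the next year's, so that the 1936-50 total equals $A_{1950}(1 + r + \dots + r^{14})$; the function names and the sample figures are hypothetical and are not taken from Printers' Ink.

```python
def solve_r(total_1936_50, a_1950, n_years=15, tol=1e-12):
    """Find r such that a_1950 * (1 + r + ... + r**(n_years - 1)) equals the total."""
    target = total_1936_50 / a_1950
    if target < 1.0:
        raise ValueError("the 15-year total cannot be below the 1950 outlay")

    def geometric_sum(r):
        return sum(r ** k for k in range(n_years))

    lo, hi = 0.0, 1.0
    while geometric_sum(hi) < target:      # widen the bracket when r exceeds 1
        hi *= 2.0
    while hi - lo > tol:                   # bisection: the sum is increasing in r
        mid = 0.5 * (lo + hi)
        if geometric_sum(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def advertising_capital(ads, total_1936_50, depreciation=0.05):
    """ads maps each year 1950..1953 to advertising outlays; returns (Z, dZ) by year."""
    lam = 1.0 - depreciation               # lambda = one minus the depreciation rate
    r = solve_r(total_1936_50, ads[1950])
    if lam * r >= 1.0:
        raise ValueError("lambda * r must be below one for Z_1950 to be finite")
    z = {1950: lam / (1.0 - lam * r) * ads[1950]}
    for year in (1951, 1952, 1953):
        z[year] = lam * (ads[year] + z[year - 1])
    dz = {year: ads[year] - (1.0 - lam) / lam * z[year] for year in z}
    return z, dz
```

For 1951 through 1953 the value of `dz[year]` coincides with `z[year] - z[year - 1]`, which is a quick consistency check on the two recursions.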

¹⁵ The treatment of advertising as a current expense means that the amount of
undepreciated advertising expenditures is not included as an asset on balance sheets, so
that the value of a firm's advertising asset is not reflected in the book value of the firm's
stock. The value of the firm's advertising asset does, however, tend to be reflected in the
market value of the firm's stock. Thus, accurate measures of Z for firms should be highly
correlated with the difference between the book value and market value of the firms'
stock. We calculate regressions of the following form for 37 of the firms in the 40-firm
subsample: $\log (M/E) = b_0 + b_1 \log [(E + Z)/E] + b_2 \log [\pi/(E + Z)] + \varepsilon$, where
M is the market value of the firm's stock. The values of Z used in the regressions are
calculated using, alternatively, 5, 10, 20, and 33 percent annual rates of depreciation of
advertising expenditures. The $R^2$ for the regressions decreased continually from .679 for





of Z and $\Delta Z$ are used to find estimates of the two average true profit
rates, $P_1^*$ and $P_2^*$, corresponding to the measured profit rates, $P_1$ and $P_2$.¹⁶
Because advertising expenditure data are not available for the year 1949,
the period over which the average profit rates are calculated is limited
to the 4 years 1950 through 1953, rather than the 5 years 1949 through
1953 used in the FTC study. The formulas used in finding the estimates
of true profit rates are equivalent to those used by Weiss, except for the
treatment of net investment in advertising.¹⁷ Weiss adds the full amount
of net investment in advertising to the numerator of his estimates of true
profit rates. We add only the portion of net investment in advertising
which is not offset by a reduction in taxes. Our purpose is to have estimates
of true profit rates which reflect what profits would have been if
advertising had been treated as an investment rather than expensed. Our
estimates of $P_1^*$ and $P_2^*$ for the 40 firms are listed in table A1, along with
the corresponding values of $P_1$ and $P_2$.


the regression with the 5 percent depreciation rate to .674 for the regression with the
33 percent depreciation rate. The t-ratio for the coefficient $b_1$ of $\log [(E + Z)/E]$ decreased
continually from 4.79 for the 5 percent regression to 1.62 for the 33 percent regression.
¹⁶ The average profit rates were calculated using the following formulas:

$$P_1 = \Bigl(\sum_{t=1950}^{1953} \pi_{1t}\Bigr) \Big/ \Bigl(\sum_{t=1950}^{1953} E_{1t}\Bigr), \qquad
P_2 = \Bigl(\sum_{t=1950}^{1953} \pi_{2t}\Bigr) \Big/ \Bigl(\sum_{t=1950}^{1953} E_{2t}\Bigr),$$

$$P_1^* = \frac{\sum_{t=1950}^{1953} [\pi_{1t} + (1 - m_t)\,\Delta Z_t]}{\sum_{t=1950}^{1953} (E_{1t} + Z_t)}, \qquad
P_2^* = \frac{\sum_{t=1950}^{1953} [\pi_{2t} + (1 - m_t)\,\Delta Z_t]}{\sum_{t=1950}^{1953} (E_{2t} + Z_t)},$$

where $\pi_{1t}$ = reported net income after taxes plus interest on long-term debt in year t,
$E_{1t}$ = reported net worth plus long-term debt in year t, $\pi_{2t}$ = reported net income
after taxes in year t, $E_{2t}$ = reported net worth in year t, and $m_t$ = the marginal
corporate income tax rate in year t. The values of $m_t$ used in the calculations include all
income tax surcharges and excess profits taxes in effect during the years 1950 through
1953. To allow for the fact that the fiscal years of the firms ended at various times between
June 30 of year t and June 29 of year t + 1, the values of $m_t$ were calculated by taking
a simple average of the marginal corporate income tax rates in effect at the end of each
month from June of year t to May of year t + 1, as indicated by the Internal Revenue
Service's Statistics of Income (Washington: Government Printing Office), for the years
1950 through 1954: $m_{1950} = 0.57$, $m_{1951} = 0.80$, $m_{1952} = 0.82$, and $m_{1953} = 0.79$.
Values of reported net income after taxes, interest on long-term debt, and reported net
worth were taken from Moody's Industrials for the years 1950 through 1954.

¹⁷ In the numerator of the estimates of true profit rates $\Delta Z_t$ is multiplied by $(1 - m_t)$
because the profit rates are calculated on an after-tax basis. The expensing of advertising
results in reported profits before taxes being lower by one dollar for each dollar of net
investment in advertising. When reported profits are lower, so are income taxes. Hence,
the expensing of advertising has the effect of lowering reported profits after taxes by the
amount of net investment in advertising times one minus the applicable income tax rate.
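A minimal Python sketch of the profit-rate calculations in nn. 16 and 17 follows. The tax rates are the values of $m_t$ reported in n. 16; the function names and the sample firm data used to exercise them are hypothetical.

```python
# Marginal corporate tax rates m_t as reported in n. 16.
TAX_RATES = {1950: 0.57, 1951: 0.80, 1952: 0.82, 1953: 0.79}


def measured_rate(income, capital):
    """Measured rate: summed income over summed capital, 1950 through 1953."""
    return sum(income.values()) / sum(capital.values())


def true_rate(income, capital, z, dz, tax=TAX_RATES):
    """Adjusted rate: add the after-tax part of net advertising investment to
    income and the advertising asset Z to the capital base."""
    numerator = sum(income[t] + (1.0 - tax[t]) * dz[t] for t in income)
    denominator = sum(capital[t] + z[t] for t in income)
    return numerator / denominator


# Average of (1 - m_t): the weight applied to net investment in advertising.
average_after_tax_share = sum(1.0 - m for m in TAX_RATES.values()) / len(TAX_RATES)
```

With these rates `average_after_tax_share` equals 0.255 up to floating-point rounding, the figure that later appears as the implied coefficient in regressions of type (3).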




The 4-year average measured profit rates, $P_1$ and $P_2$, are used in
calculating regressions of type (1) identical with those on lines 7 and 8
of table 2, except for the slightly shorter period over which the profit
rates are averaged. The results of these regressions are listed on lines 1
and 2 of table 3. On lines 3 and 4 of table 3 are listed the results of
regressions of type (2), which have the estimated true profit rate, $P_1^*$ or $P_2^*$,
substituted for the corresponding measured profit rate as the dependent
variable. The coefficient of the advertising-to-sales ratio is substantially
lower in the regressions of type (2) than in the corresponding regression
of type (1). This result provides evidence that the coefficient of the
advertising-to-sales ratio in regressions of type (1) overstates the
relationship between advertising and profitability. Furthermore, the fact that
the coefficients of the advertising-to-sales ratio in the regressions of type (2)
are not greater than zero at the 5 percent significance level using a
one-tailed t-test indicates that there is actually no measurable relationship
between advertising and profitability.

The reliability of the above evidence depends on the accuracy of the
estimates of true profit rates used in the regressions of type (2). According
to the analysis of section II above, the accuracy of the estimates can be
checked by calculating regressions of type (3) corresponding to the
regressions of types (1) and (2). The results of such regressions of type (3)
are listed in lines 5 and 6 of table 3. The corrected $R^2$ for these regressions
of type (3) are higher than the corrected $R^2$ for the corresponding
regressions of type (1). Also, the coefficients of $\Delta Z/E$ have values close
to those assumed in the calculation of the estimates of true profit rates,
although these coefficients do not differ from zero at the 5 percent
significance level using a two-tailed t-test.¹⁸ These results conform
reasonably well to what would be expected from accurate estimates of
true profit rates.

The advertising-to-sales ratios used in the regressions of table 3 are
computed by the FTC from industry average advertising-to-sales ratios.
These computed advertising-to-sales ratios do not reflect the actual
ratios of advertising expenditure to sales for the individual firms in the
regression sample, and may therefore be viewed as inappropriate. An
alternative measure of the advertising-to-sales ratio is computed by
dividing firm advertising expenditures in six major media for the years
1950 through 1953 inclusive by firm sales for the same period.¹⁹ The
alternative advertising-to-sales ratios are used in calculating regressions
of types (1)-(3) identical with those in table 3, except for the change in


¹⁸ The average value of $(1 - m_t)$ used in calculating true profit rates according to
n. 16 above is 0.255. In regressions of type (3), this implies a coefficient of $\Delta Z/E$ of -0.255.
¹⁹ The data on advertising expenditures in the six media were taken from the Printers' Ink
issues of October 29, 1954 and November 23, 1951, and the data on net sales after excise
taxes were taken from Moody's Industrials for the years 1950 through 1954.





the measure of the advertising-to-sales ratio. The results of these
regressions, which are listed in table 4, do not differ in any consistent way
from the results of the regressions listed in table 3, except for the somewhat
lower coefficients of the advertising-to-sales ratio in table 4.

The regressions in tables 3 and 4 are calculated based on the assumption
that advertising expenditures depreciate at a rate of 5 percent per annum.
Given that there is uncertainty regarding the actual depreciation rate,
it is important to check the sensitivity of our results to changes in the
assumed depreciation rate. In his study, Weiss assumes advertising
expenditures depreciate at 33.3 percent per annum. We calculate values
of Z, $\Delta Z$, $P_1^*$, and $P_2^*$ for the subsample of 40 food manufacturing firms
assuming a 33.3 percent advertising depreciation rate and then use these
values to calculate regressions similar to those in table 4. The results of
these regressions are listed in table 5. The regressions are numbered with
primes, so that each regression corresponds to the unprimed regression
of the same number in table 4.

The higher assumed advertising depreciation rate in the regressions
of table 5 is associated with substantially higher coefficients for the
advertising-to-sales ratio. However, these coefficients are not greater
than zero at the 5 percent significance level using a one-tailed t-test.
Also, the corrected $R^2$ for the regressions of type (3) in table 5 are below
the corrected $R^2$ for the corresponding regressions in table 4. In section II
above, we show that the corrected $R^2$ in regressions of type (3) provides
a test of the accuracy of the adjustments made in calculating true profit
rates. Hence, we have evidence that calculations of true profit rates based
on an assumed 33.3 percent annual advertising depreciation rate are
less accurate than calculations based on an assumed 5 percent rate, at
least for the firms in the 40-firm subsample.

In both table 3 and table 4 there are substantial differences between
the coefficient of the advertising-to-sales ratio in the regressions of type (1)
and the coefficient of the advertising-to-sales ratio in the corresponding
regressions of type (2) or (3). This finding provides evidence that the
coefficient of the advertising-to-sales ratio in regressions of type (1)
overstates the relationship between advertising and profitability. While
a strong positive relationship between profit rates and advertising
intensity is found in the regressions of type (1), the relationship disappears
in the regressions of types (2) and (3) when corrections are made for
the expensing of advertising. Switching from the use of measured profit
rates as the dependent variable in regressions of type (1) to the use of
true profit rates as the dependent variable in regressions of type (2)
affects the coefficient of no other variable as strongly or persistently as it
affects the coefficient of the advertising-to-sales ratio. Therefore, it is
appropriate to view the advertising-to-sales ratio in the regressions of
type (1) as a proxy for errors in measured profit rates resulting from the
expensing of advertising.












IV. Conclusions

Previous studies investigating the relationship between measured profit
rates, advertising intensity, and market structure have consistently found
a strong positive relationship between profitability and advertising
intensity. Advertising intensity has generally been treated as a proxy
for the degree of product differentiation, indicating that product
differentiation is associated with high profitability. An alternative
explanation for finding a strong positive relationship between measured
profitability and advertising intensity is that advertising intensity is a
proxy for errors in measured profit rates resulting from the expensing of
advertising. The evidence presented above supports the latter explanation
rather than the former. Although the evidence is based on only one
relatively small sample of firms, it clearly indicates the need for caution
in interpreting the results of regressions using measured profit rates as
the dependent variable.

The relationship between advertising and profitability has attracted
attention from policymaking bodies such as the FTC because of the
implications for a public policy aimed at maintaining the maximum
degree of competition. If advertising, by differentiating products, acts as a
barrier to entry, a positive relationship between advertising and
profitability can be expected. The strong positive relationship between
measured profit rates and advertising intensity found in previous studies
of the relationship between advertising and profitability has lent support
to the view that a corrective policy toward advertising is needed to
maintain competition. The evidence presented above indicates that a
strong positive relationship between measured profit rates and advertising
intensity disappears when the profit rates are adjusted to correct for the
expensing of advertising. Thus, the view that a positive relationship
between measured profit rates and advertising intensity indicates the
need for corrective policy toward advertising seems unwarranted.

It is interesting to note that, while adjusting measured profit rates to
correct for the expensing of advertising casts doubt on the need for a
corrective policy toward advertising to maintain competition, the need
for a different kind of corrective policy toward advertising is clearly
indicated. The treatment of advertising expenditures as a current expense
rather than as an investment in the calculation of reported profits reduces
the reported profits of firms that make positive net investments in
advertising. The tax payments of such firms are reduced because the level of
tax payments varies directly with the level of reported profits.²⁰ The

²⁰ The reduction in current tax payments gives rise to a deferred tax liability which is
incurred when the firm subsequently reduces its investment in advertising. For a firm
which continually increases its advertising expenditures there is never a subsequent
reduction in its advertising investment, so the deferred tax liability is never incurred.




data on net investments in advertising and marginal corporate income
tax rates used in calculating estimated true profit rates for 40 food
manufacturing firms imply that over the 4 years, 1950 through 1953, the
40 firms were able to avoid a total of $373 million of taxes due to the
expensing of advertising for tax purposes. The average annual tax
avoidance over the 4 years for each of the 40 firms is listed in table A1.
The tax avoidance amounts to an average of more than $2 million per
firm per year. The ratio of tax avoidance to advertising expenditures is
0.46, which is the equivalent of a 46 percent subsidy on advertising
expenditures.

Weiss (1969) proposes that to eliminate the implicit subsidy on
advertising the capitalization of advertising expenditures for tax purposes
should be enforced in cases where the advertising expenditures have
long-lasting effects (pp. 429-30). The size of the implicit subsidy on the
advertising of the 40 food manufacturing firms indicates the importance
of the subsidy for firms that have large amounts of long-lasting
advertising, and thus provides evidence in support of Weiss's proposal for
tax reform.

The amount of tax avoidance was calculated by multiplying total net investment in
advertising for the 40 firms in each year by the prevailing marginal corporate income
tax rate and then summing over the 4 years. For the marginal tax rates used, see n. 16
above.

The amount of tax avoidance during the years 1950 through 1953 was to a degree
uncharacteristically high due to the imposition of excess profits taxes after the beginning
of the Korean war. Assuming that the corporate income tax rate had been a normal 52
percent during the period and that the same level of advertising expenditures prevailed,
the total tax avoidance for the firms would have been $249 million and the ratio of tax
avoidance to advertising expenditures would have been 0.31. Thus, even with normal
income tax rates the 40 food manufacturing firms would have been receiving the
equivalent of a 31 percent subsidy on their advertising expenditures.
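The arithmetic behind these figures can be checked with a few lines of Python. The inputs are the rounded totals quoted in the text and notes; the level of advertising expenditures is not reported directly, so the value computed here is only what the two rounded figures imply, and the variable names are illustrative.

```python
total_avoidance = 373.0     # $ millions, 1950-53, all 40 firms (from the text)
firms, years = 40, 4
subsidy_ratio = 0.46        # tax avoidance / advertising expenditures (from the text)
normal_avoidance = 249.0    # $ millions at a 52 percent tax rate (from the note)

per_firm_year = total_avoidance / (firms * years)
implied_advertising = total_avoidance / subsidy_ratio   # implied, not reported
normal_ratio = normal_avoidance / implied_advertising

print(round(per_firm_year, 2), round(implied_advertising), round(normal_ratio, 2))
# prints: 2.33 811 0.31
```

The per-firm figure is consistent with "more than $2 million per firm per year," and the ratio at the 52 percent rate reproduces the 0.31 quoted in the note.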



Appendix


TABLE A1

Profit Rates, Advertising-to-Sales Ratios, Advertising Assets, and Tax Avoidance
from Expensing of Advertising, for 40 Food Manufacturing Firms

                             Average Profit Rates (%)    Advertising-           Average  Average Annual
                                                             to-Sales Advertising Asset   Tax Avoidance
Company                         P1     P2    P1*    P2*     Ratio (%)      ($ Millions)    ($ Millions)

Swift & Co.                    6.5    7.2    5.8    6.3          0.37              63.6            4.32
Armour & Co.                   5.1    5.8    4.9    5.4          0.27              34.0            2.87
Schenley Industries            6.4    8.6    5.7    7.0          2.50              79.2            5.31
National Dairy Products        9.9   13.0    8.8   10.9          0.92              59.3            5.19
Jos. Seagram & Sons           11.3   14.2    9.1   10.5          2.06             101.6            6.92
General Foods                 10.7   12.5    7.2    7.7          4.03             182.0           13.34
National Distillers Prod.      6.4    8.0    5.8    6.9          1.96              59.6            4.40
Borden Co.                     9.0   10.9    8.1    9.4          0.60              33.4            1.98
Coca-Cola Co.                 16.6   16.8   12.5   12.6          2.23              60.4            1.50
National Biscuit Co.          12.3   12.5   10.7   10.8          1.28              29.6            1.99
Hiram Walker & Sons           13.7   14.0   11.3   11.4          1.14              37.2            1.32
Corn Products Refining        12.4   12.4   11.0   11.0          1.32              21.2            1.01
California Packing             8.1    9.2    7.5    8.2          1.13              16.6            1.18
Wilson & Co.                   4.3    4.4    4.1    4.2          0.11               8.5            0.30
General Mills                  9.4    9.7    6.0    6.1          3.24             107.0            7.17
Publicker Industries           4.1    4.1    4.1    4.1          1.57              12.3            1.41
H. J. Heinz                    6.5    6.7    4.9    4.8          1.45              41.1            0.66
Standard Brands                8.3    8.4    5.3    5.3          1.94              80.8            2.09
Libby, McNeill & Libby         6.5    7.8    5.2    5.8          1.11              24.0            0.70
Wesson Oil & Snowdrift         7.0    7.0    6.5    6.5          1.02              11.4            0.93
Pillsbury Mills                6.1    7.1    5.4    5.9          2.49              34.8            4.30
Pabst Brewing                 11.1   12.1    9.6   10.3          1.75              17.2            1.45
Quaker Oats Co.               11.3   13.0    7.9    8.4          3.23              50.2            3.52
Ralston Purina Co.            10.6   12.3    8.4    9.3          0.63              27.6            0.93
Carnation Co.                 10.5   12.4    9.0   10.1          1.01              18.8            1.52
Wm. Wrigley, Jr. Co.          15.8   15.8   11.4   11.4          5.73              30.5            2.05
Continental Baking Co.         9.8   12.3    7.6    8.9          1.16              19.4            0.77
Stokely-Van Camp               8.1    8.1    7.1    7.0          1.26               9.6            0.73
Sunshine Biscuits             15.2   15.2   13.0   13.0          1.07               8.5            0.56
Brown-Forman Distillers        9.3   12.4    8.0    9.6          3.68              12.8            1.40
Kellogg Co.                   23.5   23.5   10.7   10.7          5.04              55.8            3.08
Pet Milk Co.                   7.8    7.8    6.4    6.4          1.38              13.4            1.14
Best Foods                    13.8   13.8    9.4    9.4          2.85              21.2            0.91
Geo. Hormel & Co.              8.7   10.0    7.3    8.0          0.46               9.5            0.62
Canada Dry Ginger Ale          8.5    9.6    6.1    6.4          3.70              18.4            1.01
Glenmore Distillers Co.        7.9    9.7    6.4    7.1          3.89              14.8            1.17
Hunt Foods                     9.5    9.9    8.8    9.0          3.64               7.1            1.45
Pepsi-Cola Co.                11.4   12.4    8.3    8.6          4.13              16.1            0.92
Gerber Products Co.           15.1   16.9   12.0   12.7          2.32               7.4            0.75
Green Giant Co.                8.6   10.2    7.1    7.7          3.61               8.7            0.78





References 




Bloch, Harry. "Advertising, Competition, and Market Performance." Ph.D.
thesis, Dept. Econ., Univ. Chicago, 1971.

Comanor, William S., and Wilson, Thomas A. "Advertising, Market Structure,
and Performance." Rev. Econ. Statis. 49, no. 4 (November 1967): 423-40.

Federal Trade Commission. "Economic Report on the Influence of Market
Structure on the Profit Performance of Food Manufacturing Companies."
Washington: Government Printing Office, 1969.

Goldberger, Arthur S. Econometric Theory. New York: Wiley, 1964.

Imel, Blake, and Helmberger, Peter. "Estimation of Structure-Profit
Relationships with Application to the Food Processing Sector." A.E.R. 61, no. 4
(September 1971): 614-27.

Johnston, J. Econometric Methods. New York: McGraw-Hill, 1963.

Lambin, Jean-Jacques. "Optimal Allocation of Competitive Marketing Efforts:
An Empirical Study." J. Bus. 43, no. 4 (October 1970): 468-84.

Miller, Richard A. "Market Structure and Industrial Performance: Relation of
Profit Rates to Concentration, Advertising Intensity, and Diversity." J.
Indus. Econ. 27, no. 2 (April 1969): 104-18.

Nerlove, Marc, and Waugh, Frederick V. "Advertising without Supply Control."
J. Farm Econ. 42, no. 4 (November 1961): 813-37.

Palda, Kristian. The Measurement of Cumulative Advertising Effects. Englewood
Cliffs, N.J.: Prentice-Hall, 1964.

Peles, Yoram. "Rates of Amortization of Advertising Expenditure." J.P.E. 79,
no. 5 (September-October 1971): 1032-58.

Shepherd, William G. "The Elements of Market Structure." Rev. Econ. Statis. 54,
no. 1 (February 1972): 25-37.

Telser, Lester G. "Advertising and Cigarettes." J.P.E. 70, no. 5 (October 1962):
471-99.

Telser, Lester G. "Some Aspects of the Economics of Advertising." J. Bus. 41,
no. 2 (April 1968): 166-73.

Telser, Lester G. "Theory of the Firm: Discussion." A.E.R. 59, no. 2 (May 1969):
121-23.

Theil, Henri. Principles of Econometrics. New York: Wiley, 1971.

Weiss, Leonard W. "Advertising, Profits, and Corporate Taxes." Rev. Econ.
Statis. 51, no. 4 (November 1969): 421-30.



Investment Incentives and the Corporate 
Income Tax 


Agnar Sandmo 

Norwegian School of Economics and Business Administration


Within the framework of the neoclassical theory of investment and
capital this article discusses the effect of the corporate income tax on
investment incentives, particularly with respect to capital goods of
different degrees of durability. Sources of distortion are sought in the
treatment of depreciation, in incomplete interest deductions, and in
the treatment of capital gains. Depending on the relative magnitudes of
the parameters involved, the corporate income tax may change
relative prices in favor of either short-term or long-term capital goods.


1. Introduction 

The purpose of this article is to present an analysis of the effect of the
corporate income tax on investment incentives. The framework of
analysis is the neoclassical theory of investment and capital; the firm's
optimal use of labor and capital over time is derived from the basic
criterion of present-value maximization. A large part of the modern
work in this area has been done by Dale Jorgenson and his associates;
they have also considered the effect of the corporate income tax, for
example, in the article by Hall and Jorgenson (1967), and subjected
the model to careful empirical tests.¹ There still seems to be a need,
however, for a more detailed theoretical discussion of the model,
particularly since there is a competing body of theory, exemplified by
writings of E. Cary Brown (1948), Leif Johansen (1965), and Jan
Södersten (1972), which appears to lead to somewhat different
conclusions and which also raises some new questions. Briefly, these writers
have turned their attention to the effect of the corporate income tax on
the composition of the capital stock, in particular with respect to
durability, while the neoclassical analysis has been focused on the
substitution between labor and capital. The present paper attempts to
integrate the two approaches by extending the neoclassical model to take
account of the existence of capital goods of different degrees of durability.
The analysis is partial equilibrium in character, although some
general-equilibrium considerations are touched upon in Section 9. A full-scale
investigation of the general-equilibrium implications of the analysis
would be of considerable interest, but it lies beyond the scope of a single
paper.

I am indebted to Leif Johansen, Thore Johnsen, and the referees for extremely helpful
comments and suggestions.

¹ In addition to the Hall-Jorgenson article, the reader is referred to the book by Fromm
(1971) for a series of interesting, although perhaps rather inconclusive, empirical
investigations of the problem.


2. The Basic Model of Capital Accumulation

The formulation of the model of capital accumulation to be employed
in the following is derived from an earlier paper (Sandmo 1971a); the
only change of any significance is that we shall work with two different
capital goods instead of one.² In this section we shall present the basic
features of the model while postponing the analysis of taxation to
Section 3.

Output in period t depends on the quantities of labor and the two types
of capital used during the period:

$$X_t = F(L_t, K_{1t}, K_{2t}). \tag{1}$$

Marginal productivities are assumed to be everywhere positive, and there
are nonincreasing returns to scale.

Gross investments in period t, $I_{jt}$ (j = 1, 2), are equal to net investments
plus replacements. We take depreciation to be proportional to capital
stock, so that we have, for each capital good,

$$I_{jt} = K_{j,t+1} - K_{jt} + \delta_j K_{jt} \qquad (j = 1, 2;\ 0 < \delta_j < 1). \tag{2}$$

It is also assumed that the stocks of capital at the beginning of the initial
period are given, that is,

$$K_{j0} = \text{exogenously given} \qquad (j = 1, 2). \tag{3}$$

These assumptions imply that capital goods bought in period t increase
the productive capacity of the firm only in period t + 1. We may think
of this lag as the time required to install new capital goods. It is, of course,
a weakness of the model that the length of this time period is taken to be
exogenously determined, so that adjustment costs are ignored.

² As in the earlier paper, the model is formulated in discrete time. An alternative
would have been to use continuous time and to derive the results by means of
calculus-of-variations methods. Such a reformulation presents no essential problems and would in
some respects have been more elegant. But the problems discussed in this paper can be
handled equally well by the simpler approach chosen here, which probably appeals to a
wider readership.

The firm is assumed to be a price taker in both commodity and factor
markets. In period t the price of output is $p_t$, $w_t$ is the wage rate, and
$q_{1t}$ and $q_{2t}$ are the prices of capital goods. For simplicity we shall assume
that these prices are expected to be constant over time, so that $p_t = p$,
$w_t = w$, and $q_{jt} = q_j$ (j = 1, 2). Likewise, the rate of interest is constant
over time and equal to i; it is assumed to be strictly positive. Some special
problems associated with price variability are discussed in Section 7.
The net cash flow of the firm in period t, the excess of receipts over
expenditure, is defined as

$$R_t = pX_t - wL_t - q_1 I_{1t} - q_2 I_{2t}. \tag{4}$$

The objective of the firm is to maximize the present value of its cash
flow, that is,

$$V = \sum_{t=0}^{\infty} (1 + i)^{-t} R_t. \tag{5}$$

By substituting into this expression from (1), (2), and (4), we can write
the maximand more fully as

$$V = \sum_{t=0}^{\infty} (1 + i)^{-t} \bigl[\, pF(L_t, K_{1t}, K_{2t}) - wL_t
 - q_1 (K_{1,t+1} - K_{1t} + \delta_1 K_{1t})
 - q_2 (K_{2,t+1} - K_{2t} + \delta_2 K_{2t}) \bigr]. \tag{6}$$

The first-order maximum conditions then become

$$p\,\frac{\partial F}{\partial L_t} - w = 0 \qquad (t = 0, 1, 2, \ldots), \tag{7}$$

$$p\,\frac{\partial F}{\partial K_{1t}} - (i + \delta_1)\,q_1 = 0 \qquad (t = 1, 2, \ldots), \tag{8}$$

$$p\,\frac{\partial F}{\partial K_{2t}} - (i + \delta_2)\,q_2 = 0 \qquad (t = 1, 2, \ldots). \tag{9}$$

Note that the t-index runs from t = 0 for labor and from t = 1 for
capital goods. This is because the initial capital stocks are given, so that
the only production decision in the initial period concerns the amount of
labor to be employed with the given amounts of real capital.
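As a check on condition (8), the step from the maximand (6) can be written out. The sketch below assumes an interior solution and writes V for the maximand in (6); it uses only the two periods in which $K_{1t}$ appears.

```latex
% K_{1t} (t >= 1) enters (6) twice: as a purchase in period t-1
% and as a productive, depreciating stock in period t.
\begin{align*}
\frac{\partial V}{\partial K_{1t}}
  &= -(1+i)^{-(t-1)}\, q_1
     + (1+i)^{-t}\left[\, p\,\frac{\partial F}{\partial K_{1t}}
     + (1-\delta_1)\, q_1 \right] = 0 \\
\Longrightarrow\quad
p\,\frac{\partial F}{\partial K_{1t}}
  &= (1+i)\, q_1 - (1-\delta_1)\, q_1 \;=\; (i+\delta_1)\, q_1 .
\end{align*}
```

The same steps with subscript 2 give (9). Labor enters only the current period's cash flow, which is why (7) contains no interest or depreciation term.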

Equations (7)-(9) state the equality between the value of each factor's
marginal product and its price. In the case of labor the interpretation
of the condition is straightforward. In the case of capital goods, however,
the factor prices involved are the shadow prices or rental rates, which
represent the cost of holding a unit of capital goods for one time period.
This is obviously equal to the interest on the value of a unit of capital
plus the value of its depreciation over the period.
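The rental-rate interpretation can be illustrated numerically. The Python sketch below is not part of the model: it assumes a Cobb-Douglas production function with decreasing returns, truncates the horizon, and uses arbitrary parameter values and hypothetical function names. It checks by finite differences that the present value of cash flows is stationary in the stock of the first capital good exactly where that good's marginal value product equals (i + δ₁)q₁.

```python
def stationary_optimum(p, w, c1, c2, a=0.5, b=0.2, c=0.1):
    """Closed-form solution of the marginal conditions for the assumed
    technology F = L**a * K1**b * K2**c; c1 and c2 are the rental rates."""
    s = a + b + c                                  # s < 1: decreasing returns
    g = (a / w) ** a * (b / c1) ** b * (c / c2) ** c
    x = (p ** s * g) ** (1.0 / (1.0 - s))          # optimal output
    return a * p * x / w, b * p * x / c1, c * p * x / c2


def present_value(k1_path, labor, k2, p, w, q1, q2, i, d1, d2,
                  a=0.5, b=0.2, c=0.1):
    """Truncated present value of cash flows; labor and K2 are held constant."""
    total = 0.0
    for t in range(len(k1_path) - 1):
        output = labor ** a * k1_path[t] ** b * k2 ** c
        gross_inv_1 = k1_path[t + 1] - (1.0 - d1) * k1_path[t]
        gross_inv_2 = d2 * k2                      # replacement only
        cash = p * output - w * labor - q1 * gross_inv_1 - q2 * gross_inv_2
        total += (1.0 + i) ** (-t) * cash
    return total


def slope_at(t, k1_level, h=1e-5, **kw):
    """Central-difference derivative of present value w.r.t. K1 in period t."""
    up, down = [k1_level] * 12, [k1_level] * 12
    up[t] += h
    down[t] -= h
    return (present_value(up, **kw) - present_value(down, **kw)) / (2.0 * h)


p, w, q1, q2, i, d1, d2 = 1.0, 1.0, 1.0, 1.0, 0.05, 0.10, 0.30
labor, k1, k2 = stationary_optimum(p, w, (i + d1) * q1, (i + d2) * q2)
params = dict(labor=labor, k2=k2, p=p, w=w, q1=q1, q2=q2, i=i, d1=d1, d2=d2)
slope_at_optimum = slope_at(5, k1, **params)       # approximately zero
slope_with_excess = slope_at(5, 2.0 * k1, **params)  # negative: too much capital
```

With these illustrative numbers the less durable good (δ₂ = 0.30) carries a rental rate of 0.35 per unit against 0.15 for the more durable one, although both have the same purchase price.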

Since we have now derived the shadow prices of capital goods, we can
define profit in period t as

$$\Pi_t = pF(L_t, K_{1t}, K_{2t}) - wL_t - (i + \delta_1)\,q_1 K_{1t} - (i + \delta_2)\,q_2 K_{2t}, \tag{10}$$

and the present value of profits as

$$V' = pF(L_0, K_{10}, K_{20}) - wL_0 + \sum_{t=1}^{\infty} (1 + i)^{-t}\, \Pi_t. \tag{11}$$

Obviously, maximization of $V'$ with respect to $L_t$ and $K_{jt}$ will yield
the first-order conditions (7)-(9). Accordingly, we have shown that
maximization of the present value of cash flows as defined by (4) and
maximization of the present value of profits as defined by (10) lead to
identical production and investment decisions, so that they are equivalent
decision rules.
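The equivalence can also be seen directly by rearranging the capital terms of the cash-flow maximand. The following sketch assumes that the infinite sums converge and writes V for the present value of cash flows and V' for the present value of profits.

```latex
% Collect the terms of (6) that involve capital good j, with K_{j0} given.
\begin{align*}
-\,q_j \sum_{t=0}^{\infty} (1+i)^{-t}\bigl[K_{j,t+1} - (1-\delta_j) K_{jt}\bigr]
 &= (1-\delta_j)\, q_j K_{j0}
    - q_j \sum_{t=1}^{\infty} (1+i)^{-t}\bigl[(1+i) - (1-\delta_j)\bigr] K_{jt} \\
 &= (1-\delta_j)\, q_j K_{j0}
    - \sum_{t=1}^{\infty} (1+i)^{-t}\,(i+\delta_j)\, q_j K_{jt} .
\end{align*}
% Summing over j = 1, 2 and adding the output and wage terms gives
%   V = V' + \sum_{j} (1-\delta_j)\, q_j K_{j0},
% where the last term is a constant fixed by the initial stocks.
```

Because the two objectives differ only by a constant that depends on the given initial stocks, they are maximized by the same labor and capital paths.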

As has been shown in my earlier paper (1971a) and as will emerge
from the discussion in Section 7 below, the simple optimality criterion
of equating marginal products and factor prices in each period is a
special case of a myopic decision rule which can be generalized to the
case of prices that are changing over time; in that case, only the current
expected rates of price change enter into the rental rates of capital goods.
This property of the optimal policy greatly simplifies the analysis, but
it does rest on some rather restrictive assumptions. It assumes the existence
of capital markets that are perfect in the extreme sense that capital goods
can be bought and sold in any amount at the same price. It also assumes
that adjustment costs are either insignificant or that they are simply
proportional to the amount of gross investment, so that these costs can
be interpreted as included in capital goods prices.

These are strong assumptions, which clearly limit the applicability of
the analysis. The first one, however, may not be as restrictive as it seems,
since the analysis will still be valid, even if gross investment is restricted
to be nonnegative, provided that the amount of net disinvestment does
not exceed the amount of depreciation. The second assumption is more
serious in that it may limit the possibilities of providing hypotheses about
the response of investment demand to tax changes in the very short run.
Still, in the present context the more long-run effects of taxes on rental
rates of capital goods seem to be the more interesting ones.

Returning to our present-value decision rules, it is well known that
these rules can themselves be derived from a more fundamental criterion,
namely, utility maximization over time on the part of the owners of the
firm. If owners can borrow and lend in a perfect capital market, the
policy which from their point of view is the optimal one for the firm is to
ensure that they attain the highest possible budget constraint for
intertemporal transactions.³ This policy is precisely the maximization of the
present value of cash flows or profits.


3. Neutrality of the Corporate Income Tax 

In order to discuss the effects of a tax on profits we must ensure that the
model implies the existence of profits. In long-run competitive
equilibrium profits will be zero, and the corporate income tax will not yield
any revenue. In order to have a positive tax base it seems as if we shall
have to depart from the competitive assumptions set out above. This
can be done without much difficulty; it is, in fact, easy to show that the
first-order conditions for a firm which is a monopolist in the product
market differ from (7)-(9) only in having marginal revenue substituted
for price. Thus, the following analysis can be given a more general
interpretation by simply interpreting p as marginal revenue.

However, even in the perfectly competitive case with constant returns
to scale, it may be meaningful to analyze the corporate income tax if
taxable profit is not equal to "true" profit. If some of the imputed factor
rewards are not in fact tax deductible, a tax on profit will really amount
to a tax on those factors. We shall return to several examples of this
later on.

Let us first suppose that a proportional profits tax is added to the
model described in the previous section. In what way will the imposition
of such a tax influence the allocation of capital and labor over time?
One important conclusion is immediate: if taxable profit is defined as
in (10), a tax on profit will be equivalent to a tax on present value and
will have no effects on investment decisions.⁴

Analytically this result is, of course, quite trivial. But it is worth
stressing that the introduction of the time dimension into the model of
the firm does not necessarily imply that the corporate income tax loses
its neutrality; neutrality can be achieved by proper definition of the
tax base.
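The neutrality result can be illustrated numerically. The following Python sketch is not part of the original analysis: it assumes a hypothetical one-period firm with revenue p·K^0.5 and a rental rate r equal to the full cost of capital, so that taxable profit coincides with true profit. The function names, the technology, and all numbers are illustrative assumptions.

```python
# Hypothetical one-period illustration: a proportional tax on true profit
# rescales the objective by (1 - s) and leaves the optimal capital stock unchanged.

def true_profit(K, p=1.0, r=0.15):
    """Revenue p*sqrt(K) minus the full rental cost r*K (assumed technology)."""
    return p * K ** 0.5 - r * K


def optimal_capital(s, grid):
    """Capital stock on the grid that maximizes after-tax profit (1 - s) * profit."""
    return max(grid, key=lambda K: (1.0 - s) * true_profit(K))


grid = [k / 10 for k in range(1, 301)]  # candidate capital stocks 0.1, 0.2, ..., 30.0
choices = {s: optimal_capital(s, grid) for s in (0.0, 0.25, 0.5)}
print(choices)  # the same capital stock is chosen at every tax rate
```

Because the tax multiplies the whole objective by the positive constant 1 - s, the ranking of candidate capital stocks cannot change; the grid search merely makes this visible.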


4. The Role of Depreciation Allowances

The imposition of a corporate income tax usually implies a set of rules
governing the depreciation allowances that can be deducted from gross
income. It seems to be quite generally believed that it is in the interaction
between the depreciation allowances and the tax rate that one finds the
most important distortive effects of the corporate income tax.

³ This point is elaborated in Hirshleifer (1958) and Sandmo (1968), who also consider
the effect of imperfections in the capital market.

⁴ In somewhat different versions this conclusion can also be found in the articles by
Samuelson (1964) and Hall and Jorgenson (1967).

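Profit before tax is still defined as in (10), while taxable profit is now

$$\pi_t^* = pF(L_t, K_{1t}, K_{2t}) - wL_t - (i + \alpha_1)q_1K_{1t} - (i + \alpha_2)q_2K_{2t}, \tag{12}$$

where $\alpha_1$ and $\alpha_2$ are the rates of depreciation allowance. If s is the tax rate, the
optimal policy is found by maximizing, in each current period, after-tax
profit, which is equal to $\pi_t - s\pi_t^*$. The first-order conditions then become

$$p\frac{\partial F}{\partial L_t} - w = 0 \quad (t = 0, 1, 2, \ldots), \tag{13}$$

$$p\frac{\partial F}{\partial K_{1t}} - \left[i + \delta_1 + \frac{s}{1 - s}(\delta_1 - \alpha_1)\right]q_1 = 0 \quad (t = 1, 2, \ldots), \tag{14}$$

$$p\frac{\partial F}{\partial K_{2t}} - \left[i + \delta_2 + \frac{s}{1 - s}(\delta_2 - \alpha_2)\right]q_2 = 0 \quad (t = 1, 2, \ldots). \tag{15}$$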

We see that when the tax depreciation allowances do not correspond
to the “true” rates of depreciation, the tax rate does influence the shadow
prices of capital goods and thereby the choice of optimal factor
proportions.⁵

There are two ways in which the corporate income tax may lead to a
distortion of pretax factor prices. First, it may increase rental rates
relative to the wage rate and thus lead to a less capital-intensive choice
of technique. Second, it may influence the relative rental rates. Of course,
both effects may be present simultaneously.

The first effect will be unambiguous in the cases where depreciation
allowances are either both greater than or both less than the true rates
of depreciation; these cases create incentives to substitute capital for
labor and labor for capital, respectively.⁶ The ambiguous case is the
intermediate one where one capital good is taxed and the other one is
subsidized relative to labor. In this case the corporate income tax has
no obvious effect on the general capital intensity of production.
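The direction of the first effect can be checked with a few lines of Python. This is a minimal sketch, assuming that the after-tax rental rate of a capital good takes the form [i + δ + (s/(1 - s))(δ - α)]q implied by the first-order conditions, with pretax rental rate (i + δ)q. The function names and all numerical values are hypothetical.

```python
# Sketch of the rental-rate wedge created by depreciation allowances.
# Assumed form: r = [i + delta + s/(1 - s) * (delta - alpha)] * q.

def pretax_rental(i, delta, q=1.0):
    """Rental rate without tax: interest plus true depreciation, times price."""
    return (i + delta) * q


def aftertax_rental(i, delta, alpha, s, q=1.0):
    """Rental rate when the tax allows depreciation at rate alpha."""
    return (i + delta + s / (1.0 - s) * (delta - alpha)) * q


i, s, delta = 0.05, 0.5, 0.10  # hypothetical interest, tax, and depreciation rates
for label, alpha in (("allowance below true rate", 0.05),
                     ("allowance equal to true rate", 0.10),
                     ("allowance above true rate", 0.15)):
    r0 = pretax_rental(i, delta)
    r = aftertax_rental(i, delta, alpha, s)
    print(f"{label}: pretax {r0:.3f}, after tax {r:.3f}")
```

An allowance below the true rate raises the rental rate relative to the (untaxed) wage, an allowance above it lowers the rental rate, and the two coincide when the allowance equals true depreciation.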

⁵ We do not carry through here a full comparative-static analysis. The interesting part
of the theory concerns the influence of taxation on the relative shadow prices of factors
of production. The further analysis of the effects of factor price changes on factor demands
is a standard exercise in the theory of the firm.

⁶ It is theoretically possible, although practically implausible, that rental rates might
become negative. This case is meaningless in the context of a perfect capital market and
is not taken care of by the present formulation.

To study the effects on the composition of the capital stock, we compare
the rental rates of the two types of capital before and after tax. Thus, we
define

$$r_j^0 = (i + \delta_j)q_j \quad (j = 1, 2), \tag{16}$$

$$r_j = \left[i + \delta_j + \frac{s}{1 - s}(\delta_j - \alpha_j)\right]q_j \quad (j = 1, 2). \tag{17}$$

We shall say that the corporate income tax discriminates in favor of
capital of type 1, is neutral with respect to capital-goods inputs, or
discriminates in favor of capital of type 2 according to whether the
relative rate of increase in the rental rate of type 1 is less than, equal to,
or greater than that of type 2, that is,

$$\frac{r_1 - r_1^0}{r_1^0} \lesseqgtr \frac{r_2 - r_2^0}{r_2^0}. \tag{18}$$

Substituting from (16) and (17) and rearranging terms, we can write
this condition as

$$\frac{\delta_1 - \alpha_1}{i + \delta_1} \lesseqgtr \frac{\delta_2 - \alpha_2}{i + \delta_2}. \tag{19}$$


This is the general condition for the discriminatory effect as between
capital goods of the corporate income tax. For the tax to be neutral
with respect to the composition of capital, the gaps between the rates of
true depreciation and depreciation allowance must constitute the same
fraction of the rate of true depreciation plus interest.

There are two special cases in which condition (19) is simplified and
which seem to be of considerable practical significance. The first case is
the one where the rate of depreciation allowance is a flat one, that is,
$\alpha_1 = \alpha_2 = \alpha$. Condition (19) is then reduced to

$$\delta_1 \lesseqgtr \delta_2. \tag{20}$$

But this means simply that in the case of a flat rate of depreciation
allowance, the corporate income tax creates an incentive to change the
composition of the capital stock in favor of more durable capital goods.

Condition (19) can be rewritten as

$$\frac{\delta_1 - \alpha_1}{\delta_1} \cdot \frac{\delta_1}{i + \delta_1} \lesseqgtr \frac{\delta_2 - \alpha_2}{\delta_2} \cdot \frac{\delta_2}{i + \delta_2}. \tag{21}$$


Suppose now that the tax authorities fix the rates of depreciation
allowance such that they are proportional to true rates of depreciation, that is,

$$\frac{\delta_1 - \alpha_1}{\delta_1} = \frac{\delta_2 - \alpha_2}{\delta_2}. \tag{22}$$

In the case where the numerators of the fractions are positive (the
case of decelerated depreciation), we get again condition (20). We
conclude that in the case of decelerated depreciation and proportional
depreciation allowances, the corporate income tax changes investment
incentives in favor of more durable capital goods. When the numerators
are negative, however, substitution of (22) into (21) will yield condition
(20) with the inequality signs reversed. Consequently, in the case of
accelerated depreciation and proportional depreciation allowances, the
corporate income tax changes investment incentives in favor of less
durable capital goods.

The intuitive reason for these results should be clear. The rental rates
have two components, interest and depreciation. In the case of decelerated
depreciation the corporate income tax is, in addition to being simply
a tax on profit, a tax on depreciation. If depreciation is already the
major component of the rental rate, such a tax is more important than if
depreciation is initially a very insignificant part of the opportunity cost
of capital; that is, the tax falls more heavily on short-term than on
long-term capital goods. In the case of accelerated depreciation, the tax
contains an element of a capital subsidy, and the argument applies in
reverse.
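The special cases of this section can be reproduced with a short Python sketch. It assumes that the relative increase in the rental rate of a capital good equals (s/(1 - s))(δ - α)/(i + δ), which is the expression compared across the two goods in the discrimination condition. The function names and the numerical rates are hypothetical.

```python
# Sketch of the discrimination test between two capital goods.
# Assumed relative increase in the rental rate: s/(1-s) * (delta - alpha)/(i + delta).

def relative_increase(i, delta, alpha, s):
    """Proportional change in the rental rate caused by the tax."""
    return s / (1.0 - s) * (delta - alpha) / (i + delta)


def favored_type(i, s, delta1, alpha1, delta2, alpha2, tol=1e-12):
    """Return 1 or 2 for the capital good the tax favors, 0 if it is neutral."""
    g1 = relative_increase(i, delta1, alpha1, s)
    g2 = relative_increase(i, delta2, alpha2, s)
    if abs(g1 - g2) <= tol:
        return 0
    return 1 if g1 < g2 else 2


i, s = 0.05, 0.5
durable, short_lived = 0.02, 0.20  # hypothetical true depreciation rates

# Flat allowance: the same rate for both goods.
print(favored_type(i, s, durable, 0.05, short_lived, 0.05))
# Proportional and decelerated: allowance is half of true depreciation.
print(favored_type(i, s, durable, 0.5 * durable, short_lived, 0.5 * short_lived))
# Proportional and accelerated: allowance is 1.5 times true depreciation.
print(favored_type(i, s, durable, 1.5 * durable, short_lived, 1.5 * short_lived))
```

With these numbers the flat and the decelerated proportional schemes favor the durable good (type 1), while the accelerated proportional scheme favors the short-lived good (type 2).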


5. Comparison with the Brown-Johansen Formulation

An influential early contribution to the analysis of the tax treatment of
depreciation is that of Brown (1948); his analysis was later extended and
modified by Musgrave (1959), Johansen (1965), and Sodersten (1972).
It is obviously of interest to compare their formulation with the present
one. A direct comparison is difficult, however; the authors cited do not
derive their conclusions from an explicit optimization model for the firm,
but rather from simple present-value calculations for a single investment
project. Moreover, there is the additional difficulty that they do not
assume a constant rate of depreciation. Still, a comparison, if only in
more general terms, seems to be called for. The following exposition is
based on that of Johansen (1965, chap. 7, par. 5).

Johansen writes the present value of an investment project before tax as

$$V^0 = \sum_{t=1}^{n} a(1 + i)^{-t} - K = a\rho_n - K. \tag{23}$$

Here a is the cash flow resulting from investing the amount K in the
initial period; this cash flow is assumed to be constant over time. The
tax base in year t (t = 1, 2, ..., n) is defined as net income, which is
a - (1/n)K, where n is the lifetime of the project and a linear depreciation
allowance is assumed. Present value after tax is then equal to present
value before tax minus the present value of tax payments, that is,

$$V = V^0 - s\left(a - \frac{K}{n}\right)\rho_n. \tag{24}$$


For i > 0 we must have $\rho_n < n$. Hence, for a project that was profitable
before tax, so that $V^0 > 0$, we must have a - K/n > 0. Equation (24)
then implies that a project with positive present value before tax may
have negative present value after tax; thus, the corporate income tax
is not neutral with respect to the choice between capital and other
factors of production. Moreover, since the present value of tax payments
is an increasing function of n, it follows that the tax will also discriminate
against long-term investment projects.
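A numerical sketch may make the sign reversal concrete. The Python code below assumes the present-value formulas described above: a constant cash flow a over n years discounted at rate i, an outlay K, and a tax at rate s on a - K/n in each year. The function names and the project data are hypothetical and are not taken from Johansen.

```python
# Sketch of the single-project present-value calculation with linear depreciation.
# Assumed: V0 = a*rho_n - K and V = V0 - s*(a - K/n)*rho_n, rho_n = sum (1+i)^-t.

def annuity_factor(i, n):
    """Present value of one unit received at the end of each of n years."""
    return sum((1.0 + i) ** (-t) for t in range(1, n + 1))


def pv_before_tax(a, K, i, n):
    """Present value of the project without tax."""
    return a * annuity_factor(i, n) - K


def pv_after_tax(a, K, i, n, s):
    """Present value after a tax at rate s on cash flow minus linear depreciation."""
    tax_pv = s * (a - K / n) * annuity_factor(i, n)
    return pv_before_tax(a, K, i, n) - tax_pv


a, K, i, n, s = 13.0, 100.0, 0.05, 10, 0.5  # hypothetical project
print(annuity_factor(i, n) < n)             # discounting makes rho_n smaller than n
print(pv_before_tax(a, K, i, n))            # slightly positive before tax
print(pv_after_tax(a, K, i, n, s))          # negative after tax
```

The project is marginally profitable before tax, but the allowance K/n is worth less in present-value terms than the outlay it is meant to recover, so the tax turns the present value negative.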

What is the assumption made here about true depreciation? Since a is
constant over the finite lifetime of the investment, it must be assumed
that depreciation is of the “sudden death” variety: the capital stock is
constant until the end of the nth period, when it depreciates 100 percent.
But this means that we have a special case of accelerated depreciation
allowances, and we showed in the previous section that this system tends
to discriminate against investment in more durable capital goods. Thus,
the two analyses are consistent on this point.

This comparison may not go to the heart of the matter, however.
Evaluating an investment on the basis of a fixed lifetime does not go
very well with the assumption of perfect reversibility, for in the latter
case a myopic policy would be optimal. An alternative interpretation
of the Brown-Johansen formulation is therefore that capital is irreversible;
once installed, it cannot be resold. The implication of this is clearly that
in value terms the rate of depreciation is 100 percent in the initial period,
although the technical lifetime of the capital good is still n periods. With
these assumptions it becomes very hazardous to compare the two
approaches, for the one adopted in the present paper cannot be easily
extended to cover the case of irreversible investments. There is a need,
therefore, to construct models of optimal capital policy that incorporate
the Brown-Johansen formulations in a more comprehensive framework.


6. Equity Capital and Incomplete Interest Deductions 

Tax laws in most countries do not provide for deductibility of interest
on the full value of the capital stock. The nondeductibility of interest on
equity capital has been seen by many writers as the main reason why the
corporate income tax cannot be assumed to be allocatively neutral.

The present model does not lend itself very well to an analysis of this
problem. A really satisfactory analysis would have to involve uncertainty
and the determination of the optimal debt-equity ratio via the price
formation in capital markets. These are complex and unsettled questions
and will not be explored here. Instead, we shall take as a given fact that
part of the firm’s capital stock is financed by equity and study the
consequences of this on the incentives to hold capital goods. Suppose first
that the firm’s equity capital is tied up with a specific capital stock which
is not subject to change, for example, the land used by the firm or buildings
which are specific to the firm and indivisible. Otherwise, all other capital
is debt financed, so that interest is deductible. It is easy to see that in
this case the corporate income tax must be allocatively neutral, since the
required debt-equity ratio is irrelevant to marginal decisions. The
interesting case is where there is a required debt-equity ratio for capital
goods purchases that are determined by the firm’s optimal policy.

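Assuming now that depreciation allowances correspond to true
depreciation, taxable profit is equal to

$$\pi_t^* = pF(L_t, K_{1t}, K_{2t}) - wL_t - \delta_1q_1K_{1t} - \delta_2q_2K_{2t} - (1 - k)i(q_1K_{1t} + q_2K_{2t}), \tag{25}$$

where k is the required ratio of equity to the total value of the firm’s
capital stock (0 < k < 1). Maximization of net profit, $\pi_t - s\pi_t^*$, yields
the first-order conditions

$$p\frac{\partial F}{\partial L_t} - w = 0 \quad (t = 0, 1, 2, \ldots), \tag{26}$$

$$p\frac{\partial F}{\partial K_{1t}} - \left[\left(1 + \frac{s}{1 - s}k\right)i + \delta_1\right]q_1 = 0 \quad (t = 1, 2, \ldots), \tag{27}$$

$$p\frac{\partial F}{\partial K_{2t}} - \left[\left(1 + \frac{s}{1 - s}k\right)i + \delta_2\right]q_2 = 0 \quad (t = 1, 2, \ldots). \tag{28}$$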



A positive value of k increases the rental rates relative to the wage rate
and creates an incentive to substitute labor for capital. This effect is
rather obvious. It should also be noted, however, that the relative rental
rates are also changed, given that they have different rates of depreciation.
The equity requirement will change relative prices in favor of capital of
type 1 or type 2 according to whether

$$\frac{\{1 + [s/(1 - s)]k\}i + \delta_1}{i + \delta_1} \lessgtr \frac{\{1 + [s/(1 - s)]k\}i + \delta_2}{i + \delta_2}. \tag{29}$$

This condition simplifies to

$$\delta_1 \gtrless \delta_2. \tag{30}$$

The conclusion is, then, that incomplete interest deductibility changes
the relative rental rates in favor of short-term capital goods.

This effect, though usually neglected in discussions of the corporate
income tax, is hardly very surprising. As we have seen, incomplete
deductibility raises the effective rate of interest. A classic result of capital
theory is that a higher rate of interest leads to a substitution of short-term
for long-term capital goods, and it is precisely this effect that we observe
here.
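The effect of the equity requirement can be illustrated with the following Python sketch. It assumes that the after-tax rental rate takes the form {[1 + (s/(1 - s))k]i + δ}q, that is, that nondeductible interest on the equity share k raises the effective rate of interest. The function name and the numbers are hypothetical.

```python
# Sketch of the rental rate under incomplete interest deductibility.
# Assumed form: r = {[1 + s/(1-s) * k] * i + delta} * q, against (i + delta) * q pretax.

def rental_ratio_with_equity(i, delta, s, k):
    """After-tax rental rate divided by the pretax rental rate."""
    effective_interest = (1.0 + s / (1.0 - s) * k) * i
    return (effective_interest + delta) / (i + delta)


i, s, k = 0.05, 0.5, 0.4           # hypothetical interest rate, tax rate, equity share
durable, short_lived = 0.02, 0.20  # hypothetical true depreciation rates

print(rental_ratio_with_equity(i, durable, s, k))      # larger relative increase
print(rental_ratio_with_equity(i, short_lived, s, k))  # smaller relative increase
```

Because interest is a larger share of the rental rate of the durable good, its rental rate rises proportionally more, which is the sense in which the tax favors short-term capital goods.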



7. Distortions Induced by Price Changes

It is, of course, a simplification to work with a model in which prices
are assumed to be constant over time; in fact, price variability may be
one of the more important reasons for differences between “true” and
taxable profit. If $p_t$, $q_t$, and $w_t$ are no longer constant with respect to t,
it is easy to see that profit before tax must be equal to

$$\pi_t = p_tF(L_t, K_{1t}, K_{2t}) - w_tL_t - [i + \delta_1 - (1 + i)\gamma_{1t}]q_{1t}K_{1t} - [i + \delta_2 - (1 + i)\gamma_{2t}]q_{2t}K_{2t}, \tag{31}$$

where we have defined the rates of change of capital goods prices as

$$\gamma_{jt} = \frac{q_{jt} - q_{j,t-1}}{q_{jt}} \quad (j = 1, 2). \tag{32}$$

The last terms in the bracketed expressions are the capital-gains
components of current profit.⁷ For the corporate income tax to be completely
neutral, it is necessary that these components are also included in taxable
profit; gains must be taxed and losses must be deductible. If this is not
the case, the existence of the corporate income tax will raise the rental
rates relative to the wage rate if prices of capital goods are falling and
reduce the rental rates in the opposite case.
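
⁷ See n. 6 above.

Failure to allow for capital gains may also affect the composition of
the capital stock. If taxable profit is written as

$$\pi_t^* = p_tF(L_t, K_{1t}, K_{2t}) - w_tL_t - (i + \delta_1)q_{1t}K_{1t} - (i + \delta_2)q_{2t}K_{2t}, \tag{33}$$

so that the firm maximizes $\pi_t - s\pi_t^*$, the first-order conditions are

$$p_t\frac{\partial F}{\partial L_t} - w_t = 0 \quad (t = 0, 1, 2, \ldots), \tag{34}$$

$$p_t\frac{\partial F}{\partial K_{1t}} - \left[i + \delta_1 - \frac{1}{1 - s}(1 + i)\gamma_{1t}\right]q_{1t} = 0 \quad (t = 1, 2, \ldots), \tag{35}$$

$$p_t\frac{\partial F}{\partial K_{2t}} - \left[i + \delta_2 - \frac{1}{1 - s}(1 + i)\gamma_{2t}\right]q_{2t} = 0 \quad (t = 1, 2, \ldots). \tag{36}$$

Substituting from (35) and (36) into (18), we see that the tax will change
prices in favor of capital of type 1, lead to no change, or to a change in
favor of capital of type 2 according to whether

$$\frac{i + \delta_1 - [1/(1 - s)](1 + i)\gamma_{1t}}{i + \delta_1 - (1 + i)\gamma_{1t}} \lesseqgtr \frac{i + \delta_2 - [1/(1 - s)](1 + i)\gamma_{2t}}{i + \delta_2 - (1 + i)\gamma_{2t}}. \tag{37}$$

This condition can be simplified to read

$$(i + \delta_1)\gamma_{2t} \lesseqgtr (i + \delta_2)\gamma_{1t}. \tag{38}$$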

As was to be expected, the result depends crucially on the relative rates
of change of capital-goods prices, hardly a very interesting observation.
What may not be so obvious, however, is that an effect remains even
when the relative rates of change of the two capital goods prices are the
same, so that $\gamma_{1t} = \gamma_{2t}$. If this common rate of change is positive, (38)
becomes

$$\delta_1 \lesseqgtr \delta_2. \tag{39}$$

If capital goods prices increase at a uniform rate, the failure to tax
capital gains creates an incentive to substitute long-term for short-term
capital goods.

If the common rate of change is negative, condition (38) becomes

$$\delta_1 \gtreqless \delta_2. \tag{40}$$

If capital goods prices decrease at a uniform rate, failure to allow tax
deductions for capital losses creates an incentive to substitute short-term
for long-term capital goods.

It may be intuitively helpful to think of a uniform increase of
capital-goods prices as a lowering of the real rate of interest; we know that this
favors long-term investment. Similarly, decreasing prices of capital goods
means a higher real rate of interest and a higher return on short-term
investments. Note, however, that the conclusions are independent of
other rates of price change and therefore are much more general than
statements about the impact of general inflation or deflation.
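The two uniform-price-change cases can be verified numerically. The Python sketch below assumes that the rental rate per unit of capital value is i + δ - (1 + i)γ before tax and i + δ - (1 + i)γ/(1 - s) after tax when capital gains are left out of taxable profit. The function name and the numbers are hypothetical.

```python
# Sketch of the rental-rate ratio when capital gains are excluded from taxable profit.
# Assumed: pretax  r0/q = i + delta - (1 + i) * gamma,
#          after   r/q  = i + delta - (1 + i) * gamma / (1 - s).

def rental_ratio_untaxed_gains(i, delta, gamma, s):
    """After-tax rental rate divided by the pretax rental rate."""
    pretax = i + delta - (1.0 + i) * gamma
    aftertax = i + delta - (1.0 + i) * gamma / (1.0 - s)
    return aftertax / pretax


i, s = 0.05, 0.5
durable, short_lived = 0.02, 0.20  # hypothetical true depreciation rates

for gamma in (0.02, -0.02):        # uniformly rising, then uniformly falling prices
    r_durable = rental_ratio_untaxed_gains(i, durable, gamma, s)
    r_short = rental_ratio_untaxed_gains(i, short_lived, gamma, s)
    favored = "long-term" if r_durable < r_short else "short-term"
    print(f"gamma = {gamma:+.2f}: ratios {r_durable:.3f}, {r_short:.3f} -> favors {favored}")
```

With rising prices the untaxed gain lowers the rental rate of the durable good proportionally more; with falling prices the nondeductible loss raises it proportionally more, so the ranking is reversed.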


8. Some Neglected Problems: Imperfect Capital Markets and 
Uncertainty 

The foregoing analysis depends crucially on the assumption of a perfect
capital market. Not only is the firm facing a given market rate of interest
which is insensitive to the amount it borrows; the availability of credit
is also independent of the firm’s own income and capital position. The
latter assumption may be of particular importance in this connection,
since the opinion is widely held that the effect of the corporate income
tax on investment decisions works mainly through its reduction of internal
financial funds.

Models incorporating such ideas are not easy to formulate. They would
clearly imply that one must go beyond the framework of the present
analysis to study an optimizing model of the supply side of the credit
market as well as an equilibrium theory explaining the market allocation
of funds between equity and debt. These considerations raise unsettled
theoretical problems of great complexity which we cannot attempt to
tackle here. Apart from this, however, the significance of these
complications may easily be overstated. Seen from the viewpoint of a single
firm, such “income effects” may seem highly important, but that is
frequently because the single firm does not take into account that the
same effects are present for all firms. After market prices have adjusted
to the tax, the availability of internal funds may not be a significant
independent force acting on the demand for capital goods.

No mention has been made so far of the effects of uncertainty. This
opens up a whole new set of problems, particularly because uncertainty
may enter in many ways, and several alternative assumptions may be
made about the objectives of the firm in an uncertain environment.
Perhaps the simplest assumption would be that the firm maximizes
expected present value; one might then believe that the analysis would
remain valid with a simple reinterpretation of the variables in terms of
expected values. But it is in fact obvious that this will be true only under
some additional assumptions; if, for example, the price of the product
and the technology are both uncertain and stochastically dependent, the
equivalence will not hold, simply because the expectation of the product
of price and output will not be equal to the product of the expectations.
In general, the reinterpretation of the model will be valid only if the
conditions of the Simon-Theil certainty equivalence theorem are
satisfied.⁸ Although these are restrictive conditions, it may perhaps not
be unreasonable to see our analysis as an approximation to the case of
expected present-value maximization.
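The point about stochastic dependence can be seen in a two-state example. The Python sketch below uses a purely hypothetical joint distribution in which price and output are high or low together; nothing in it is taken from the original analysis.

```python
# Two equally likely states in which price and output move together
# (hypothetical numbers): (probability, price, output).
states = [(0.5, 1.0, 1.0), (0.5, 3.0, 3.0)]

expected_price = sum(prob * price for prob, price, _ in states)
expected_output = sum(prob * output for prob, _, output in states)
expected_revenue = sum(prob * price * output for prob, price, output in states)

print(expected_revenue)                  # expectation of the product
print(expected_price * expected_output)  # product of the expectations
```

The gap between the two printed values is the covariance of price and output, which vanishes only if the two are uncorrelated; replacing each variable by its expected value therefore misstates expected revenue in this example.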

A further complication enters if it is assumed that the firm is risk
averse. Not only will we then face the problem of deriving meaningful
shadow prices; there is the added problem that the influence of factor
prices on optimal factor use becomes much more complex, since
uncertainty introduces income effects into the comparative statics of firm
behavior.⁹

In one important respect the introduction of uncertainty may add
significant new elements to the analysis. As shown by Mossin (1968)
and others,¹⁰ proportional income taxation will affect the investor’s
choice between safe and risky assets. In the case of full loss offset
provisions, taxation will lead to a substitution of risky for safe assets in
investors’ wealth portfolios. Suppose now that short-term capital goods
are more risky than long-term ones; the precise meaning of this assumption
would of course have to be clarified in a rigorous analysis. With insufficient
depreciation allowances the corporate income tax will tend to discourage
short-term investment according to the analysis in Section 4 above, but
the risk aspects might tend to counteract this tendency. This is, of course,
only one of many possible outcomes and is not intended as more than an
example; there is clearly a need, however, for a more detailed study of
the interaction of durability and riskiness in the impact of the corporate
income tax.

⁸ For an exposition of this theorem, see Theil (1964, chap. 2).

⁹ Analysis of the behavior of the risk-averse firm may be found in Sandmo (1971b) and
Leland (1972).

¹⁰ The first treatment of this problem was by Domar and Musgrave (1944). A later
discussion along the same lines can be found in Musgrave (1959, chap. 14).


9. Some Concluding Remarks

The above analysis is in the nature of partial-equilibrium theory. We
have studied the impact of the corporate income tax on the demand
functions for capital and labor services. An extension of the analysis
would be to construct a general equilibrium model and to study the
incidence of the tax, that is, its effects on market prices and the interest
rate. This problem has been studied by Harberger (1962), and the more
recent literature has been surveyed by Mieszkowski (1969). It lies outside
the scope of the present article to tackle these problems as well, but some
brief remarks may be in order.

It may be argued that a partial-equilibrium treatment like the present
one tends inevitably to be misleading. Suppose that one concludes that
imposition of a corporate income tax means raising the rental rates of
capital-goods services relative to the wage rate and taxing the use of
long-term capital goods more heavily than the use of short-term capital
goods. Will this not tend to change general-equilibrium prices in such
a way as to counteract these first effects? It has been maintained, for
example, that the rate of interest will be lower in the new equilibrium
situation; this will of course lower the rental rates generally and relatively
more for long-term than for short-term capital goods.

It is quite clear that taking account of the full general equilibrium
effects of the tax will tend to modify the quantitative magnitudes of the
effects analyzed above. However, since a theoretical treatment can yield
only qualitative results, this is hardly a very significant conclusion. The
real question is whether the counteracting tendencies could actually be
expected to be stronger than the primary effects of the tax; this does in
fact seem very doubtful. For a general-equilibrium model to yield such
results, it would probably have to violate even very weak stability
conditions, which would tend to reduce further the plausibility of such
a result.



We have seen that there are a variety of reasons why the corporate
income tax may distort investment decisions, even on strict neoclassical
assumptions. If the chief rationale for the existence of the tax is to be
sought in its redistributional effects, these distortions must be judged to
be undesirable side effects, to be avoided if possible. But a concept of
taxable profits which correctly takes account of depreciation, interest,
and capital gains may present the tax authorities with great administrative
difficulties. An alternative solution, which has been suggested by Vernon
Smith (1963), would be to use cash flow as the tax base instead of profit,
so that all new capital-goods purchases are written off for tax purposes
immediately, while no capital costs are deductible in later periods.
Going back to equations (4) and (5), one sees immediately that this
amounts to a tax on present value, which will be allocatively neutral, not
only among capital goods of different degrees of durability, but also
between labor and capital goods. Of course, in years of large capital-goods
purchases, cash flow may become negative, requiring either a subsidy
or provisions for carrying forward the “deficit” to years of positive cash
flows. However, it is important to note that no general reduction of the
tax base is involved, compared to the case of an “ideal,” nondistortive
definition of taxable profit. From a theoretical point of view, this proposal
throws an interesting light on the distortion arising from corporate income
taxation, and its practical merits would seem to deserve serious attention.


References 

Brown, E. Cary. “Business-Income Taxation and Investment Incentives.” In
Income, Employment, and Public Policy: Essays in Honor of Alvin H. Hansen. New
York: Norton, 1948. Reprinted in Readings in the Economics of Taxation, edited
by R. A. Musgrave and C. S. Shoup. London: Allen & Unwin, 1959.

Domar, Evsey D., and Musgrave, Richard A. “Proportional Income Taxation
and Risk-Taking.” Q.J.E. 58 (May 1944): 388-422.

Fromm, Gary, ed. Tax Incentives and Capital Spending. Amsterdam: North-Holland,
1971.

Hall, Robert E., and Jorgenson, Dale W. “Tax Policy and Investment Behavior.”
A.E.R. 57 (June 1967): 391-414.

Harberger, Arnold C. “The Incidence of the Corporation Income Tax.” J.P.E.
70 (June 1962): 215-40.

Hirshleifer, Jack. “On the Theory of Optimal Investment Decision.” J.P.E. 66
(August 1958): 329-52.

Johansen, Leif. Public Economics. Amsterdam: North-Holland, 1965.

Leland, Hayne E. “Theory of the Firm Facing Uncertain Demand.” A.E.R. 62
(June 1972): 278-91.

Mieszkowski, Peter. “Tax Incidence Theory: The Effect of Taxes on the
Distribution of Income.” J. Econ. Literature 7 (December 1969): 1103-24.

Mossin, Jan. “Taxation and Risk-Taking: An Expected Utility Approach.”
Economica 35 (February 1968): 74-82.

Musgrave, Richard A. The Theory of Public Finance. New York: McGraw-Hill,
1959.

Samuelson, Paul A. “Tax Deductibility of Economic Depreciation to Insure
Invariant Valuations.” J.P.E. 72 (December 1964): 604-6.

Sandmo, Agnar. “Sparing og investering under sikkerhet og usikkerhet.”
Statsøkonomisk Tidsskrift 62 (September 1968): 143-63.

———. “Investment and the Rate of Interest.” J.P.E. 79 (November/December
1971): 1335-45. (a)

———. “On the Theory of the Competitive Firm under Price Uncertainty.”
A.E.R. 61 (March 1971): 65-73. (b)

Smith, Vernon L. “Tax Depreciation Policy and Investment Theory.” Internat.
Econ. Rev. 4 (January 1963): 80-91.

Sodersten, Jan. “Taxes and Investment Decisions.” Swedish J. Econ. 74 (September
1972): 329-43.

Theil, Henri. Optimal Decision Rules for Government and Industry. Amsterdam:
North-Holland, 1964.



Bargaining, Yield Curves, and Wage 
Settlements: An Empirical Analysis 


Yochanan Comay,* Arie Melnik,
and Abraham Sobotnik

Technion-Israel Institute of Technology


We began by noting the lack of empirical work on bargaining; to
partially redress this imbalance we drew a sample of strike-bound
bargaining cases and translated each side’s series of offers and
counteroffers into monetary terms, thus creating management and union “yield”
curves for each case. Some expectations as to the slope of these curves
were gleaned from the literature and incorporated into three simple
models: “linear,” “quadratic,” and “interactive.” Separate ordinary
least-squares regression of the offers and counteroffers for each side
and each strike showed that the explanatory power of even such simple
models was quite high, especially for management. One result was that
the union appears to be more responsive to the other side’s offers, while
management’s behavior can be explained mainly by the passage of
strike time.


I. Introduction 

The literature on bargaining models is by now profuse, integrated, and
rigorous, though still lacking in empirical content.¹ Some empirical
progress has been made through experimental work by Siegel and
Fouraker (1960) in their study of bargaining outcomes under various
controlled conditions. Empirical studies by Ashenfelter and Johnson
(1969) on United States data, by Pencavel (1970) on Britain, and
Vanderkamp (1970) on Canada have made casual use of the Hicksian
bargaining model. In comparison to the plethora of bargaining models,
there does, however, appear to be an imbalance between model building
and model testing in this field.

* Dr. Comay died on active duty on November 24, 1973. 

¹ Bargaining models have tended to develop by evolution, building one on another,
and would make an interesting study in the history of ideas (see Zeuthen 1930; Nash
1950; Harsanyi 1956; Bishop 1964; Cross 1965; and Saraydar 1965).

In what follows we hope to partially redress this imbalance by examining
the data from a sample of bargaining cases, where the bargaining
was conducted during an ongoing strike, and by then testing a simple
Hicks-like model. Section I outlines Hicks’s model and its variant to be
used here. The survey of the literature that follows yields some
expectations for such a model. Section II deals with the data and the translation
of offers and demands into monetary terms. Section III presents the
regression results both for the different yield curves and for predicting the
final settlement. Results and conclusions are summarized in Section IV.


II. The Model 

Hicks’s bargaining model (1968), developed in the midthirties and
reformulated in the sixties, has come under heavy attack over the years.²
His model consisted of two intersecting curves within a vertical wage-rate
axis and a horizontal length-of-strike axis. It was implied that the wage
settlement would occur at the intersection of an “employer’s concession
curve” and a “union resistance curve” which were assumed to possess
positive and negative slopes, respectively.

For management, any point along the curve represents the highest
wage they would be willing to pay rather than endure a strike of that
length. The opposite-sloped union-resistance curve is constructed
similarly. The final result is reproduced in figure 1 below.
The z is the wage the employer would have paid if not subject to union
pressure. Hicks did not explicitly claim the point of intersection as the
determinate bargaining solution, though the implication was clearly
present.

It can be shown that under a suitable reinterpretation, the point of
intersection in figure 1 will in fact represent the point of static equilibrium,
and that Hicks’s model does possess implications for the final settlement.
It does not, however, yield implications for the exact path of offers and
counteroffers during the negotiation period beyond the result that the
union’s demand curve over time will follow a nonincreasing slope, while
management’s path will be nondecreasing.³

There are several reasons which have been suggested elsewhere in the
literature for expecting concessionary behavior, namely, a generally
upward slope for management’s offer curve and a downward slope for
the union’s curve over the length of strike. Foremost among these is the
mounting damage the strike entails in terms of loss of wages, loss of
operating profits and market position, and damage to equipment through
idleness.⁴ There is also a fixed recurring cost associated with the
bargaining process itself.⁵

² See, for example, Shackle (1964).

³ As pointed out by a referee, the Hicksian model is phrased in terms of expected length
of strike. Bargaining during a strike can in fact be viewed as the process through which
expected time influences real strike time.

Another reason which has been advanced for expecting concessionary
behavior is the rise in the expected costs of continued disagreement as
the strike progresses.⁶ Costs which had only been potential in the original
prestrike reckoning now become a reality; also, as the strike progresses,
the parties revise upward their expectations as to the length of strike
associated with any persistent demand. Most important, Chamberlain
(1955) and Cartter (1959), as well as Schelling (1956), share the view
that compromise behavior is quite rational, and is in fact a potent
bargaining tool.

The above discussion indicates that one may expect a positively sloped
management curve and an opposite-sloped union curve with respect to

* See Walton and McKersie (1965, p. 31). Other losses enumerated by them are loss
of management status with upper management or stockholders, loss of goodwill with
labor, and loss of public image. Equivalent measures on the union’s side are loss of
institutional security due to declines in membership, and loss of goodwill with management
and with the public.

* See Cross (1965, p. 73).

* This point was made by Cartter, using a composite model incorporating elements
of Chamberlain’s and Pen’s bargaining models. It is worth noting parenthetically that
Cartter dismisses estimation problems “since we do not anticipate an attempted empirical
verification of this model of collective bargaining” (see Pen 1952; Chamberlain 1955;
and Cartter 1959, pp. 49-125).







strike time.* For simplicity we assume that the curves are linear.* For
any given union-management bargaining case we thus have the following
functions:

$$U(S) = U(0) - \alpha\, t(S), \tag{1}$$

$$M(S) = M(0) + \beta\, t(S), \tag{2}$$

where U(S) = a particular union’s demand at the Sth bargaining
session, with all elements of the demand translated into monetary terms;
U(0) = starting demand of the given union; t(S) = time, in days, from
start of strike to Sth session; α = rate of concession for the union, per
unit of time; and M(S), M(0), and β = equivalent measures for
management.

Agreement, A, is reached at time t(S*) when the curves intersect, or
M(S*) = U(S*):

$$U(0) - \alpha\, t(S^*) = M(0) + \beta\, t(S^*),$$

$$t(S^*) = \frac{U(0) - M(0)}{\alpha + \beta}. \tag{3}$$


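Substitution of (3) in (2) yields the equation for the wage settlement:

$$A = M(0) + \frac{\beta}{\alpha + \beta}\,[U(0) - M(0)] = \left(1 - \frac{\beta}{\alpha + \beta}\right) M(0) + \frac{\beta}{\alpha + \beta}\, U(0). \tag{4}$$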


The A is thus a weighted average of the starting offers, with the weight
dependent upon the rates of concession. The higher α in relation to β,
namely, the higher the relative rate of concession by the union, the closer
the final settlement to management’s position. Similarity of behavior,
namely, α = β, will result in a settlement halfway between the starting
positions.
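
The linear solution can be checked numerically. The sketch below is only an illustration of equations (3) and (4): the function name, argument names, and the sample numbers are assumptions made here, not part of the original study.

```python
def linear_settlement(u0, m0, alpha, beta):
    """Return (t_star, wage) for the linear yield curves of eqs. (1)-(2).

    u0, m0      -- starting union demand and management offer (hypothetical units)
    alpha, beta -- union and management concession rates per day
    """
    if alpha < 0 or beta < 0 or alpha + beta <= 0:
        raise ValueError("concession rates must be nonnegative and not both zero")
    if u0 < m0:
        raise ValueError("union demand should start at or above management's offer")
    t_star = (u0 - m0) / (alpha + beta)   # eq. (3): day on which the curves meet
    wage = m0 + beta * t_star             # eq. (2) evaluated at t_star, i.e. eq. (4)
    return t_star, wage


# Equal concession rates: the settlement lies halfway between the starting positions.
print(linear_settlement(30.0, 10.0, 0.5, 0.5))
# A faster-yielding union (alpha > beta) pulls the settlement toward management's offer.
print(linear_settlement(30.0, 10.0, 1.5, 0.5))
```

With equal rates the wage is the midpoint of the two starting positions; raising the union's rate relative to management's moves the result toward M(0), as the text states.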

The assumption behind the linear model is that the costs per unit of
time, or at least their subjective valuation, remain constant over the
length of the strike. This assumption could be unrealistic for several
reasons. For example, from the point of view of the union, the strike
would entail a loss of wages; increasing marginal utility of income would
make each passing strike day more costly. For management, the daily
losses, in terms of income, customers, etc., should rise with the passage


* The exact slopes will be determined by a host of factors. On the union side, one could
mention the size of the strike fund, the labor market situation, the perceived profit situation
of the employer, public support for the strike, etc. Management’s behavior will
probably be influenced by its level of inventories before the strike, its financial rating, the
structure of the market in which it operates, and the proportion of fixed costs out of total
costs.

* Severe data limitations restricted the scope of the models which were empirically
testable; the simplest forms were adopted in all cases, with a minimum number of
parameters.





of strike time. The simplest method to incorporate this would be through
quadratic yield curves, as in equations (5)-(6) below.*

$$U(S) = U(0) - \gamma\, t^2(S), \tag{5}$$

$$M(S) = M(0) + \delta\, t^2(S). \tag{6}$$

These imply an increasing rate of concession by both parties.
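
Setting the two quadratic curves equal gives a settlement time and wage in the same way as in the linear case. The sketch below assumes that `gamma` and `delta` denote the coefficients on squared strike time in equations (5) and (6); the function name and the sample numbers are illustrative assumptions.

```python
import math


def quadratic_settlement(u0, m0, gamma, delta):
    """Return (t_star, wage) where U(0) - gamma*t**2 meets M(0) + delta*t**2."""
    if gamma < 0 or delta < 0 or gamma + delta <= 0:
        raise ValueError("coefficients must be nonnegative and not both zero")
    if u0 < m0:
        raise ValueError("union demand should start at or above management's offer")
    t_star = math.sqrt((u0 - m0) / (gamma + delta))  # curves intersect here
    wage = m0 + delta * t_star ** 2                  # management curve at t_star
    return t_star, wage


t_star, wage = quadratic_settlement(30.0, 10.0, 0.05, 0.05)
print(round(t_star, 3), round(wage, 3))
```

Under these assumptions the settlement wage is again a weighted average of the starting positions, with the two quadratic coefficients taking the place of the linear concession rates; only the timing of agreement changes.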
As noted by Cross (1965), however, the choice of which specific offer
to make will be affected not only by the passage of time and its associated
costs, but also by the counteroffer made by the other party. In Cross’s
model, each side’s offer affects the other’s expectations as to his opponent’s
rate of concession. Rapid (or slow) concessions by one side will cause the
other to revise his expectations accordingly. The level of his demands
will be positively associated with his expectations as to the opponent’s
rate of concession; the opposite holds for the level of offers. The conclusion
Cross (1965, p. 80) reaches is that “the greater is the concession rate of
player II, the smaller will be the resulting concession rate of player I.”
In terms of our model, this implies a negative relation between α and β.
As shown in table 1, this conclusion does not appear to be supported
by the data; the relation between α and β is weakly positive.* It appears
that the yield behavior is different from that postulated by Cross, resulting
in yield curves which are close to mirror images of each other. In other
words, higher concessions by one party are matched by higher concessions
by the other, and vice versa. The simplest manner of incorporating this
element would be to make each side’s counteroffer contingent on the
other’s prior offer, as is done in equations (7) and (8) below.


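$$U(S) = U(0) - \varepsilon\, t(S)\, M(S - 1), \tag{7}$$

$$M(S) = M(0) + \zeta\, t(S)\, U(S - 1). \tag{8}$$

Equations (7) and (8) also imply an increasing rate of concession.*
Solving as before, we arrive at the settlement function (9):

$$A = \left[1 - \frac{\zeta\, U(S^* - 1)}{\varepsilon\, M(S^* - 1) + \zeta\, U(S^* - 1)}\right] M(0) + \left[\frac{\zeta\, U(S^* - 1)}{\varepsilon\, M(S^* - 1) + \zeta\, U(S^* - 1)}\right] U(0)$$

$$\;\; = \frac{\varepsilon\, M(S^* - 1)}{\varepsilon\, M(S^* - 1) + \zeta\, U(S^* - 1)}\, M(0) + \frac{\zeta\, U(S^* - 1)}{\varepsilon\, M(S^* - 1) + \zeta\, U(S^* - 1)}\, U(0). \tag{9}$$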
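
The settlement function of the interactive model can be sketched as follows. The names `eps` and `zeta` stand for the union and management coefficients in equations (7) and (8), and `u_prev`, `m_prev` for the offers made in the round before agreement; all of these names, and the sample values, are assumptions introduced for illustration.

```python
def interactive_settlement(u0, m0, eps, zeta, u_prev, m_prev):
    """Settlement wage from eq. (9), given the offers of the round before agreement."""
    union_rate = eps * m_prev    # union's concession per day, from eq. (7)
    mgmt_rate = zeta * u_prev    # management's concession per day, from eq. (8)
    total = union_rate + mgmt_rate
    if total <= 0:
        raise ValueError("at least one side must be conceding")
    weight_on_u0 = mgmt_rate / total
    # Weighted average of the starting positions, as in eq. (9).
    return (1.0 - weight_on_u0) * m0 + weight_on_u0 * u0


# Symmetric coefficients and equal prior offers give the midpoint.
print(interactive_settlement(30.0, 10.0, 0.01, 0.01, 20.0, 20.0))
```

The function has the same form as the linear settlement, except that each side's effective concession rate is scaled by the opponent's previous offer.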


* More complex logarithmic forms were tried and found to be inferior in terms of
explanatory power.

* The simple correlation between them is +.M.

* This can be seen by taking the second derivative of eqq. (7) and (8) with respect to
time, t.



III. Data


The New York State Bureau of Labor Statistics served as the source for
our sample of bargaining cases, which included only relatively large-scale
strikes (1,500 workers or more) that occurred in the same period
(1966-67), most of which took place in New York City. Various issues of
the New York Times were used to uncover information on the offer-counteroffer
rounds during the strikes. The cases finally selected for
empirical analysis included only those for which five such rounds could
be reconstructed; the cases are listed in the Appendix.

All offers and counteroffers were converted into monetary terms and
expressed as percentages of the prestrike mean wage.* In converting
nonwage items into monetary terms, certain guidelines were followed.
Reductions in work time were considered equivalent to wage increases.
For example, a decrease of the working week from 40 to 36 hours, with
the weekly wage remaining unchanged, was evaluated as a 10 percent
wage increase. Pensions and special retirement clauses, which involve
futurity, were discounted to the strike period using a discount rate of
4 percent and 20 years of remaining work time.* Where negotiations
were conducted over future wage increases to be granted during the
period of the contract, these were also discounted to the year of bargaining.
All offers and counteroffers were summarized, therefore, as percentages
of current wages.


IV. Results 

To test the adequacy of the above formulations, we restricted our sample
to strikes which encompassed five or more offer-counteroffer rounds
between the start of the strike and its settlement. The final sample
included eight bargaining cases. The results of ordinary least-squares
regression of the union and management offers for each strike according
to the linear, quadratic, and interactive models of equations (1), (2), (5),
(6), (7), and (8) are summarized in table 1.
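
A minimal sketch of the kind of regression described here, for the linear model only, is given below. The offer series are hypothetical numbers invented for illustration and are not taken from the paper's sample; the helper name `ols_line` is likewise an assumption.

```python
def ols_line(ts, ys):
    """Ordinary least squares fit of y = intercept + slope * t.

    Returns (intercept, slope, r_squared).
    """
    n = len(ts)
    if n < 2 or n != len(ys):
        raise ValueError("need two or more paired observations")
    t_bar = sum(ts) / n
    y_bar = sum(ys) / n
    sxx = sum((t - t_bar) ** 2 for t in ts)
    if sxx == 0:
        raise ValueError("all time values are identical")
    sxy = sum((t - t_bar) * (y - y_bar) for t, y in zip(ts, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * t_bar
    ss_tot = sum((y - y_bar) ** 2 for y in ys)
    ss_res = sum((y - intercept - slope * t) ** 2 for t, y in zip(ts, ys))
    r_squared = 1.0 - ss_res / ss_tot if ss_tot > 0 else 1.0
    return intercept, slope, r_squared


# Hypothetical five-round strike (illustrative values only), in percent of prestrike wage.
days = [0, 5, 10, 15, 20]
union_demands = [30.0, 29.0, 27.5, 26.0, 25.0]
mgmt_offers = [10.0, 12.0, 13.5, 16.0, 18.0]

u0, u_slope, u_r2 = ols_line(days, union_demands)
m0, beta, m_r2 = ols_line(days, mgmt_offers)
alpha = -u_slope                       # union concession rate, eq. (1)
t_star = (u0 - m0) / (alpha + beta)    # eq. (3)
settlement = m0 + beta * t_star        # eq. (4)
print(round(alpha, 3), round(beta, 3), round(settlement, 2))
```

The quadratic model could be fitted with the same helper by passing squared strike days as the regressor; the interactive model would use the product of strike time and the other side's previous offer.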

All three models perform better in explaining management’s offer
curve than in explaining the union’s. With the exception of the television
artists’ strike, all coefficients of determination for management are .80 or
higher. For the union, on the other hand, in four out of the eight strike
cases the coefficients of determination are below .61.

* Mean wage was calculated as the mean of beginning and senior workers’ salaries,
since no information was available on the seniority distribution of union members.

* Estimates were relatively insensitive to differing assumptions concerning the
remaining length of work time. Estimates were more sensitive to variations in discount rates;
however, due to the relative insignificance of the items concerned, the final percentage
estimate was not altered significantly by changes in the discount rates.




The quadratic model appears to be superior to the linear in explaining
the variance of the union offers. Thus, in four out of the eight cases, the
percentage of variance explained by this model is higher by 5 percent
or more, while in three cases the two models are similar. The union tends
to hold out for long periods and then yield rapidly toward the end of the
strike, thus creating concavity in its yield curve. What is most interesting,
however, is that the introduction of interactive elements in the union’s
function improves the fit in four of the bargaining cases; in the rest the
interactive model is about the same in explanatory power as the others.

The same cannot be said for management, where the interactive model
does not improve the fit significantly. In fact, all three models perform
about the same, except in the television artists’ case, where the quadratic
is clearly superior. Thus, there is some evidence that in management’s
case primarily the passage of strike time explains their offer behavior,
while the union is more responsive to the actual offers made by
management.

The accuracy of the models in actually predicting the final settlement
can be assessed by solving for these values according to the three models,
utilizing the results from table 1. It is unclear a priori which will perform
better, since this test incorporates both blades of the scissors, union and
management. The results are summarized in table 2.

The superiority of the interactive model in predicting the final settlement
is evident from the table. In half of the cases it improves the prediction
substantially, while in the other half it does as well as the linear.
Its relative success is not surprising in light of its greater utilization of
available information. Thus, for each round, the model uses information
on the other party’s behavior in the previous round to predict the current
offer. What is somewhat surprising is the closeness of the predicted and
actual outcome, even for such a skeletal model. In absolute terms the
greatest deviation is just under 5 percentage points, for the Board of
Education strike. In this case the union set its demand on the first day
of the strike and did not budge through 11 bargaining rounds, while the
Board of Education yielded continuously. The other cases range between
zero and 1.8 percentage points. As a percentage of the actual settlement
this means a deviation of 26 percent for the Board of Education, and
0-7 percent for the other cases.
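
The deviations quoted in this paragraph can be reproduced from the interactive-model rows of table 2. The sketch below hard-codes those rows as read from the table; the short strike labels and the helper name are choices made here for illustration.

```python
# (actual, interactive-model prediction), in percent of prestrike wage, from table 2.
TABLE2_INTERACTIVE = {
    "Ford": (5.8, 6.1),
    "Marine": (27.0, 28.8),
    "Five airlines": (15.4, 15.3),
    "Board of Education": (18.8, 23.7),
    "Pan American": (12.5, 12.7),
    "Television": (14.2, 14.2),
    "Moving": (12.2, 12.9),
    "Builders": (24.0, 23.3),
}


def deviations(table):
    """Map each strike to (difference in points, absolute deviation as % of actual)."""
    result = {}
    for strike, (actual, predicted) in table.items():
        diff = predicted - actual
        result[strike] = (round(diff, 1), round(100.0 * abs(diff) / actual, 1))
    return result


dev = deviations(TABLE2_INTERACTIVE)
worst = max(dev, key=lambda name: abs(dev[name][0]))
print(worst, dev[worst])
```

On these figures the Board of Education case has the largest gap, and every other case deviates by at most 1.8 points, or no more than about 7 percent of the actual settlement.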


V. Summary and Conclusions 

We began by noting the lack of empirical work on bargaining, especially
in light of the vast array of extant theoretical models. To partially redress
this imbalance, we drew a sample of strike-bound bargaining cases and
translated each side’s series of offers and counteroffers into monetary
terms, thus creating management and union “yield” curves for each case.





TABLE 2

Actual and Predicted Settlements, by Striking Party

                                      Settlement (%)
Strike and Model*           Actual      Predicted      Difference

Ford
    L                         5.8          6.1             0.3
    Q                                      3.6            -2.2
    I                                      6.1             0.3
Marine
    L                        27.0         28.9             1.0
    Q                                     26.1            -0.9
    I                                     28.8             1.8
Five airlines
    L                        15.4         15.5             0.1
    Q                                     10.1            -5.3
    I                                     15.3            -0.1
Board of Education
    L                        18.8         23.6             4.8
    Q                                     15.6            -3.2
    I                                     23.7             4.9
Pan American
    L                        12.5         13.3             0.8
    Q                                      5.7            -6.8
    I                                     12.7             0.2
Television
    L                        14.2         14.6             0.4
    Q                                     15.9             1.7
    I                                     14.2             0.0
Moving
    L                        12.2         14.6             2.4
    Q                                      7.0            -5.2
    I                                     12.9             0.7
Builders
    L                        24.0         22.5            -1.5
    Q                                     35.5            11.5
    I                                     23.3            -0.7

* L = “linear,” Q = “quadratic,” and I = “interactive.”


Some expectations as to the slope of these curves were gleaned from the
literature and incorporated into three simple models: “linear,” “quadratic,”
and “interactive.”

Despite the limited nature of the sample, there are several conclusions
which emerged even from an analysis of the raw data. Thus, the yield
behavior of the union appears to be different in nature from that of
management, and consists of a lengthy period of little or no concession
followed by rapid yielding in the last few days of the strike. Management
yields gradually along the period of the strike.

For both parties, however, the rate of concession increases during the
strike period, resulting in concession curves which are concave to the
origin. This could stem from the rise in the strike cost per day for both





parties as the strike progresses. One possible explanation for the union
could relate to the decline in the strike fund and the diminishing opportunities
for union members to purchase on credit. For management, the
losses increase not only through loss of sales but also through idle and
deteriorating equipment, loss of potential clients, etc. The other explanation
put forward in this paper is that the concavity also stems from
interactions between the bargaining parties. As was noted, the resultant
shape of the yield curves does not appear to be consistent with Cross’s
bargaining model.

Separate ordinary least-squares regression of the offers and counteroffers
for each side and each strike showed that the explanatory power of even
such simple models was quite high, especially for management. One
interesting result was that the introduction of interaction into the model
improved the specification for the union, while not significantly affecting
the results for management. The union thus appears to be more responsive
to the other side’s offers, while management’s behavior can be explained
mainly by the passage of strike time. We then compared the models from
the point of view of predicting the final settlement; here the superiority
of the “interactive” model emerged most clearly.


Appendix 

List of Bargaining Cases*

1. Ford Motor Co. vs. United Auto Workers (September 7, 1967).

2. Marine Towing and Transportation Association vs. Maritime United Marine
Division (Local 333, AFL-CIO) (January 30, 1967).

3. Five Airlines (United, TWA, Pan-Am, Eastern, National) vs. Machinists Union,
AFL-CIO (July 8, 1967).

4. New York City Board of Education vs. Teachers Union (Local 2) (September 11,
1967).

5. Pan American World Airways vs. Air Line Pilots Association (March 31, 1965).

6. ABC, CBS, and NBC vs. Federation of Television and Radio Artists (March 29,
1967).

7. Moving and storage companies in New York City and vicinity vs. International
Brotherhood of Teamsters (March 1, 1965).

8. Southern Tier Builders Association, Binghamton, New York vs. Bricklayers,
Masons, and Plasterers Int. (May 1, 1965).


References 

Ashenfelter, O., and Johnson, G. E. “Bargaining Theory, Trade Unions, and
Industrial Strike Activity.” A.E.R. 59 (March 1969): 35-49.

Bishop, R. “A Zeuthen-Hicks Model of the Bargaining Process.” Econometrica 32
(July 1964): 411-17.


* Date in parentheses refers to the first day of each strike.




Cartter, A. M. Theory of Wages and Employment. Homewood, Ill.: Irwin, 1959.

Chamberlain, N. W. A General Theory of Economic Process. New York: Harper &
Row, 1955.

Cross, J. G. “A Theory of the Bargaining Process.” A.E.R. 55 (March 1965):
67-94.

Harsanyi, J. C. “Approaches to the Bargaining Problem Before and After the
Theory of Games.” Econometrica 24 (April 1956): 144-57.

Hicks, J. R. The Theory of Wages. 2d ed. London: St. Martin’s Press, 1968.

Nash, J. F. “The Bargaining Problem.” Econometrica 18 (April 1950): 155-62.

Pen, J. “A General Theory of Bargaining.” A.E.R. 42 (March 1952): 24-42.

Pencavel, J. H. “An Investigation into Industrial Strike Activity in Britain.”
Economica (August 1970), pp. 239-56.

Sarayder, E. “Zeuthen’s Theory of Bargaining: A Note.” Econometrica 33
(December 1963): 803-13.

Schelling, T. C. “An Essay on Bargaining.” A.E.R. 46 (June 1956): 281-306.

Shackle, G. L. S. “The Nature of the Bargaining Process.” In The Theory of Wage
Determination, edited by J. T. Dunlop. London: Macmillan, 1964.

Siegel, S., and Fouraker, E. Bargaining and Group Decision Making. New York:
McGraw-Hill, 1960.

Vanderkamp, J. “Economic Activity and Strikes in Canada.” Indus. Relations 9
(February 1970): 215-30.

Walton, R. E., and McKersie, R. B. A Behavioral Theory of Labor Negotiations.
New York: McGraw-Hill, 1965.

Zeuthen, F. Problems of Monopoly and Economic Welfare. London: Routledge, 1930.




Interpretations of Departures from the 
Pareto Curve Firm-Size Distributions 


Yuji Ijiri and Herbert A. Simon

Carnegie-Mellon University


Empirical firm-size data show concavity when size is plotted against
rank (largest firm = rank 1) on a log-log scale, a departure from the
Pareto distribution which shows a straight line. Two reasons for this
departure are proposed. One is an exponential decay in the effect of a
unit increase in size upon the firm’s future growth. It is shown analytically
that the greater the decay rate, the greater the concavity. The
other is an effect of mergers and acquisitions. Based on Federal Trade
Commission data, the actual firm-size distribution in 1969 is compared
with a hypothetical one where all mergers and acquisitions in 1948-69
were “undone” to demonstrate their effect.


I. Concavity of the Firm-Size Distributions

It has been observed repeatedly that the size, s, of a firm and its rank, r,
in an industry or in an economy satisfy approximately the Pareto law
(see, for example, Steindl 1965):

$$s\, r^{\beta} = A, \tag{1}$$

where β and A are constants. The size may be measured by annual sales,
total assets, or the number of employees. The rank is based on the selected
size measure, the largest firm being assigned rank 1.

In a previous paper (Ijiri and Simon 1971), we used the constant β as
an indicator of the degree of concentration in the population because it
indicates the frequencies of large firms relative to smaller firms. For
example, if the rank of a firm X is twice the rank of a firm Y, the size of
X is $2^{-\beta}$ times the size of Y, since $s_X = A r_X^{-\beta} = A r_Y^{-\beta} 2^{-\beta} = 2^{-\beta} s_Y$.
The larger the β, the greater the relative size of a large firm (small rank)

The authors are professor of industrial administration and R. K. Mellon Professor of
Computer Science and Psychology, respectively, at Carnegie-Mellon University. This
project has been supported in part by a grant from the National Science Foundation.



compared with a smaller firm (large rank). Clearly, A is the size of the
largest firm.

When log s is plotted against log r, the relation is a straight line:

$$\log s = \log A - \beta \log r. \tag{2}$$

Thus, β is the slope (in absolute value) of the line on a double-log scale.
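
A small numerical sketch may help fix the meaning of the Pareto exponent. The function names and the parameter values below are illustrative assumptions; `beta` and `A` play the roles of the two constants in equation (1).

```python
import math


def pareto_size(rank, A, beta):
    """Size of the firm holding a given rank when size * rank**beta = A."""
    if rank < 1:
        raise ValueError("rank 1 is the largest firm")
    return A * rank ** (-beta)


def loglog_slope(r1, r2, A, beta):
    """Slope of log(size) against log(rank) between two ranks."""
    s1 = pareto_size(r1, A, beta)
    s2 = pareto_size(r2, A, beta)
    return (math.log(s2) - math.log(s1)) / (math.log(r2) - math.log(r1))


A, beta = 100.0, 0.8                       # hypothetical values
ratio = pareto_size(20, A, beta) / pareto_size(10, A, beta)
print(round(ratio, 4), round(2 ** (-beta), 4))   # doubling the rank scales size by 2**(-beta)
print(round(loglog_slope(10, 831, A, beta), 4))  # equals -beta for any pair of ranks
```

Because the log-log slope is the same between any two ranks under the Pareto law, curvature in an empirical plot shows up as a slope that changes with the ranks chosen.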

The empirical data on firm sizes fit this distribution reasonably well
as a first approximation (Steindl 1965). However, we often encounter
firm-size distributions for the economy that have some concave (downward)
curvature. For example, the Fortune data for 1969 (Fortune 1970a,
1970b) show a considerable curvature for the size distribution based on
amount of assets (total assets after accumulated depreciation) for the 831
largest industrial firms. This distribution is shown in figure 1.* The straight
line through the empirical data is the Pareto curve fitted to the tenth
largest firm and the eight hundred thirty-first firm.* (The first nine
firms are ignored in fitting the curve in order to take advantage of the
statistical averaging incorporated in the cumulative distribution function
except at its extreme tail.)

As shown in figure 1, the actual size of a firm around rank 100 is
almost twice as large as the “theoretical” size predicted from the straight
line. That is, actual assets of the one hundredth firm are $1.01 billion
while “theoretical” assets are $0.50 billion. Of course, many lines can
be drawn on the figure as “theoretical” distributions. The above illustration
is intended only to show the significance of the curvature of the
actual firm-size distributions.*

The ratio of the actual size to the theoretical size of a firm with a given
rank may be called a “size variance.” Then, the downward concavity
of the distribution shows an upward size variance for middle-rank firms
relative to small- or large-rank firms, where the size variance is measured
from the Pareto curve.

Faced with this rather significant departure from the theoretical
distribution, one might abandon the Pareto curve and look for other
distributions which, when the cumulative frequencies are plotted on a
double-log scale, show curvature similar to the empirical distributions.*

* Empirical size-rank distributions based on sales size and on assets size are almost
indistinguishable except for the difference in the intercept, as can be seen by plotting
data from Fortune (1970a, 1970b).

* It has been shown (Simon 1955) that the Pareto curve can be derived from Gibrat’s
law, which states that the percentage growth rate of a firm is distributed independently
of its size.

* The selection of the tenth and the eight hundred thirty-first firms is arbitrary. However,
we use the “theoretical” distribution only for purposes of illustration. Our conclusions,
especially with regard to the effect of mergers and acquisitions on the size
distribution, would hold equally well for other methods of fitting a Pareto curve to the
data. We have therefore used the simplest possible method of fitting, drawing the curve
through two points in the cumulative distribution.

* The lognormal and negative binomial are examples of such distributions.



Fig. 1. Solid curve: actual size-rank relationship as of December 31, 1969. Broken
curve: estimated size-rank relationship as of December 31, 1969, when the effects of
mergers and acquisitions in 1948-69 were eliminated. Solid line: theoretical size-rank
relationship (the Pareto curve) for actual data (connecting the sizes of the tenth and
eight hundred thirty-first firms). Broken line: same as solid line for estimated “mergerless”
data. Axis label: Assets (billions).


However, the Pareto curve is derivable from simple and economically
plausible assumptions as shown in Simon (1955), namely, size independence
of percentage growth rate (Gibrat’s law) and constancy of the
entry rate. Both assumptions have significant economic implications.
Hence, it seems to be profitable to attempt to retain the Pareto curve as a
first approximation and to explain the deviations from this distribution
by considering additional economic factors that violate the assumptions
of the derivation, and hence could produce perturbations.




The purpose of this paper is, therefore, to explore the ways of explaining
the size variance by economic processes that can be observed empirically.
We consider two alternative explanations. The first extends an earlier
paper (Ijiri and Simon 1964), which analyzed by simulation the effects
of autocorrelation of growth, giving a firm with a history of recent growth
a better chance for further growth than a firm of the same size whose
growth had taken place in the distant past.

Under Gibrat’s law, a unit increase in the size of a firm contributes
to its future growth, since the base to which the percentage growth rate
is applied is thereby increased by one unit. But the effect of a unit increase
most likely diminishes as time goes on. In the simulation model in
Ijiri and Simon (1964), this pattern of diminishing effects is represented
by an exponential decay. The simulation model showed that the greater
the decay (or discount) rate, the greater the concavity of the distribution.
This decay rate has a significant economic meaning: it measures the
effect of growth momentum and of the stability of the economic environment.
For example, when consumer brand loyalty is high and market
share is stable, recent growth has a long carry-over effect. When consumer
brand loyalty is low and market shares are unstable, recent growth has
only a short-run effect upon the firm’s future growth.

We are now able to derive analytically this effect of the decay rate,
which we estimated by simulation in our earlier paper. In the next
section, we summarize our analytical characterization of this model,
leaving mathematical derivations to the Appendix.

In Section III, we consider a second economic process that may contribute
to the concavity of the distribution, namely, the effect of mergers
and acquisitions. In an earlier paper (Ijiri and Simon 1971), we analyzed
the effect of mergers and acquisitions on the concentration measure β.
When we wrote that paper, we had available only a very limited amount
of data on mergers and acquisitions. We now have analyzed a more
extensive set of data over a wider period, based on the Federal Trade
Commission’s report covering mergers and acquisitions of manufacturing
and mining companies between 1948 and 1969. Contrary to our previous
findings, the results of this analysis show that mergers and acquisitions
have contributed to a noticeable extent to increasing the concentration
measure and increasing the concavity of the firm-size distribution. We
present our results of the merger analysis in Section III.


II. Autocorrelated Growth

The curvature of a firm-size distribution may be quantified in many
different ways when we fit a curve to empirical data. It is possible to fit

$$\log s = \log A - \beta \log r + c(\log r)^2, \tag{3}$$

where the coefficient c depends on the curvature. The distribution is
convex downward if c is positive and concave downward if c is negative.

Alternatively, we might select a set of pairs of ranks i and j and measure
a representative value (mean, median, etc.) of the ratios for each pair:

$$d = \frac{s_k}{s_i^{a}\, s_j^{1-a}}, \tag{4}$$

where s_k is the size of the rank k firm (i < k < j) and a satisfies

$$k = i^{a}\, j^{1-a}. \tag{5}$$

The denominator in (4) is the theoretical value for the size of the kth
firm under the Pareto law, and the numerator is its actual value.

A stronger theoretical basis for quantifying and interpreting the
curvature can be obtained by using a parameter that we introduced in an
earlier paper (Ijiri and Simon 1964). In that paper, we demonstrated
by a simulation that the following process leads to a distribution which
approximates the Pareto curve but with a curvature.

1. At the start of the process, namely, at epoch 0 (a specific point in
time), we have several firms in the population (an industry or an
economy), each firm having a unit (sales) size* and weight 1. The weight
is an indicator of the firm’s growth potential relative to other firms.
Placing several firms in the population is an initialization step for the
model. It has been shown that after large numbers of epochs the resulting
distribution is insensitive to the initialization (see Ijiri and Simon 1964).

2. At each epoch t, t = 1, 2, 3, ..., one unit of sales is given to a
randomly selected firm.

3. There is a constant probability α that the sales unit at epoch t
goes to a new firm. If this happens, a new firm with size 1 is added to
the population. This firm is treated as an old firm in all succeeding epochs.

4. If the sales unit at epoch t does not go to a new firm, it goes to an
old firm. The probability of a given old firm receiving the sales unit is
proportional to its weight; that is, it is equal to the firm’s weight divided
by the sum of weights of all firms in the population. The size of the
selected firm is increased by one unit.

5. After a new firm is created or an old firm is selected among firms
in the population, the weight of each old firm in the population is discounted
by multiplying its current weight by γ. Then, if a new firm is
created, its weight is set equal to 1; if an old firm is selected, its weight
(after the above discounting) is increased by 1.
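
The five steps above can be sketched as a short simulation. This is an illustrative reading of the process, not the authors' original program: the function name, the default of five initial firms, and the use of a seeded pseudo-random generator are assumptions, with `alpha` standing for the new-firm probability of step 3 and `gamma` for the discount factor of step 5.

```python
import random


def simulate_firm_sizes(epochs, alpha, gamma, initial_firms=5, seed=0):
    """Run steps 1-5 for `epochs` sales units; return (sizes, weights)."""
    if not 0 < alpha < 1 or gamma <= 0 or initial_firms < 1:
        raise ValueError("need 0 < alpha < 1, gamma > 0, initial_firms >= 1")
    rng = random.Random(seed)
    sizes = [1] * initial_firms        # step 1: unit size for each initial firm
    weights = [1.0] * initial_firms    # step 1: unit weight for each initial firm
    for _ in range(epochs):            # step 2: one sales unit per epoch
        if rng.random() < alpha:       # step 3: the unit goes to a new firm
            chosen = None
        else:                          # step 4: old firm chosen in proportion to weight
            chosen = rng.choices(range(len(sizes)), weights=weights, k=1)[0]
            sizes[chosen] += 1
        weights = [w * gamma for w in weights]   # step 5: discount every old firm
        if chosen is None:
            sizes.append(1)            # new firm enters with size 1 ...
            weights.append(1.0)        # ... and weight 1
        else:
            weights[chosen] += 1.0     # selected firm gains 1 after discounting
    return sizes, weights


sizes, weights = simulate_firm_sizes(epochs=500, alpha=0.1, gamma=0.95, seed=1)
ranked = sorted(sizes, reverse=True)   # ranked[0] is the rank-1 (largest) firm
print(len(ranked), ranked[:5])
```

Plotting the logarithm of `ranked` against the logarithm of rank for several values of `gamma` is one way to examine the curvature discussed in the text; much longer runs than the 500 epochs used here would be needed for a stable picture.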

In our earlier paper (Ijiri and Simon 1964), we showed by simulation
that the resulting firm-size distribution exhibits a downward concavity if
γ < 1, is a straight line if γ = 1, and has a downward convexity

* This can be stated in terms of a unit of assets, but we shall use sales units for the
sake of ease in explanation.





if γ > 1. The curvature is greater as |γ − 1| is greater. (For the case of
γ = 1, namely, without discounting, see Simon 1955.) Therefore, γ may
be used to quantify the curvature observed in an empirical distribution
by fitting a theoretical distribution generated under a given γ.

The parameter γ has a simple economic interpretation. It indicates
the relative significance of recent sales as compared with earlier sales
in generating future sales. Both recent sales and prior sales contribute
in generating future sales, but the latter contribute less than the former
if γ < 1 and more than the former if γ > 1. Thus, if γ < 1, a firm experiencing
growth recently has a better chance of growing than a firm
of the same size that had its growth in earlier periods.

We now show these points analytically for the case where 0 < γ < 1
(distribution concave downward).

Let W_t be the sum of weights of all firms in the population after step 5
for epoch t has been completed. Then,

$$W_{t+1} = \gamma W_t + 1, \tag{6}$$

as described in step 5. Then, for a sufficiently large t,*

$$W_t = 1 + \gamma + \gamma^2 + \cdots + \gamma^t \cong \frac{1}{1 - \gamma} \quad (0 < \gamma < 1). \tag{6a}$$

Let us then consider a firm that was created at epoch t. The size of
this firm at epoch t is 1 and so is its weight. Then, the probability that
this firm will experience a sale at epoch (t + 1) is the probability that
the sale goes to an old firm (1 − α) times the probability that this
particular firm experiences the sale, [1 ÷ 1/(1 − γ) = 1 − γ], or a
total probability, (1 − α)(1 − γ), which we denote by δ. Thus, the
probability that this firm does not get the sale at epoch (t + 1) is (1 − δ).
Now assume that this firm did not get the sale at epoch (t + 1). The
weight is now discounted from 1 to γ for epoch (t + 2). Thus, the
probability of getting the sale at epoch (t + 2) is (1 − α)(1 − γ)γ = δγ,
while the probability of not getting the sale at epoch (t + 2) is (1 − δγ).
Similarly, the probability of not getting the sale at epoch (t + 3) is
(1 − δγ²).

Therefore, the probability that this firm will not get any sale after
epoch t (that is, the probability u_1 that the size of this firm remains at 1
forever) is

$$u_1 = (1 - \delta)(1 - \delta\gamma)(1 - \delta\gamma^2) \cdots. \tag{7}$$

* More precisely, W_t = 1 + γ + γ² + ··· + kγ^t, where k is the number of firms
created at initialization with size 1 and weight 1. Clearly, k has only a small effect when
t is large.




For γ close to but less than 1, δ = (1 − α)(1 − γ) is close to 0, and
so is δγ^t for t = 1, 2, ....* Then, we may use the approximation

$$\ln(1 + x) \cong x, \tag{8}$$

obtaining

$$\ln u_1 \cong -\delta(1 + \gamma + \gamma^2 + \cdots) = -(1 - \alpha)(1 - \gamma)/(1 - \gamma) \tag{9}$$

or

$$u_1 \cong e^{-(1 - \alpha)}. \tag{10}$$

In general, the probability of a firm with a present weight w not getting
any sales in the future is easily shown to be

$$u \cong e^{-(1 - \alpha) w} \tag{11}$$

by applying the above argument. Equation (10) is a special case for
w = 1.
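
The quality of this approximation can be examined numerically by comparing a truncated version of the infinite product with the exponential form. The sketch below is an assumption-laden illustration: the function names and the truncation at 10,000 factors are choices made here, and `alpha`, `gamma`, and `w` stand for the new-firm probability, the discount factor, and the firm's present weight.

```python
import math


def prob_no_further_sales(alpha, gamma, w=1.0, terms=10_000):
    """Truncated product (1 - d*w)(1 - d*gamma*w)(1 - d*gamma**2*w)...,
    where d = (1 - alpha)(1 - gamma)."""
    delta = (1.0 - alpha) * (1.0 - gamma)
    factor = delta * w
    log_u = 0.0
    for _ in range(terms):
        if factor >= 1.0:
            raise ValueError("per-epoch sale probability must stay below 1")
        log_u += math.log1p(-factor)   # accumulate in logs for numerical stability
        factor *= gamma                # the weight is discounted each epoch
    return math.exp(log_u)


def approx_no_further_sales(alpha, w=1.0):
    """Exponential approximation exp(-(1 - alpha) * w)."""
    return math.exp(-(1.0 - alpha) * w)


for g in (0.99, 0.95, 0.5):
    print(g, round(prob_no_further_sales(0.1, g), 4), round(approx_no_further_sales(0.1), 4))
```

Because ln(1 − x) lies below −x for 0 < x < 1, the exact product is always somewhat smaller than the exponential form, and the gap widens as the discount factor moves away from 1.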

The implication of the expression in (11) for the firm-size distribution
is this. First, u_1 is the expected asymptotic proportion of the unit-sized
firms among all firms. In (10), u_1 depends only on α and is independent
of γ.

Next, consider a firm of size s and weight w. Here,

$$w = \sum_{k=0}^{\infty} \lambda_k \gamma^k, \tag{12}$$

where λ_k = 1 if the firm received the sale at epoch (t − k) and λ_k = 0
otherwise. The sequence λ_0, λ_1, ..., should have exactly s 1’s. Therefore,
for any given history of a firm (given {λ_k}),

$$\frac{dw}{d\gamma} > 0. \tag{13}$$

But, the probability of this firm not receiving any more sales is given 
by (11), namely, « = (1 - <5w)(l - ayii;)(l - dy^w) £ r-O-*** 
Hence, 

du du d\nu ™ . ,,,, t , , „ 

— = — - - — “ « E - <5y7(J - ^ < 0 (14) 

dw dm u dw TTo 

Thus, from (13) and (14) 

^ < 0 (15) 

dy 

^7 If $\gamma$ (< 1) is not close to 1, $\delta\gamma^i$ becomes close to zero except for the first few terms.
In the simulation in Ijiri and Simon (1964), it was noted that even a mild departure of
$\gamma$ from 1, such as 0.95, is sufficient to create a considerable curvature in the distribution.

This means that the probability of having further sales $(1 - u)$ for
old firms becomes smaller as $\gamma$ becomes smaller (namely, as discounting
becomes heavier). Growth beyond size 1 becomes more and more
difficult as weights are discounted more heavily (or as $\gamma$ becomes smaller).

Furthermore, it can easily be seen from (12) that, on the average, as
the size $s$ of a firm increases, the effect of reducing $\gamma$ on $w$ becomes more
severe, since there are more terms in the expression for $w$ that are reduced
as $\gamma$ becomes smaller. Thus, on the average, reducing $\gamma$ has a greater
effect on the probability of further growth as size $s$ becomes larger.
Therefore, not only is the probability of further growth for old firms
decreased as $\gamma$ becomes smaller, but also the decrease in probability is
greater the larger the firm.
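
The role of the sale history in (12) can be sketched numerically. The example below is only an illustration under stated assumptions: the two histories (all sales recent versus all sales remote), the 40-epoch window, and the parameter values are hypothetical, and the probability of a further sale uses the approximation in (11).

```python
import math


def weight(history, gamma):
    """Eq. (12): sum of gamma**k over epochs k ago at which a sale occurred."""
    return sum(a * gamma ** k for k, a in enumerate(history))


def prob_further_sale(history, alpha, gamma):
    """1 - u, with u approximated by exp(-(1 - alpha) * w) as in (11)."""
    return 1.0 - math.exp(-(1.0 - alpha) * weight(history, gamma))


def recent_history(size, length=40):
    """Hypothetical firm whose `size` sales all fall in the latest epochs."""
    return [1] * size + [0] * (length - size)


def remote_history(size, length=40):
    """Hypothetical firm whose `size` sales all fall in the earliest epochs."""
    return [0] * (length - size) + [1] * size


if __name__ == "__main__":
    alpha = 0.1
    for size in (5, 20):
        drop = weight(recent_history(size), 0.99) - weight(recent_history(size), 0.90)
        print("size", size, "loss of weight when gamma falls from 0.99 to 0.90:", drop)
    print(prob_further_sale(recent_history(3), alpha, 0.9),
          prob_further_sale(remote_history(3), alpha, 0.9))
```

With these inputs the loss of weight from heavier discounting is larger for the larger firm, and of two equal-sized firms the one with recent sales keeps the higher chance of a further sale.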

From this analysis, we can conclude that as $\gamma$ becomes smaller the
concavity of the firm-size distribution on a double-log scale is increased.

In the Appendix, we present a further analysis of the effect of $\gamma$ on the
concavity from the viewpoint of the mean and the variance of the
distribution of firm sizes.

The empirical support for this autocorrelation between recent growth
and future growth (in our analysis this means $\gamma < 1$) has been given in
Ijiri and Simon (1967). There, an excess growth ratio of a firm in 1954-58
over and above the average growth ratio of the economy is definitely
carried over to the subsequent period in 1958-62 but only in proportion
to the cube root of the original excess growth ratio.

Empirically, this decay effect appears to be quite plausible. When
two firms of equal size are compared, the one that grew more recently
seems to have a better chance for growth than the other whose growth
occurred in the remote past. Growth momentum is not likely to change
suddenly.

Thus, the existence of concavity in the firm-size distribution does not
seem to be a good reason for abandoning the Pareto curve but a good
reason for modifying the Pareto curve by introducing the autocorrelation
in the growth process.


III. Mergers and Acquisitions

Let us now turn to the second economic factor that may have contributed
to creating concavity in the firm-size distribution, namely, the effects of
mergers and acquisitions.

In our previous paper (Ijiri and Simon 1971), we analyzed the steady-state
distribution of firm sizes in an economy in the following two steps.
A first process selects firms that disappear as a result of mergers and
acquisitions. Then, the assets of all disappearing firms are pooled and
allocated to surviving firms. This separation into two steps allowed us
to analyze data that were not detailed enough to make a correlation
analysis of the relation between the sizes of acquiring firms and the sizes
of acquired firms.





The theoretical model to which the data were compared in Ijiri and
Simon (1971) was based on the following two assumptions: (1) the
probability of a firm disappearing is independent of its size, and (2) the
pooled assets of disappearing firms are allocated among surviving firms
in such a way that the percentage increase due to allocation in the pooled
size of the surviving firms in a given size range is independent of the size
range. The empirical data we examined in Ijiri and Simon (1971) were
not extensive enough to detect any significant departure from the above
assumptions. These data referred to mergers and acquisitions that
occurred in 1956-57, when mergers and acquisitions were not as frequent
as in the 1960s.

We obtained and examined more detailed data covering the period
1948-69.^8 After having analyzed this new set of data to retest the model,
we found significant departures from the above assumptions, sufficient
to produce some curvature of the firm-size distribution.

Tables 1 and 2 are based on the Federal Trade Commission's publication
Large Mergers in Manufacturing and Mining, 1948-1969 (1970)
and the Fortune data (1970a, 1970b) on assets of the largest industrial
corporations as of December 31, 1969. In table 1, we assemble the basic
data needed for estimating the "merger-free" distribution, that is, the
distribution which would be obtained from the actual 1969 data had no
mergers and acquisitions occurred in the 1948-69 period. We can then
see the effect of mergers and acquisitions on the firm-size distribution by
comparing the (estimated) merger-free and the (actual) postmerger
distributions. Table 1 also tests the assumption (1 above) that the
probability of disappearance of firms by mergers and acquisitions is
independent of the size.

In table 1, column 1 shows the frequencies of firms in the indicated
asset size ranges as of December 31, 1969, based on the Fortune data.
Columns 2, 3, and 4 show the frequencies of firms involved in mergers and
acquisitions in 1948-69 as reported in the Federal Trade Commission
report (1970). They are classified in three ways, based on postmerger
size (col. 2), the premerger size of the acquiring firm (col. 3), and the
premerger size of the acquired firm (col. 4).

Now suppose that all mergers in 1948-69 had occurred at once just 

^8 There are a number of differences between the data in this present study and those in
the previous one (Ijiri and Simon 1971) that complicate our task of interpreting the
findings. The present study covers a longer time period and more firms, enabling us,
for example, to classify firms with 10 size categories instead of the four categories of the
earlier study. Size is measured in the present study by assets and in the previous study by
sales, but we do not believe that this difference changes matters very much. It is entirely
possible that the Pareto curve provides a better approximation for the earlier period
both because mergers were less frequent and because those that occurred were distributed
in closer conformity to the assumptions of the model than the more frequent mergers of
the 1960s were.



TABLE 1

Frequencies of Firms Classified by Asset Sizes

Column definitions:
(1) Number of large firms in Fortune 1,000 in 1969
(2) Number of firms involved in mergers, 1948-69, classified by postmerger size of combined firms
(3) Number of firms involved in mergers, 1948-69, classified by premerger size of acquiring firms
(4) Number of firms involved in mergers, 1948-69, classified by premerger size of acquired firms
(5) Estimated number of firms if effects of mergers are eliminated, (1) - (2) + (3) + (4)
(6) Rate of disappearance, (4)/(5)

Size class and asset
size range ($ billion)       (1)     (2)     (3)     (4)     (5)     (6)
 1) Over 12.8                  2     ...     ...     ...       2    0.00
 2) 6.4-12.8                   7       4       4     ...       7    0.00
 3) 3.2-6.4                   11      14      11     ...       8    0.00
 4) 1.6-3.2                   33      36      31       1      29    0.03
 5) 0.8-1.6                   70      94      83       4      63    0.06
 6) 0.4-0.8                   95     147     127       9      84    0.11
 7) 0.2-0.4                  137     184     172      20     145    0.14
 8) 0.1-0.2                  214     206     171      83     262    0.32
 9) 0.05-0.1                 262     159     160     139     402    0.35
    Subtotal                 831     844     759     256   1,002     ...
10) Below 0.05               ...     146     231     734     ...     ...
    Total                    ...     990     990     990     ...     ...




TABLE 2

Aggregate Assets of Firms Classified by Asset Sizes ($ Billion)

Column definitions:
(1) Fortune 1,000 in 1969
(2) Firms involved in mergers, 1948-69: postmerger firms
(3) Firms involved in mergers, 1948-69: acquiring firms
(4) Firms involved in mergers, 1948-69: acquired firms
(5) Estimated without mergers, (1) - (2) + (3) + (4)
(6) Surviving firms, (5) - (4)
(7) Acquired firms' assets classified by size of acquiring firms
(8) Rate of growth, (7)/(6)

Size class and asset
size range ($ billion)        (1)      (2)      (3)      (4)      (5)      (6)      (7)      (8)
 1) Over 12.8               32.36      ...      ...      ...    32.36    32.36      ...     0.00
 2) 6.4-12.8                54.61    39.09    38.89      ...    54.41    54.41     0.20     0.00
 3) 3.2-6.4                 49.86    56.87    45.20      ...    38.19    38.19     0.91     0.02
 4) 1.6-3.2                 72.56    78.03    69.32     1.85    65.70    63.85     4.95     0.08
 5) 0.8-1.6                 82.07   106.66    93.65     4.23    73.29    69.06     8.52     0.12
 6) 0.4-0.8                 53.23    83.78    71.97     5.24    46.66    41.42    10.28     0.25
 7) 0.2-0.4                 38.71    54.67    50.74     6.11    40.89    34.78    11.43     0.33
 8) 0.1-0.2                 29.69    30.76    24.95    11.39    35.27    23.88     7.77     0.33
 9) 0.05-0.1                18.54    12.21    11.80     9.53    27.66    18.13     5.12     0.28
    Subtotal               431.63   462.07   406.52    38.35   414.43   376.08    49.18      ...
10) Below 0.05                ...     5.51     6.18    16.53      ...      ...     5.70      ...
    Total                     ...   467.58   412.70    54.88      ...      ...    54.88      ...





before the close of the year 1969. This is not a totally unrealistic assumption,
since by far the greatest portion of the mergers are concentrated
in the last 2-3 years of the 20-year period. We can then calculate the
size distribution of firms that would result if all such mergers and
acquisitions were "undone" at a single moment in time, as if the merger
movie were suddenly and swiftly run backward.

Take, for example, the size class 5 of firms with assets of $0.8-$1.6
billion. As of December 31, 1969, the Fortune data show 70 firms in this
size class. We want to estimate what this number would be if mergers and
acquisitions were undone all at once. To do so, we first note that in this
size class there were 83 instances where firms acquired other firms and
four where firms were acquired by other firms. On the other hand, there
were 94 merger and acquisition instances where firms, after merger,
belonged to this size class. Hence, there was a net gain of seven
(94 - 83 - 4 = 7) in the number of firms in this class by reason of mergers
and acquisitions. Since the actual number of firms in this class in 1969
is 70, we estimate that 70 - 7 = 63 would have been in existence in
1969 had there been no mergers, as stated in column 5. By taking the
cumulative frequencies based on the data in column 5 and plotting the
result in figure 1 as a broken curve, we obtained an estimated size-rank
relationship when all mergers and acquisitions are "undone." We shall
come back to this point again shortly.

Finally, the last column, column 6, shows the rate of
disappearance of firms in each size class by mergers and acquisitions over
the 20-year period. From this we can conclude that smaller firms have a
higher chance of being absorbed by mergers and acquisitions, contrary to
the assumption (1) of equal probability of disappearance.
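
The arithmetic behind columns 5 and 6 of table 1 can be sketched as follows. The row values are those of table 1 for size classes 1 through 9; treating the blank cells as zero and the names `TABLE1` and `merger_free_counts` are assumptions made for this illustration.

```python
# (size class, col. 1, col. 2, col. 3, col. 4); blank cells are entered as 0
TABLE1 = [
    ("1) Over 12.8", 2, 0, 0, 0),
    ("2) 6.4-12.8", 7, 4, 4, 0),
    ("3) 3.2-6.4", 11, 14, 11, 0),
    ("4) 1.6-3.2", 33, 36, 31, 1),
    ("5) 0.8-1.6", 70, 94, 83, 4),
    ("6) 0.4-0.8", 95, 147, 127, 9),
    ("7) 0.2-0.4", 137, 184, 172, 20),
    ("8) 0.1-0.2", 214, 206, 171, 83),
    ("9) 0.05-0.1", 262, 159, 160, 139),
]


def merger_free_counts(rows):
    """Return (label, estimated merger-free count, rate of disappearance) per class."""
    result = []
    for label, in_1969, postmerger, acquiring, acquired in rows:
        estimated = in_1969 - postmerger + acquiring + acquired  # col. 5
        rate = acquired / estimated                              # col. 6
        result.append((label, estimated, rate))
    return result


if __name__ == "__main__":
    for label, estimated, rate in merger_free_counts(TABLE1):
        print(f"{label:<14}{estimated:>6}{rate:>8.2f}")
```

Running the sketch should reproduce the column 5 and column 6 entries of table 1, including the 63 firms estimated for size class 5.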
To test the assumption (2), on the size independence of growth rate by
mergers, we prepared table 2. Column 1 was taken from the Fortune data,
adding the assets of all firms belonging to a given size class. Columns
2, 3, and 4 were prepared in the same way as in table 1 except that they
refer to aggregate assets instead of frequencies.

Consider, again, the size class 5. The table shows that the aggregate
assets of the postmerger firms belonging to this class are $106.66 billion
while the aggregate assets of the acquiring and acquired companies are
$93.65 and $4.23 billion, respectively. Thus, as a result of mergers and
acquisitions, this size class gained $8.78 billion in aggregate assets
(106.66 - 93.65 - 4.23). Since the actual aggregate assets based on
the Fortune data are $82.07 billion, we estimate the "mergerless" aggregate
size of this class to be $73.29 billion, as shown in column 5. Other figures
in this column were prepared in the same way.

From data in column 5, we eliminated the aggregate assets of acquired
firms belonging to the class (col. 4). The result is shown in column 6.
We would like to see how these aggregate assets of acquired firms are
redistributed among the surviving firms. Column 7 shows the distribution
of the aggregate assets of acquired firms based on the premerger size of
the acquiring firms by which the acquired firms were absorbed. Column 8
gives the resulting rate of growth, (7)/(6). Thus, for
example, the size class 5 increased its aggregate assets by 12 percent as a
result of reallocation of assets of acquired firms. From the data in the last
column, it seems clear that the assumption (2) of size-independent
growth rate by reallocation is not satisfied.^9
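
The same bookkeeping for table 2 can be sketched in a few lines. The figures are the table 2 entries for size classes 1 through 9 in $ billion; entering blank cells as zero, rounding to two decimals, and the names `TABLE2` and `growth_by_reallocation` are assumptions of this illustration.

```python
# (size class, col. 1, col. 2, col. 3, col. 4, col. 7) in $ billion; blanks entered as 0
TABLE2 = [
    ("1) Over 12.8", 32.36, 0.0, 0.0, 0.0, 0.0),
    ("2) 6.4-12.8", 54.61, 39.09, 38.89, 0.0, 0.20),
    ("3) 3.2-6.4", 49.86, 56.87, 45.20, 0.0, 0.91),
    ("4) 1.6-3.2", 72.56, 78.03, 69.32, 1.85, 4.95),
    ("5) 0.8-1.6", 82.07, 106.66, 93.65, 4.23, 8.52),
    ("6) 0.4-0.8", 53.23, 83.78, 71.97, 5.24, 10.28),
    ("7) 0.2-0.4", 38.71, 54.67, 50.74, 6.11, 11.43),
    ("8) 0.1-0.2", 29.69, 30.76, 24.95, 11.39, 7.77),
    ("9) 0.05-0.1", 18.54, 12.21, 11.80, 9.53, 5.12),
]


def growth_by_reallocation(rows):
    """Return (label, col. 5, col. 6, col. 8) for each size class."""
    result = []
    for label, in_1969, postmerger, acquiring, acquired, absorbed in rows:
        without_mergers = round(in_1969 - postmerger + acquiring + acquired, 2)  # col. 5
        surviving = round(without_mergers - acquired, 2)                         # col. 6
        growth = absorbed / surviving                                            # col. 8
        result.append((label, without_mergers, surviving, growth))
    return result


if __name__ == "__main__":
    for label, without_mergers, surviving, growth in growth_by_reallocation(TABLE2):
        print(f"{label:<14}{without_mergers:>8.2f}{surviving:>8.2f}{growth:>6.2f}")
```

For size class 5 this gives 73.29, 69.06, and a growth rate of about 12 percent, and the growth rates rise from the largest classes toward the smaller ones, which is the pattern that contradicts assumption (2).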

What then is the effect of mergers and acquisitions on the firm-size
distribution? To show this effect graphically, we prepared from column 5
of table 1 the broken curve in figure 1. The broken curve indicates an
estimated firm-size distribution when the effects of mergers and acquisitions
are eliminated. This was superimposed on the actual postmerger size
distribution given by the solid curve in figure 1.

The broken line was obtained by connecting the sizes of the tenth and
eight hundred thirty-first firms in the "mergerless" distribution in order
to make it comparable to the solid line indicating the theoretical
distribution for postmerger data.

Comparing the two distributions, shown by the solid and broken lines,
respectively, we note the following two significant points. First, the slope
of the solid line is steeper than the slope of the broken line. As discussed
in Ijiri and Simon (1971), this slope measures the degree of concentration.
Therefore, we can conclude that mergers and acquisitions in
the 1948-69 period increased the degree of concentration.^11

The second important implication of table 1 is that the concavity of
the distribution is increased as a result of mergers and acquisitions. The
vertical deviation between the curve (actual or estimated) and the line
(theoretical) is much wider for the postmerger distribution than for the
"mergerless" distribution.^12
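
The slope comparison can be reproduced approximately from the two rank-size points given in the accompanying footnote on the tenth and eight hundred thirty-first firms. The sketch below assumes the slope is the negative of the straight-line slope between the two points on a double-log scale; the function name `double_log_slope` is hypothetical.

```python
import math


def double_log_slope(rank_a, size_a, rank_b, size_b):
    """Negative slope of the line joining two (rank, size) points on log-log axes."""
    return -(math.log(size_b) - math.log(size_a)) / (math.log(rank_b) - math.log(rank_a))


if __name__ == "__main__":
    # Sizes in $ billion for the tenth and the 831st firms, as reported in the footnote
    mergerless = double_log_slope(10, 5.706, 831, 0.064)
    postmerger = double_log_slope(10, 6.146, 831, 0.050)
    print("mergerless slope:", mergerless)
    print("postmerger slope:", postmerger)
    print("relative increase:", postmerger / mergerless - 1.0)
```

The two slopes come out close to the reported 1.015 and 1.089, and their ratio corresponds to an increase of roughly 7 percent.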


^9 To some extent, the conclusions from tables 1 and 2 are affected by the assumption
that all mergers and acquisitions have taken place at once. A year-by-year analysis
would indicate somewhat smaller differences among the rates of disappearance across
the classes and among the rates of growth by asset reallocation across the classes.

^10 It can be seen from eq. (1) that if the rank, $r$, is increased to $kr$, the size, $s$, is reduced
to $s/k^{\beta}$, indicating that the larger the $\beta$, the greater the relative size of a larger firm over
a small firm when the ranks of the two firms are fixed. It is clear from eq. (2) that this $\beta$
is the slope of the double-log distribution.

^11 For the "mergerless" data, the size of the tenth firm is estimated by interpolation
to be $5.706 billion and the size of the eight hundred thirty-first firm is estimated to be
$0.064 billion from the data in column 5 of table 1. Connecting these points, $\beta$ is estimated
at 1.015. For postmerger data, the size of the tenth firm is $6.146 billion and the size of
the eight hundred thirty-first firm is $0.050 billion from the Fortune data. Connecting
these points, $\beta$ is estimated to be 1.089, an increase of 7 percent in the slope as a result of
mergers and acquisitions.

^12 The greatest size variance (the ratio of actual over theoretical size) is approximately
1.6 for the "mergerless" data and 2.1 for the postmerger data.


IV. Summary and Conclusions

We have presented two different economic explanations for the concavity
of the double-log firm-size distributions as observed in empirical data,
one based on the autocorrelation of growth and the other based on
mergers and acquisitions.

Since both explanations are not only economically plausible but also
supported by empirical data, it is likely that both factors have contributed
to the concavity of the firm-size distribution. Whether there are other
contributing factors or whether these two factors are sufficient to eliminate
most of the concavity is difficult to determine. The degrees of
autocorrelation reflected in $\gamma$ would have to be determined independently
from the observed curvature of the empirical distribution before we
could answer this question. At present, we have no such independent
estimate of $\gamma$.

However, the analysis in this paper does indicate the advantage of
adjusting the Pareto curve for these additional economic factors instead
of abandoning it by reason of the concavity in the observed distribution.

From the standpoint of statistical curve fitting to empirical
distributions, we may find other distributions to be superior to the modified
Pareto curve. However, it is not our sole objective to find a distribution
that can best fit empirical data. What we are after is a model that has
sound economic support and justification and that can still give a
reasonably good fit to the data. The fact that the Pareto curve frequently
approximates not only firm-size data but also a variety of economic and
noneconomic skewed distributions is an important advantage in retaining
it as a first approximation, instead of abandoning it.


Appendix 

Consider a firm which was created at epoch $t$ as a result of getting one unit of sale.
The probability of getting a sale at epoch $(t + 1)$ is $\delta$. Its weight is discounted
from 1 to $\gamma$ at epoch $(t + 2)$. Hence, if the firm fails to receive the sale at epoch
$(t + 1)$, the probability of a sale at epoch $(t + 2)$ is $\delta\gamma$. On the other hand, if
the firm experiences a sale at epoch $(t + 1)$, its weight is changed to $(\gamma + 1)$;
hence the probability of a sale at epoch $(t + 2)$ is $(\delta\gamma + \delta)$. Note that in either
case the original sale contributes $\delta\gamma$ to the probability of a sale at epoch $(t + 2)$
regardless of whether there was a sale at epoch $(t + 1)$.

In general, a sale at epoch $t$ contributes to the probability of a sale at epoch
$(t + k)$ by the amount $\delta\gamma^{k-1}$. Thus, if we focus only on the contribution by the
original sale, this contribution decreases geometrically from $\delta$ for epoch $(t + 1)$,
$\delta\gamma$ for epoch $(t + 2)$, $\delta\gamma^2$ for epoch $(t + 3)$, and so on. Note that the events at
each epoch (based only on the contribution to probability by the original sale)
are mutually independent. Therefore, we have a sequence of independent
Bernoulli trials where the probability of success decreases geometrically. The sum
of sales generated, $N_1 = X_1 + X_2 + \cdots$, where $X_i$ is 1 if a sale is generated
at epoch $(t + i)$ and 0 otherwise, has the generating function

$$P_1(s) = [1 - \delta(1 - s)][1 - \delta\gamma(1 - s)][1 - \delta\gamma^2(1 - s)]\cdots. \tag{16}$$

Note that $P_1(1) = 1$. The subscript 1 in $N_1$ and $P_1$ is used to indicate that this
expression involves only the sales generated from the original sales.





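Hence, the derivative of $P_1(s)$ with respect to $s$ is

$$P_1'(s) = P_1(s)\left[\frac{\delta}{1 - \delta(1 - s)} + \frac{\delta\gamma}{1 - \delta\gamma(1 - s)} + \cdots\right] = P_1(s)\sum_{k=0}^{\infty}\frac{\delta\gamma^k}{1 - \delta\gamma^k(1 - s)}. \tag{17}$$

The expected value of $N_1$, $E(N_1)$, is then given by

$$E(N_1) = P_1'(1) = P_1(1)\,\delta\sum_{k=0}^{\infty}\gamma^k = (1 - \alpha)(1 - \gamma)\frac{1}{1 - \gamma} = 1 - \alpha. \tag{18}$$

The second derivative, $P_1''(s)$, is from (17)

$$P_1''(s) = P_1'(s)\sum_{k=0}^{\infty}\frac{\delta\gamma^k}{1 - \delta\gamma^k(1 - s)} - P_1(s)\sum_{k=0}^{\infty}\frac{\delta^2\gamma^{2k}}{[1 - \delta\gamma^k(1 - s)]^2}. \tag{19}$$

But from (17)

$$\sum_{k=0}^{\infty}\frac{\delta\gamma^k}{1 - \delta\gamma^k(1 - s)} = \frac{P_1'(s)}{P_1(s)}. \tag{20}$$

Substituting this in (19),

$$P_1''(s) = \frac{[P_1'(s)]^2}{P_1(s)} - P_1(s)\sum_{k=0}^{\infty}\frac{\delta^2\gamma^{2k}}{[1 - \delta\gamma^k(1 - s)]^2}. \tag{21}$$

Using the result in (18),

$$\begin{aligned}
P_1''(1) &= (1 - \alpha)^2 - \delta^2\sum_{k=0}^{\infty}\gamma^{2k} \\
&= (1 - \alpha)^2 - \frac{(1 - \alpha)^2(1 - \gamma)^2}{1 - \gamma^2} \\
&= (1 - \alpha)^2\,\frac{2\gamma}{1 + \gamma}.
\end{aligned} \tag{21a}$$

Now the variance of $N_1$, var $(N_1)$, is given (see Feller 1968, p. 266) by

$$\begin{aligned}
\operatorname{var}(N_1) &= P_1''(1) + P_1'(1) - [P_1'(1)]^2 \\
&= (1 - \alpha)^2\,\frac{2\gamma}{1 + \gamma} + (1 - \alpha) - (1 - \alpha)^2 \\
&= (1 - \alpha)\left[1 - \frac{(1 - \alpha)(1 - \gamma)}{1 + \gamma}\right].
\end{aligned} \tag{22}$$

Thus, var $(N_1) \to 1 - \alpha$ as $\gamma \to 1$ and var $(N_1) \to \alpha(1 - \alpha)$ as $\gamma \to 0$.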
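
Because the sales traced to the original sale are independent Bernoulli trials with success probabilities $\delta, \delta\gamma, \delta\gamma^2, \ldots$, the mean and variance of their sum can also be accumulated term by term and compared with (18) and (22). The sketch below does this; the function names, the truncation tolerance, and the parameter values are illustrative assumptions.

```python
def n1_moments_by_summation(alpha, gamma, tol=1e-15):
    """Sum p and p*(1-p) over trials with success probability delta * gamma**k."""
    delta = (1.0 - alpha) * (1.0 - gamma)
    mean = 0.0
    variance = 0.0
    p = delta
    while p > tol:  # later terms are negligible
        mean += p
        variance += p * (1.0 - p)
        p *= gamma
    return mean, variance


def n1_moments_closed_form(alpha, gamma):
    """Mean from eq. (18) and variance from eq. (22)."""
    mean = 1.0 - alpha
    variance = (1.0 - alpha) * (1.0 - (1.0 - alpha) * (1.0 - gamma) / (1.0 + gamma))
    return mean, variance


if __name__ == "__main__":
    for alpha, gamma in ((0.1, 0.95), (0.1, 0.5), (0.3, 0.0)):
        print(alpha, gamma,
              n1_moments_by_summation(alpha, gamma),
              n1_moments_closed_form(alpha, gamma))
```

The two routes should agree to within the truncation error for any $0 \le \gamma < 1$, including the limiting value $\alpha(1 - \alpha)$ at $\gamma = 0$.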

Note that if $\gamma$ or $\alpha$ is close to 1, $\delta = (1 - \alpha)(1 - \gamma)$ is close to 0; therefore,
applying the approximation (8) to (16), we obtain

$$\log P_1(s) \cong -(1 - s)\,\delta\,(1 + \gamma + \gamma^2 + \cdots) = -(1 - s)(1 - \alpha) \tag{23}$$

or

$$P_1(s) \cong e^{-(1 - s)(1 - \alpha)}, \tag{24}$$

which is a generating function of a Poisson distribution with mean $1 - \alpha$. However,
regardless of the value of $\gamma$ or $\alpha$, the mean and the variance of $N_1$ given in
(18) and (22) are exact and not approximations.

Now let us take into account the fact that the sales generated by the original
sale also contribute to generating further sales. Thus, we have a branching
process, each new sale having the potential of generating further sales. We may
call the original sale a zero generation sale and a sale generated as a result of an
$n$th generation sale an $(n + 1)$ generation sale.

Strictly speaking, the independence assumption needed for the application of
theorems in branching processes is not satisfied here. For example, at epoch
$(t + k)$, the probability of the original sale generating another one is $\delta\gamma^{k-1}$. At
the same time, if the original sale has generated another sale at epoch $(t + 1)$,
the latter also has a probability of generating another sale at epoch $(t + k)$ equal
to $\delta\gamma^{k-2}$. The two events are not independent. It is not possible for the original
sale and the first-generation sale both to generate sales, because only one unit of
sales occurs at epoch $(t + k)$. However, by reducing the time interval between
epochs and making a corresponding adjustment to $\gamma$, $\delta$ can be made as small as
we want. Hence, the probability of having two or more sales occurring in a given
epoch may be considered to be negligible.

Let $N_k$ be the size of the $k$th-generation sales and $P_k(s)$ be the generating function
of its probability distribution. Since there is only one original sale, we set $N_0 = 1$.
The generating function of $N_1$ has already been analyzed above.

The $k$th-generation sales, $N_k$, may be divided into $N_1$ groups based on their
ancestor in the first generation. The number in each group has the same probability
distribution as $N_{k-1}$. Since they may be considered mutually independent
approximately, the generating function is given by a compound function (see
Feller 1968, pp. 295-96):

$$P_k(s) = P_1(P_{k-1}(s)). \tag{25}$$

From this, we obtain

$$P_k'(1) = P_1'(1)\,P_{k-1}'(1) = (P_1'(1))^k, \tag{26}$$

$$P_k''(1) = k\,(P_1'(1))^{k-1}\,P_1''(1). \tag{27}$$

Hence, $E(N_k)$ and var $(N_k)$ are

$$E(N_k) = P_k'(1) = (1 - \alpha)^k, \tag{28}$$

$$\begin{aligned}
\operatorname{var}(N_k) &= P_k''(1) + P_k'(1) - (P_k'(1))^2 \\
&= k\,(P_1'(1))^{k-1}P_1''(1) + (1 - \alpha)^k - (1 - \alpha)^{2k} \\
&= k\,(1 - \alpha)^{k-1}\left[(1 - \alpha)^2 - (1 - \alpha)^2\,\frac{1 - \gamma}{1 + \gamma}\right] + (1 - \alpha)^k - (1 - \alpha)^{2k} \\
&= k\,(1 - \alpha)^{k+1}\,\frac{2\gamma}{1 + \gamma} + (1 - \alpha)^k - (1 - \alpha)^{2k}.
\end{aligned} \tag{29}$$

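Now let $M_k$ be

$$M_k = N_0 + N_1 + \cdots + N_k$$

and let

$$M \equiv \lim_{k \to \infty} M_k, \tag{30}$$

namely, the ultimate size of a firm. The generating function of $M$, $\rho(s)$, is given by

$$\rho(s) = sP_1(\rho(s)) \tag{31}$$

(Feller 1968, p. 298). Hence,

$$\rho'(s) = P_1(\rho(s)) + sP_1'(\rho(s))\rho'(s), \tag{32}$$

$$\rho'(s) = \frac{P_1(\rho(s))}{1 - sP_1'(\rho(s))}, \tag{33}$$

$$E(M) = \rho'(1) = \frac{1}{1 - (1 - \alpha)} = \frac{1}{\alpha}. \tag{34}$$

Also from (32)

$$\rho''(s) = 2P_1'(\rho(s))\rho'(s) + sP_1''(\rho(s))(\rho'(s))^2 + sP_1'(\rho(s))\rho''(s). \tag{35}$$

Hence,

$$\rho''(s) = \frac{2P_1'(\rho(s))\rho'(s) + sP_1''(\rho(s))(\rho'(s))^2}{1 - sP_1'(\rho(s))}, \tag{36}$$

$$\rho''(1) = \frac{1}{\alpha}\left[\frac{2(1 - \alpha)}{\alpha} + \frac{(1 - \alpha)^2}{\alpha^2}\,\frac{2\gamma}{1 + \gamma}\right]. \tag{37}$$

The variance of $M$ is

$$\begin{aligned}
\operatorname{var}(M) &= \rho''(1) + \rho'(1) - [\rho'(1)]^2 \\
&= \frac{1}{\alpha}\left[\frac{2(1 - \alpha)}{\alpha} + \frac{(1 - \alpha)^2}{\alpha^2}\,\frac{2\gamma}{1 + \gamma} + 1 - \frac{1}{\alpha}\right] \\
&= \frac{1 - \alpha}{\alpha^2} + \frac{(1 - \alpha)^2}{\alpha^3}\,\frac{2\gamma}{1 + \gamma}.
\end{aligned} \tag{38}$$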


Thus, for $0 < \alpha < 1$, the growth process of a firm dies out with probability 1
since $E(M) = 1/\alpha < \infty$. Furthermore, we note that $E(M)$ depends only on $\alpha$
and is independent of $\gamma$. In addition, var $(M)$ is decreased as $\gamma$ is decreased
(namely, heavier discounting). However, since the total number of firms generated
depends only on $\alpha$ and epoch $t$, the tail end of the firm-size distribution for the
largest rank is fixed for given $\alpha$ and $t$ and is independent of $\gamma$. Thus, in order to
reduce the variance of $M$, the sizes of above-average firms must be reduced and the
sizes of below-average firms must be increased, thus creating more curvature in
consistency with our earlier observations.
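
A small numerical sketch of (34) and (38) may help to see how the variance of the ultimate size responds to $\gamma$ while the mean does not. The function names and parameter values are illustrative assumptions. As a cross-check that is not part of the paper's derivation, the sketch also compares (38) with the standard total-progeny relation var $(M)$ = var $(N_1)/(1 - m)^3$, where $m = 1 - \alpha$ is the mean number of first-generation sales.

```python
import math


def n1_variance(alpha, gamma):
    """Eq. (22): variance of the first-generation sales N_1."""
    return (1.0 - alpha) * (1.0 - (1.0 - alpha) * (1.0 - gamma) / (1.0 + gamma))


def ultimate_size_mean(alpha):
    """Eq. (34): E(M) = 1 / alpha, independent of gamma."""
    return 1.0 / alpha


def ultimate_size_variance(alpha, gamma):
    """Eq. (38): variance of the ultimate size M."""
    return ((1.0 - alpha) / alpha ** 2
            + (1.0 - alpha) ** 2 / alpha ** 3 * 2.0 * gamma / (1.0 + gamma))


if __name__ == "__main__":
    alpha = 0.1  # hypothetical value
    for gamma in (0.99, 0.95, 0.80, 0.50):
        print(gamma, ultimate_size_mean(alpha), ultimate_size_variance(alpha, gamma))
```

For a fixed $\alpha$ the printed mean stays constant while the variance falls as $\gamma$ falls, which is the behavior the closing paragraph of the Appendix relies on.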


References

Federal Trade Commission. Large Mergers in Manufacturing and Mining, 1948-1969.
Statistical Report no. 5. Washington: Government Printing Office,
February 1970.

Feller, W. An Introduction to Probability Theory and Its Applications. Vol. 1. 3d ed.
New York: Wiley, 1968.

Fortune. "The 500 Largest U.S. Industrial Corporations." Fortune, May 1970. (a)

-----. "The Second 500 Largest U.S. Industrial Corporations." Fortune, June
1970. (b)

Ijiri, Y., and Simon, H. A. "Business Firm Growth and Size." A.E.R. 54 (March
1964): 77-89.

-----. "A Model of Business Firm Growth." Econometrica 35 (April 1967):
348-55.

-----. "Effects of Mergers and Acquisitions on Business Firm Concentration."
J.P.E. 79 (March/April 1971): 314-22.

Simon, H. A. "On a Class of Skew Distribution Functions." Biometrika 52
(December 1955): 425-40.

Steindl, J. Random Processes and the Growth of Firms: A Study of the Pareto Law. New
York: Hafner, 1965.





Soviet Growth in Absence of Centralized
Planning: A Hypothetical Alternative


Micha Gisser and Paul Jonas

University of New Mexico


The study makes an attempt to investigate whether the Soviet Union
would be richer or poorer if it had adopted a decentralized, agriculture-propelled,
market-oriented economic policy. The Soviet economy is
divided into agricultural and nonagricultural sectors. Each sector is
approximated by a Cobb-Douglas production function. Special output,
labor, and capital indices are then used in order to compare the actual
with the hypothetical growth of the Soviet economy. The conclusion
reached is that in absence of centralized planning, the growth of the
USSR would have been the same or better.


I. Introduction 

Would the Soviet Union be richer or poorer if it had adopted a decentralized
economic policy?

This question has been raised but never answered.^1 The "middle-echelon"
students of Soviet-type systems would argue that the question
is trivial, since it is recognized that a centrally planned system is able to
generate a process of speedy industrialization and rapid economic growth.
These views are summarized by Campbell: "Explained in the simplest
terms, the achievement has been to increase the level of Soviet economy
at a pace scarcely ever achieved in human history. ... The Soviet system
has made it possible to achieve unprecedented rates of growth" (Campbell
1960, p. 29).

^1 Levine, for example, suggests that the examination of the possible course of the Soviet
economy in the absence of central planning would be interesting. He feels, however, that
the analytical difficulties involved "defy meaningful handling" (Levine 1968).

What are the main characteristics of this Soviet-type system that is
able to generate the suggested successful economic performance? The
works of Baykov (1947), Dobb (1948), Erlich (1950, 1955, 1960),
Gerschenkron (1960), and others have revealed the various characteristics
of a Soviet-type development policy. The most important prerequisites
for a "classical" centrally planned development process are:

1. A dictatorship should take the factors of production, except labor,
into state ownership, and coercion should be applied not only to the
ancien régime, but also to every social stratum. In other words, policy
makers and activists should recognize that a new society based on central
planning cannot be established without the extraordinary efforts and
tears of the working class.

2. Therefore, a charismatic leader should utilize the idea of creating
a "golden age" to mobilize the labor force and should request or press
for "voluntary" and undercompensated achievements and high moral
standards for workers and lower-echelon cadres. At the same time,
decision-making centers must tacitly accept the fact that there is no
special economic-ideological feature in a Soviet-type system and that such
an organization is subordinate to the same problems of scarcity and
resource allocation as any other system.

3. The agricultural sector should serve as a reservoir for capital and
manpower for various industrial projects. First, the peasantry should be
neutralized; therefore, the latifundia of the landowners should be
distributed among those "who till the soil." Peasant landownership,
however, should not last long. The strong, if not mythical, ties between the
peasantry and the land should be broken. Forced collectivization,
compulsory deliveries, harsh administrative methods, and high taxes all
provide excellent means to achieve this end, when the political power is
firmly established. These methods will induce an exodus of the active and
young agricultural labor force from a sector that can be characterized by
disguised unemployment. Moreover, they will create capital accumulation
from a segment of the economy where potential savings exceed
actual savings.

4. A goal-oriented command economy should now be organized, giving
full economic power to a central planning board which orders the
composition of the product mix in such a way that the output of capital goods
must always exceed replacements. A distorted price system, which makes
inputs cheap and consumption expensive, will ease the way to achieve
this goal.

5. Finally, rigidly enforced development plans must be formulated.
Activists in various "transmission belts" and managers are expected to
mobilize the workers for the fulfillment of targets. Industrialization should
start discontinuously, as a sudden great spurt, combined with the
proliferation of every kind of heavy industry in a closed market virtually
isolated from the outside world and considering this import-substituting
and unbalanced industrialization strategy as the most important means
of expansion.




To implement the outlined economic strategies,^2 the state should act
as an Aztec-like god, demanding from the population more and more
sacrifices and regarding personal consumption not as a goal but as an
unfortunate input in the process of production. It is argued, however,
that while the incurred social costs are painful, they are unavoidable in a
process which, in a relatively short period, transforms a backward country
into one that is just, equitable, and industrialized. The command model
was considered a reliable generator of development, but recently the
picture of accelerated economic growth in the Soviet Union, and generally
in the Soviet bloc, has changed. Comments such as "the recent appearance
of decline in output growth rates is now a commonplace observation
among the students of Soviet economy" (Kaplan 1968) have become more
and more frequent.^3

The Soviet Union has arrived at a stage when economic stimuli are
needed to create growth, and the planners have to realize that, in a
complex economic milieu, scientific prognostication and effective stimulation
of production through profitability should be introduced.

This recent trend has created a dramatic transformation in the camp
of Sovietologists. What remains that is appealing in a command system,
one may ask, if rapid economic growth can no longer be considered a
success indicator?

A few years ago, the views of Jan Drewnowski (1961) were generally
accepted. Drewnowski argued that the Lange-Lerner model of market
socialism did not explain any reality because the suppositions are far
away from the existing conditions. "Economists," he warned, "must not
indulge in building castles in the air" (p. 342).

With the present shortages m the Soviet Umon, which may onginate 
from a phase when agriculture was exploited or, at the best, disregarded, 
more and more Sovietologists turn their attention to the examination of 
a Yugoslav-type model, or some variation of the old dream- socialism 
without coercion, socialism with a human face In this latter framework, 
decision making is decentralized, and extensive use is made of the market 
mechanism and financial instruments which influence the direction and 


¹ The works of Erlich reveal that the person mainly responsible for this type of
development strategy was Evgenii Preobrazhensky (1963, Nurkse 1937), a brilliant Russian
theoretical economist and unsuccessful politician who died as one of the victims of
Stalin’s purges.

² Growth rates declined from about the early 1960s, and the concealment of the 1963
national income figure until mid-1964 gave ground for suspicion that a new growth
trend had begun. The official Soviet estimate for this year was that the gross social
product rose by 3 percent. The American governmental agency, the CIA, estimated the
1963 increase as 2.5 percent. Independent Western estimates ranged from zero (Warren
Nutter) to the acceptance of the official Soviet figure (see Alec Nove, “2½ Per Cent and
All That” [1964]).




the coordination of the economy (Bergson 1967; Roberts 1968). Therefore,
in the spirit of our time, we also will engage ourselves in “building
castles in the air.”

We select for our suppositions the policy recommendations of Nikolai
Bukharin, the spokesman of the opposition to the centrally planned course
in the famous Soviet industrialization debate. He rejected the idea that
peasants would never provide resources for industrialization and, therefore,
objected to the proposition that the government should forcefully
collectivize agriculture and take resources by using coercion (Bukharin
1964).³ Bukharin suggested an “agriculture-propelled” economic development,
that is, that agriculture instead of heavy industry be emphasized
and provided with economic and technological help, and that a
stream of savings originating from a prosperous peasantry should be
plowed back into industrial expansion by the government. He even
proposed to the peasantry: “Enrichissez-vous.”

We are aware of all the dangers and analytical difficulties associated
with an examination concerned with the question, “What would have
happened in the USSR if, instead of a rigid Stalinist centrally planned
course, Bukharin’s flexible ideas had prevailed?” This problem looks,
however, extremely challenging, and one ought not to shy away from the
answer.

For our examination, in accordance with Bukharin’s position, we will
make the following assumptions: (1) among the factors of production,
only capital is in state ownership; (2) the state adopts an agriculture-oriented
policy in which the peasantry is not only protected from arbitrariness
and requisitions, but is also induced to benefit from agrotechnical
knowledge available in the West and is allowed to pursue various agricultural
activities freely; and (3) the whole economy is cast in a decentralized
New Economic Policy framework where the nationalized industry and the
private agriculture are linked together by a semiprivate, flexible trading
apparatus.

II. The Actual Growth of Soviet Agricultural Sector

In order to examine the growth of the Soviet agricultural sector in the
past, we construct a simple model with which we can separate the effects
of labor, capital, and technology. The Soviet economy is assumed to be
governed by the Cobb-Douglas production function with neutral technological
change, as follows:

$$X = A(t)\,K^{\alpha}L^{1-\alpha}, \qquad (1)$$

³ Jasny seems to concur with Bukharin’s views when he states: “The only conclusion
that can be reached on the basis of the Soviet experience is that collectivization of
agriculture should never be attempted by the methods resorted to in the USSR” (Jasny



TABLE 1

Share Estimates for Capital and Labor

                          Capital                      Labor
                   Low    Bergson   High       High   Bergson   Low
Agriculture        0.15    0.25     0.35       0.85    0.75     0.65
Nonagriculture     0.20    0.30     0.40       0.80    0.70     0.60


where X, A, K, and L denote agricultural output, technological change,
capital, and labor input.

The Cobb-Douglas production function is the best choice for the
period considered here, mainly because of its simplicity, which is
a particular asset when data are very crude. Another type of production
function was used by Weitzman (1970). Note, however, that Weitzman
applied his production function to a recent time period which is
different from the era investigated here. Moreover, he was particularly
interested in the effect of a low elasticity of substitution between labor and
capital on the reduction in Soviet economic growth in the late 1960s.
Following Solow (1957), we can apply a simple formula for measuring the
contribution of the change in technology to the change in output. This is
sometimes called the neutral shift in technology:

$$\dot{X}/X = \dot{A}/A + \alpha(\dot{K}/K) + (1 - \alpha)(\dot{L}/L), \qquad (2)$$

where the dot denotes a first-order derivative with respect to time, and
$\dot{A}/A$ measures the neutral shift in the agricultural-production function.
The reader should note that in fact $\dot{A}/A$ means more than just a neutral
shift in the production function. Griliches (1963) has demonstrated that
most of the estimated rates of disembodied technical progress can be
allocated to quality changes in the productivity of specific inputs. For
example, a rise in the level of schooling would be considered an improvement
of human agents. Since this is such an important point, an entire
section will be devoted to it later.
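
The step from equation (1) to equation (2) is not spelled out in the text. A brief sketch, assuming the Cobb-Douglas form $X = A(t)K^{\alpha}L^{1-\alpha}$ with a constant capital share $\alpha$, is to take logarithms and differentiate with respect to time:

```latex
\begin{align*}
\ln X &= \ln A(t) + \alpha \ln K + (1-\alpha)\ln L, \\
\frac{\dot{X}}{X} &= \frac{\dot{A}}{A} + \alpha\,\frac{\dot{K}}{K} + (1-\alpha)\,\frac{\dot{L}}{L}, \\
\frac{\dot{A}}{A} &= \frac{\dot{X}}{X} - \alpha\,\frac{\dot{K}}{K} - (1-\alpha)\,\frac{\dot{L}}{L}.
\end{align*}
```

The last line is the form used in practice: the technology term is obtained as a residual once the growth rates of output, capital, and labor and the share $\alpha$ are given.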

Kaplan indices (1968) are used as a measure of the percentage changes
of outputs and inputs.

A more difficult problem is to obtain a rough estimate of the coefficients
of capital and labor in the Cobb-Douglas production function. Based on
the findings of Bergson (1963), we adopt the following share estimates for
capital and labor in the Soviet Union (see table 1).

The Kaplan indices are available for capital (excluding livestock), sown
area, and man-years. The omission of livestock introduces an error into
the data. It is hoped that this is the smallest error in the data. Based
on Bergson (1963), it was assumed that nonland weighs 17.5 percent and



TABLE 2

Contribution of Technology Change to Growth of Soviet Agriculture

                   1928-40   1951-54   1954-58   1958-63
Output               1.60      5.60      9.20     -0.20
Capital              3.20      4.90      6.10      3.60
Man-years            1.20     -0.90     -0.70      0.80
Technology:*
  Capital low        0.10      5.63      8.88     -1.42
  Bergson           -0.10      5.05      8.20     -1.70
  Capital high      -0.30      4.47      7.52     -1.98

Note: Based on Kaplan indices (percentage change per annum).
* $\dot{A}/A$, expressed as a percentage (see eq. [2]).


land 82.5 percent in total agricultural capital.⁴ In table 2 we show the
Kaplan indices for outputs and inputs and our estimate of the contribution
of technology.
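
The technology rows of table 2 follow mechanically from equation (2). As a minimal sketch (the function and variable names are illustrative, not the authors'), the residual can be recomputed from the output, capital, and man-year rows together with the agricultural capital shares of table 1:

```python
# Growth-accounting residual implied by equation (2):
#   technology = output - alpha * capital - (1 - alpha) * labor
# All rates are percentage changes per annum as reported in table 2;
# the capital shares alpha are the agricultural shares of table 1.
# Names below are illustrative assumptions, not taken from the article.

CAPITAL_SHARES = {"Capital low": 0.15, "Bergson": 0.25, "Capital high": 0.35}

# period: (output, capital, man-years)
AGRICULTURE = {
    "1928-40": (1.60, 3.20, 1.20),
    "1951-54": (5.60, 4.90, -0.90),
    "1954-58": (9.20, 6.10, -0.70),
    "1958-63": (-0.20, 3.60, 0.80),
}


def residual(output, capital, labor, alpha):
    """Return the neutral technology shift as output growth minus weighted input growth."""
    return output - alpha * capital - (1.0 - alpha) * labor


def technology_rows(data, shares):
    """Compute one row of residuals (rounded to two decimals) per capital-share estimate."""
    return {
        label: {
            period: round(residual(*growth, alpha), 2)
            for period, growth in data.items()
        }
        for label, alpha in shares.items()
    }


rows = technology_rows(AGRICULTURE, CAPITAL_SHARES)
for label, row in rows.items():
    print(f"{label:13s}", "  ".join(f"{value:6.2f}" for value in row.values()))
```

Under these inputs the computed residuals agree with the twelve technology entries printed in table 2, which suggests that the capital index enters the calculation directly with the table 1 shares.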

It can be seen that the actual productivity trend in agriculture in the
period of 1928-40 was stagnant. During this period the ownership and
control of the bulk of productive resources were gradually transferred to
the state; several prices existed and were fixed more or less arbitrarily. The
government extracted from agriculture a significant part of its value
added by means of the regimen of compulsory deliveries to state procurement
organizations at low fixed prices.

In the period starting from 1951, actual agricultural productivity in
the Soviet Union shows a “great spurt” characteristic, which culminates
in the year 1958 and declines after that. The reason for this upsurge is that
Soviet planners realized the lagging performance of agriculture and
turned their attention to the problems of agricultural productivity. The
government, during this period, subsidized to some extent the prices of
some selected agricultural products, such as fiber and oil crops, and also
gave help in the form of controlled prices for fertilizers, fuel, feed, and
building materials which were lower than retail prices applicable to
private use. Another subsidy, during the upsurge, consisted of direct
grants to machine-tractor stations and state farms to cover their operating
losses.

III. The Actual Growth of Soviet Nonagricultural Sector

In order to examine the growth of the nonagricultural sector, we use
similar procedures. Let the nonagricultural sector be summarized by the
following Cobb-Douglas production function:

$$Y = B(t)\,K_{(1)}^{\beta}L_{(1)}^{1-\beta}, \qquad (3)$$


⁴ See Appendix A.



TABLE 3

Contribution of Technology Change to Growth of
Soviet Nonagricultural Sector

                   1928-40   1951-54   1954-58   1958-63
Output               8.10      7.00      6.50      5.90
Capital             14.10      9.20     10.10     11.30
Man-years            8.50      3.50      1.10      1.80
Technology:*
  Capital low       -1.52      2.36      3.60      2.20
  Bergson           -2.08      1.79      2.70      1.25
  Capital high      -2.64      1.22      1.80      0.30

Note: Based on Kaplan indices (percentage change per annum).
* $\dot{B}/B$, expressed as a percentage (see eq. [4]).


where $Y$, $B$, $K_{(1)}$, and $L_{(1)}$ denote nonagricultural output, technological
change, capital, and labor input. Again, following Solow (1957), we
obtain the measurement of the change in technology as follows:

$$\dot{Y}/Y = \dot{B}/B + \beta(\dot{K}_{(1)}/K_{(1)}) + (1 - \beta)(\dot{L}_{(1)}/L_{(1)}). \qquad (4)$$

The estimates of the capital and labor shares based on Bergson are
$\beta = 0.30$, $1 - \beta = 0.70$.

Using Kaplan indices, we obtain the following measurements of the
neutral shift in the aggregate nonagricultural production function (see table 3).

It is interesting to compare the results of tables 2 and 3. The percentage
shifts in productivity move roughly in the same direction in both
agriculture and the nonagricultural sector. “Technology” in the nonagricultural
sector declines between 1928 and 1940, and at the same time
this period shows an extremely rapid growth rate. The decline of “technology”
is counteracted by huge capital investments and by a dramatic
change in the capital-labor ratio. This period kept Soviet agriculture
stagnant, but produced a high growth rate in the still partly underutilized
industrial sector. The rapid growth in the nonagricultural sector
declines a little after the reconstruction period, but the output in the
agricultural sector increases significantly. After Stalin’s death, in the
period of “general thaw,” the slowdown in the nonagricultural output rate
begins, which is, interestingly, coupled with a dramatic rate of growth in
the agricultural sector. In the last phase of 1958-63 the direction is the
same in the agricultural and nonagricultural sectors, and they both
contribute to the retardation in Soviet growth.
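
The statement that the decline of “technology” in 1928-40 was counteracted by capital investment can be made concrete by splitting output growth into its three components under equation (4). The sketch below uses the Bergson shares and the table 3 rows; the names are illustrative assumptions rather than anything defined in the article.

```python
# Split nonagricultural output growth (table 3) into the parts attributable to
# capital, labor, and the residual "technology" term of equation (4).
# BETA is the Bergson capital share; the labor share is 1 - BETA = 0.70.
# Function and variable names are illustrative, not the authors'.

BETA = 0.30

# period: (output, capital, man-years), percentage change per annum
NONAGRICULTURE = {
    "1928-40": (8.10, 14.10, 8.50),
    "1951-54": (7.00, 9.20, 3.50),
    "1954-58": (6.50, 10.10, 1.10),
    "1958-63": (5.90, 11.30, 1.80),
}


def decompose(output, capital, labor, beta=BETA):
    """Return the capital, labor, and technology contributions to output growth."""
    capital_part = beta * capital
    labor_part = (1.0 - beta) * labor
    return {
        "capital": capital_part,
        "labor": labor_part,
        "technology": output - capital_part - labor_part,
    }


decomposition = {period: decompose(*growth) for period, growth in NONAGRICULTURE.items()}
for period, parts in decomposition.items():
    print(period, {name: round(value, 2) for name, value in parts.items()})
```

For 1928-40 the weighted inputs alone account for 4.23 + 5.95 = 10.18 percent per annum against measured output growth of 8.10 percent, which is where the negative residual of 2.08 percent comes from.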

It is possible to propose a hypothesis which would link certain fluctuations
in the growth rates and productivity shifts with the changing institutional
system in the Soviet Union. The national purpose, which was
loosely defined as “maximization of industrialization,” may produce
periods of investment booms that exceed the capacity of the economy, and





these must be followed by periods of consolidation. We may also conclude
that the fluctuations in the nonagricultural and especially in the agricultural
output in the Soviet Union around a general trend line seem pronounced
enough to constitute an addition to the evidence of economic
cycles in Soviet-type economies. A comparison with the behavior of other
output series, especially those relating to investment in some East Central
European Soviet-type economies, shows a marked similarity (Staller
1964).

It can be supposed that the recent retardation in Soviet growth can be
attributed to long-run cycles in the Soviet agricultural-production function:
a period in which, for political reasons, the agricultural sector
does not absorb new technologies from abroad is followed by a period of
political relaxation, in which the agricultural sector attempts to absorb
the technologies it failed to adopt during the long period of political
rigidity and to make use of the accumulated improvement in inputs which was
not exploited owing to an absence of economic freedom.


IV. Disembodied Technical Progress

It helps if, following Griliches (1963), one decomposes the contribution
of technology in agriculture into (a) improvement in the quality of inputs,
(b) elimination of disequilibria, and (c) selection of the optimal scale. As far
as improvement in the quality of human agents is concerned, one can
conjecture that more schooling contributes more to agricultural productivity
when the educated farmer is free to apply his education in adopting
new techniques, better marketing procedures, and more effective management.
There was a considerable drive to increase the level of schooling
in the USSR during 1928-40. It is interesting to note that agricultural
production shows no response to this increased level of education; in
other words, the disembodied technological progress is not detectable.
What was the reason for this phenomenon? Was the centrally planned
structure, where decisions were made at a higher level and managers
followed instructions step by step, so rigid that the latter could not use
their newly acquired skills? Did the abnormally depressed agricultural
price system give too little incentive for the leaders of cooperatives to
improve efficiency? Is it true that the most important gain from education
was that it enabled agricultural workers to avoid regulations? The “great
spurt” of 1951-58 probably shows that, owing to more favorable prices and
relaxation of economic controls, agricultural employees could take advantage
of the previous improvement in inputs, such as more schooling.
For example, the rising level of schooling and improving machinery were
of little help during the period prior to 1951 because of the known fact that
the Soviets failed to develop or adopt hybrid seeds and chemical fertilizers.
We can also conjecture that during that early period in Soviet development,
the resistance of peasants to the collectivization drive overwhelmed
the rise in their level of schooling, as well as improvements in nonhuman
inputs. Thus, the findings in table 2 probably indicate that the process of
collectivization did not so much eliminate disembodied technical progress
as concentrate it in a relatively short time period. The reader should
also note that for the period after 1950 there was a gradual change in the
sex and age composition of the farm labor force. The rural male population
suffered heavy losses during World War II, and by 1950 agriculture
was heavily dependent upon female labor. The proportion of males in the
labor force increased year by year thereafter, and this change in the sex
ratio should have contributed to output over and above the changes in
man-years of employment and schooling. This leads us to the next section,
which boils down to assuming that, had the Soviets not collectivized their
agricultural sector, disembodied technical progress would probably have
been spread more evenly over time. In particular, the agricultural sector
would have enjoyed a moderate increase in disembodied technology
during the period 1928-40.


V. The Hypothetical Alternative

In accordance with our assumption (2), we suppose that the Soviet
Union had not collectivized its agricultural sector. Rather, the Soviet
Union allowed farmers to buy inputs and sell agricultural output in a free
market in which prices are either determined according to the laws of supply
and demand or, alternatively, administered by the government, but not
in a manner which is unfavorable to farmers. We assume also that the
Soviet Union encouraged her private farmers to adopt all the new agricultural
techniques that were made available in the United States and
other Western countries. Is it safe to assume that, under these assumptions,
the Soviet agricultural sector could have benefited from disembodied
technologies at a rate similar to the rate that had prevailed in the United
States? Note that this is not an implied assumption; it is a historical
question.

If we accept the proposition that in the absence of collectivization a free
Soviet farm sector would have enjoyed a more evenly distributed disembodied
technical change over time, we can take a further step and assume
that such progress of disembodied technology would have followed the
Western pattern over time. Following Schultz (1953), we can accept the
estimate of a 1 percent increase in agricultural output per unit of input
during the period 1928-40. The relevant rate for the post-World War II
period for the United States is 2 percent (Heady 1967). Thus, the agricultural-production
function would have shifted hypothetically at an
annual rate of 1 percent from 1928 to 1940, and at a rate of 2 percent in
the postwar period.







We admit that borrowing the productivity growth rates from Schultz’s
and Heady’s work on the United States overlooks the fact that these
growth rates are not just a function of the specific time space in which
they occurred, but also are related to the factor proportions and other
practical circumstances obtaining in the United States in those years. One
might suggest that if a comparison of this kind is tried at all, it should be
with some period in United States history when the country’s economic
structure was somewhat more similar to that of Russia in the
1930s and 1950s. And yet, if one takes into consideration the fact that the
Soviet government embarked on a process of mass education of the
Russian peasants, and also that the Soviets did not have to invest in
agricultural technologies, but only to borrow available technologies from
the West, our assumption looks more attractive. Notice finally that we
are not adding the change in disembodied agricultural technology in the
United States to the Soviet change. We are replacing the Soviet pattern by
the Western pattern. For example, table 2 shows that agricultural output
grew at an annual rate of 9.2 percent during 1954-58. Using the estimate
based on Bergson, disembodied technology increased at a rate of 8.20
percent. This should be replaced by a 2 percent change as indicated
above. Accordingly, the hypothetical growth rate of agricultural output
should be 3.00 percent (3.00 = 9.20 - 8.20 + 2.00).
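
The replacement rule in the worked example (3.00 = 9.20 - 8.20 + 2.00) can be applied to every postwar period and every share estimate. The following is a small sketch under that reading; the residuals are those of table 2, the 2 percent Western rate is the Heady figure cited above, and the function name is a hypothetical one introduced here.

```python
# Replace the measured Soviet residual by the assumed Western rate of
# disembodied technical change: hypothetical = actual - soviet + western.
# Inputs are percentage changes per annum from table 2; the 2.00 percent
# postwar Western rate is the figure the text attributes to Heady (1967).
# Names are illustrative assumptions, not the authors'.

WESTERN_POSTWAR_RATE = 2.00

ACTUAL_OUTPUT = {"1951-54": 5.60, "1954-58": 9.20, "1958-63": -0.20}

SOVIET_RESIDUAL = {
    "Low capital": {"1951-54": 5.63, "1954-58": 8.88, "1958-63": -1.42},
    "Bergson": {"1951-54": 5.05, "1954-58": 8.20, "1958-63": -1.70},
    "High capital": {"1951-54": 4.47, "1954-58": 7.52, "1958-63": -1.98},
}


def hypothetical_growth(actual_output, soviet_residual, western_rate=WESTERN_POSTWAR_RATE):
    """Swap the Soviet technology residual for the Western one, leaving inputs unchanged."""
    return actual_output - soviet_residual + western_rate


hypothetical = {
    label: {
        period: round(hypothetical_growth(ACTUAL_OUTPUT[period], residual), 2)
        for period, residual in residuals.items()
    }
    for label, residuals in SOVIET_RESIDUAL.items()
}
for label, row in hypothetical.items():
    print(f"{label:13s}", row)
```

These values match the postwar hypothetical agricultural rows of table 4. For 1928-40 the article instead holds agricultural growth at the actual 1.60 percent and lets the gain show up as labor released to industry.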

According to Eason (1963), the Soviet agricultural labor force
amounted to 84.9 and 61.2 percent of the total labor force in 1926 and
1939, respectively. Thus, it is safe to assume that on the average, during that
time period, 1 percent released from the agricultural labor force would
have amounted to a 2.5 percent increase in the nonagricultural labor force.
Replacing the 0.10, -0.10, or -0.30 percent change (table 2) by 1 percent
would have resulted in a change of 0.9, 1.1, or 1.3 percent in disembodied
technology. Since the agricultural labor share is assumed to be 0.85, 0.75, or
0.65, respectively, adding the above percentage changes and maintaining
the same growth rate of agricultural output would have released more
labor to the nonagricultural sector. As explained in Appendix B, this
would have raised the annual growth rate of output in the nonagricultural
sector from 8.1 to 10.22, 10.67, or 11.10 percent, respectively, for the
period of 1928-40.
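
Appendix B is not reproduced in this section, so the chain of arithmetic behind these figures has to be inferred. One reading that reproduces the quoted numbers is sketched below: the extra technology gain is converted into released farm labor by dividing by the agricultural labor share, scaled by 2.5 to express it as nonfarm labor growth, and weighted by the nonagricultural labor share. The function and constant names are assumptions made for illustration.

```python
# One possible reconstruction of the 1928-40 labor-transfer arithmetic.
# Assumption: this chain is inferred from the figures quoted in the text,
# not taken from Appendix B itself.

WESTERN_SHIFT = 1.0         # percent per annum, 1928-40 (Schultz estimate cited in the text)
LABOR_MULTIPLIER = 2.5      # 1 percent of farm labor = 2.5 percent of nonfarm labor
ACTUAL_NONAG_GROWTH = 8.1   # percent per annum, table 3

# label: (Soviet residual from table 2, agricultural labor share, nonagricultural labor share)
CASES = {
    "Low capital": (0.10, 0.85, 0.80),
    "Bergson": (-0.10, 0.75, 0.70),
    "High capital": (-0.30, 0.65, 0.60),
}


def labor_transfer(soviet_residual, ag_labor_share, nonag_labor_share):
    """Return (released farm labor, extra nonfarm labor growth, hypothetical nonfarm output growth)."""
    extra_technology = WESTERN_SHIFT - soviet_residual
    # With agricultural output held at its actual growth rate, the technology
    # gain is offset by a slower-growing farm labor force.
    released_farm_labor = extra_technology / ag_labor_share
    extra_nonfarm_labor = released_farm_labor * LABOR_MULTIPLIER
    extra_output = extra_nonfarm_labor * nonag_labor_share
    return released_farm_labor, extra_nonfarm_labor, ACTUAL_NONAG_GROWTH + extra_output


results = {label: labor_transfer(*case) for label, case in CASES.items()}
for label, (released, extra_labor, growth) in results.items():
    print(f"{label:13s} released={released:.2f}  extra labor={extra_labor:.2f}  growth={growth:.2f}")
```

For the Bergson case this gives 1.47 percent of farm labor released, 3.67 percent additional nonfarm labor growth, and a nonagricultural growth rate of 10.67 percent, the figures the article itself quotes.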

Note that the assumption that the agricultural output target for the
year 1940 had not changed is somewhat arbitrary. Given our hypothetical
model, this assumption allows a higher rate of farm out-migration which
would have been favorable to the industrial sector. In other words, the
process of farm out-migration is a channel through which progress in the
agricultural sector can be “exported” to the industrial sector. Had we
adopted for our hypothetical model an agricultural output target for the
year 1940 higher than the actual target, less progress would have been
transferred from the agricultural to the industrial sector. Nevertheless,



TABLE 4

Actual and Hypothetical Growth Rates of Output in the
Two Sectors, USSR (Percentage Change per Annum)

                       1928-40   1951-54   1954-58   1958-63
Nonagricultural:
  Actual (Kaplan)        8.10      7.00      6.50      5.90
  Hypothetical:
    Low capital         10.22      7.00      6.50      5.90
    Bergson             10.67      7.00      6.50      5.90
    High capital        11.10      7.00      6.50      5.90
Agricultural:
  Actual (Kaplan)        1.60      5.60      9.20     -0.20
  Hypothetical:
    Low capital          1.60      1.97      2.32      3.22
    Bergson              1.60      2.55      3.00      3.50
    High capital         1.60      3.13      3.68      3.78

Note: For detailed explanation, see Appendix B.




TABLE 5

Actual and Hypothetical Growth Indices, USSR

                     1928     1940     1951     1954     1958     1963
Nonagricultural:
  Actual              100    254.63   272.45   333.77   429.38   571.90
  Hypothetical:
    Low capital       100    321.46   343.96   421.36   542.07   721.99
    Bergson           100    337.57   361.19   442.48   569.23   758.17
    High capital      100    353.65   378.40   463.56   596.35   794.29
Agricultural:
  Actual              100    120.98   127.76   150.45   213.93   211.80
  Hypothetical:
    Low capital       100    120.98   123.37   130.80   143.37   167.99
    Bergson           100    120.98   124.07   133.80   150.60   178.86
    High capital      100    120.98   124.77   136.86   158.14   190.38


according to our assumption, the nonagricultural sector could have
benefited from an additional 2.65, 3.67, or 5.00 percent growth of its
labor force for 1928-40 (see Appendix B). Given a share of labor of 0.80,
0.70, or 0.60, this could have meant an additional 2.12, 2.57, or 3.00
percent on top of the 8.1 percent actually achieved. This means that
hypothetical growth rates of outputs in the USSR could have been as
shown in table 4.

Based on table 4, and assuming for the sake of argument that output
levels in 1950 reached their previous 1940 level, we construct in table 5
indices which compare our hypothetical path of growth with the actual
one.
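
The indices can be rebuilt by compounding the annual rates of table 4 from a 1928 base of 100. The sketch below assumes, as a reading of the 1950 assumption that reproduces the published figures, that the 1951 index is one year of growth at the 1951-54 rate above the 1940 level; the function names are illustrative.

```python
# Compound the annual growth rates of table 4 into index numbers (1928 = 100).
# Assumption: output in 1950 equals its 1940 level, so the 1951 entry is the
# 1940 index grown for one year at the 1951-54 rate. This treatment is inferred
# from the published indices, not stated explicitly in the text.


def growth_index(segments, base=100.0):
    """Return index levels after each (years, percent-per-annum) segment, starting at base."""
    levels = [base]
    for years, rate in segments:
        levels.append(levels[-1] * (1.0 + rate / 100.0) ** years)
    return levels


def index_row(r_1928_40, r_1951_54, r_1954_58, r_1958_63):
    """Return index levels for 1928, 1940, 1951, 1954, 1958, and 1963."""
    segments = [
        (12, r_1928_40),  # 1928 -> 1940
        (1, r_1951_54),   # 1950 (= 1940 level) -> 1951
        (3, r_1951_54),   # 1951 -> 1954
        (4, r_1954_58),   # 1954 -> 1958
        (5, r_1958_63),   # 1958 -> 1963
    ]
    return growth_index(segments)


actual_nonag = index_row(8.10, 7.00, 6.50, 5.90)
bergson_nonag = index_row(10.67, 7.00, 6.50, 5.90)
actual_ag = index_row(1.60, 5.60, 9.20, -0.20)

for name, row in [("actual nonag", actual_nonag), ("Bergson nonag", bergson_nonag), ("actual ag", actual_ag)]:
    print(f"{name:14s}", "  ".join(f"{level:7.2f}" for level in row))
```

The computed levels agree with the corresponding rows of table 5 to within a few hundredths of an index point; small differences would be expected if the original figures were chained from rounded intermediate values.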

In what follows we shall adhere to the estimates based on Bergson. However,
the reader can check the results for the low and high capital estimates as
they are shown in table 5.








Table 5 shows that if our hypothetical alternative had been followed,
in 1940 nonagricultural production in the USSR could have been 33
percent higher than the actual level. This would have been achieved
without any sacrifice, social or economic, on the part of the agricultural
sector.⁵

In 1963, the trade-off would have been slightly less favorable: nonagricultural
output would still have been 33 percent higher than the actual
level, but agricultural output would have had to be 14 percent less than
it actually was.

The reader should not be puzzled by the fact that under our hypothesis
the agricultural sector ends up a “loser” in 1963. This results from our
assumption that the agricultural output target for the year 1940 had not
changed under our hypothesis. Recall that because of this assumption the
agricultural sector could have released 1.47 percent of its labor force,
which meant 3.67 percent extra growth for the industrial labor force.
In table 4, this labor transfer, while leaving the rate of growth of the
hypothetical agricultural output unchanged at 1.6 percent, increases the
rate of growth of the industrial sector from 8.1 to 10.67 percent for the
period 1928-40.

Sensitivity Analysis (Variation in Assumptions)

Some sensitivity analysis has been used throughout the analysis. In
applying the Cobb-Douglas production function, we used three estimates
for the coefficient of capital: low, “Bergson,” and high. But there are
some other moderate variations in assumptions that can be made in
order to test the validity of our model. We now consider some of the
problems that arise in conjunction with sensitivity analysis. We admit that
some of these problems cannot be solved, while other problems can be
subjected to a meaningful test. Consider the following:

a) Would the various Kaplan indices be valid under our hypothetical assumptions?
We can give a reasonable answer only when the labor input is
considered. We believe the labor-input index would have remained the
same, first, because population would have grown at the same
rate over time. Second, we believe that the labor-force participation rate
would have remained about the same, mainly because incentives to join
the labor force would have replaced coercion.

What about capital indices? In this case we can only speculate. Would
saving be lower under our hypothetical alternative? Even if domestic

⁵ Note that actual agricultural output in 1958 and 1963 was “abnormal,” the first
too high and the second too low because of climatic conditions. Accordingly, if one uses
a 3-year average output level centered on 1958, the adjusted actual output for 1958 in
table 5 would be 196.8; using the same process gives an adjusted output for 1963 of 222.4.




saving would have been lower, it is very likely that borrowing capital in 
“capitalistic” markets abroad would have become easier, thus making up 
for the reduction in domestic saving. 

b) It is very likely that prices received by farmers would have been much higher
under our hypothetical alternative compared with actual prices. This gives rise to
a problem which is closely related to the following.

c) It is not at all clear that the Russian industrial sector was ready to absorb
more labor released by the agricultural sector. Let us now discuss (b) and (c)
and test a variation in our assumption concerning the rate of farm out-migration.
If agriculture and the rest of the economy had been linked
together by a semiprivate, flexible trading apparatus, there would have
been fewer persons transferring from agriculture to industry during
1928-40 than under our hypothetical alternative, or compared even with
what in fact occurred. The prices received by farmers would have been
much higher than the nominal prices at which much of the farm output
was transferred during the thirties.

Next, there are doubts that industrial employment could have grown
at a significantly higher rate from 1928 to 1940 than it did without resulting
in an even greater rate of negative disembodied technology. This
results from assuming that no technological change, either positive or
negative, would have been associated with the transfer of additional labor,
while during the 1928-40 period the data in table 3, using Bergson’s coefficients,
imply that the actual annual growth in inputs “overexplained”
the growth of output by an annual rate of slightly more than 2 percent.
Thus, it is not clear how further straining of organizational capacity could
result in the ability to expand size further as required to absorb an
industrial labor force growing at 11.2 percent annually instead of 8.5
percent.

To test this variation in our assumptions, let us revise our hypothetical
alternative. Assume that under the semiprivate, flexible trading apparatus,
the rate of farm out-migration would have been the same as the actual
rate. This means a reduction of the farm labor force from slightly over 80
percent in 1928 to 60 percent in 1940 and 40 percent in 1963. It is
reasonable to assume that proper taxes and subsidies coupled to a free-market
mechanism would have achieved such a goal. Under such assumptions,
the hypothetical growth rates in the agricultural sector for the period
1928-40 would have been 2.50, 2.70, and 2.90, respectively. (For example,
2.50 = 3.20 × 0.15 + 1.20 × 0.85 + 1.00.) These rates would replace
1.60 percent in table 4. Resulting from such an adjustment, table 6 would
replace table 5, as follows. Here, the agricultural sector would have ended
up at 203.53, which is slightly less than the actual 211.80. What it means
is that, even with this modification of assumptions, under a free market the
Soviet economy would have performed as well as it actually did.
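
As a rough check on the arithmetic above, the following sketch recomputes the 2.50 percent example and compounds the three hypothetical rates over the 12 years from 1928 to 1940. The function and variable names are illustrative only and do not appear in the original. Reading the 1940 entries of table 6 as simple annual compounding is an assumption, although the published figures are consistent with it.

```python
# Illustrative sketch; names are hypothetical, inputs are the figures in the text.

def compound_index(rate_percent, years, base=100.0):
    """Index level reached after compounding an annual percentage rate."""
    return base * (1.0 + rate_percent / 100.0) ** years

# Worked example quoted in the text: 2.50 = 3.20 x 0.15 + 1.20 x 0.85 + 1.00
example_rate = 3.20 * 0.15 + 1.20 * 0.85 + 1.00

# The three hypothetical agricultural growth rates for 1928-40 (percent per year).
rates = {"low capital": 2.50, "Bergson": 2.70, "high capital": 2.90}

# Compound each rate over 12 years, starting from 1928 = 100.
index_1940 = {name: round(compound_index(r, 12), 2) for name, r in rates.items()}

if __name__ == "__main__":
    print(round(example_rate, 2))
    for name, value in index_1940.items():
        print(name, value)
```

Under these assumptions the computed 1940 levels can be compared directly with the 1940 column of the hypothetical agricultural rows in table 6.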





TABLE 6

Actual and Hypothetical Growth Indices, USSR,
Assuming No Change in Farm Out-Migration Rates

                              1928    1940    1951    1954    1958    1963

1. Nonagricultural:
   Actual = Hypothetical       100  254.63  272.45  333.77  429.38  571.90

2. Agricultural:
   Actual                      100  120.98  127.76  150.45  213.93  211.80
   Hypothetical:
     Low capital               100  134.49  137.14  145.40  159.37  186.73
     Bergson                   100  137.67  141.18  152.26  171.37  203.53
     High capital              100  140.92  145.33  159.40  184.19  221.73


VI. Conclusion 

The result of our analysis is that industrialization without the
“superindustrializers” could have occurred at the same rate or even a more
impressive rate than actually happened. The computed annual growth
rate in the nonagricultural sector could have been 2.57 percent on top
of the 8.1 percent during the period 1928-40. This could have been
achieved at no social and economic cost to the agricultural sector until
1940, and at a small economic cost (but without the incurred social cost)
to the agricultural sector thereafter.
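
The growth rates quoted above can be recovered from the index numbers reported in the tables. The sketch below is illustrative only: the function name is hypothetical, and it assumes the rates are average annual compound rates over the 12 years 1928-40, using the 1940 nonagricultural indices of 254.63 (actual) and 337.57 (accelerated out-migration) from table 7.

```python
# Illustrative sketch; assumes average annual compound growth over 1928-40.

def annual_rate(index_end, years, index_start=100.0):
    """Average annual growth rate (percent) implied by two index levels."""
    return ((index_end / index_start) ** (1.0 / years) - 1.0) * 100.0

actual = annual_rate(254.63, 12)        # actual nonagricultural index, 1940
hypothetical = annual_rate(337.57, 12)  # accelerated farm out-migration case
gain = hypothetical - actual            # extra growth "on top of" the actual rate

if __name__ == "__main__":
    print(round(actual, 2), round(hypothetical, 2), round(gain, 2))
```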

Our results suggest that there was the same or even greater growth
possibility in the approach of those who, in the “Great Industrialization
Debate,” argued that a market relationship between a free agricultural
sector and a state-owned industrial sector could generate savings and
investments in the former and could reduce costs and establish profitability
in the latter. In this way more capacity could have been created in
industry without sacrificing agriculture and living standards. The
computations show that this idea was far from being utopian. The acceptance of
the Preobrazhensky-Stalin path led to unnecessary sufferings on the part
of the Soviet population and misallocation of resources.

We admit that our model gives rise to some difficulties that were pointed
out in the last section.

First, if agriculture and the rest of the economy had been linked
together by a “semiprivate, flexible trading apparatus,” there would have
been fewer persons transferring from agriculture to industry during
1928-40 than we have suggested. The prices received by the farmers would have
been higher than the nominal prices at which much of the farm output
was transferred during the thirties. Moreover, as indicated in the last
section, we have doubts about the capacity of the Soviet industrial sector
to absorb more labor than it actually did during 1928-40. Adjusting our
assumptions accordingly yielded table 6. All these findings are summarized
in table 7 as follows.

TABLE 7

Summary of Findings: Indices

                                                        1928    1940    1963

1. Nonagricultural:
   Actual                                                100  254.63  571.90
   Hypothetical, assuming an accelerated rate of
     farm out-migration                                  100  337.57  758.17
   Hypothetical, no change in the rate of farm
     out-migration                                       100  254.63  571.90

2. Agricultural:
   Actual                                                100  120.98  211.80
   Hypothetical, assuming an accelerated rate of
     farm out-migration                                  100  120.98  178.86
   Hypothetical, assuming no change in the rate of
     farm out-migration                                  100  137.67  203.53

Note.—Capital coefficients are according to Bergson.

Thus, even if we assume no change in the rate of farm
out-migration, we have shown that the Soviet Union could have achieved
the same economic miracles under a system which is closer to a free-market
mechanism. We conclude that the heavy price in human life, in human
suffering, and in lack of freedom that many Soviet citizens had to pay for
adopting the Preobrazhensky-Stalin system was not needed. It seems that
Bukharin was right after all.







Appendix A 

The following (table A1) is a revised version of the Bergson (1963) estimates of the
shares of capital and labor:


TABLE A1

Bergson's Sector Estimates

                Both Sectors     Agricultural Sector     Nonagricultural Sector
                (Billions of    Billions of      %       Billions of      %
                  Rubles)         Rubles       Share       Rubles       Share

Set A:
  Employment       179.9          110.1*         75          69.8         64
  Capital           46.2            6.9†                     39.3         36
  Farm land         22.4           22.4          25
  Livestock          7.0            7.0
    Total          255.5          146.4         100         109.1        100

Set B:
  Employment       179.9          110.1*         80          69.8         82
  Capital           18.5            2.8†                     15.7         18
  Farm land         22.4           22.4          20
  Livestock          2.8            2.8
    Total          223.6          138.1         100          85.5        100

Set C:
  Employment       179.9          110.1*         70          69.8         64
  Capital           46.2            6.9†                     39.3         36
  Farm land         33.6           33.6          30
  Livestock          7.0            7.0
    Total          266.7          157.6         100         109.1        100

* Agricultural labor force accounted for 61.2 percent of the entire labor force in 1939. This is based on
findings of Eason (1963).

† We assumed that 15 percent of the item capital was agricultural.
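
The percentage shares in table A1 appear to follow from the ruble figures by simple division. The sketch below checks this for Set A only. It is an illustration under stated assumptions: the dictionary and function names are hypothetical, and treating the 25 percent agricultural entry as the combined share of capital, farm land, and livestock is an inference from the arithmetic rather than something the table states.

```python
# Illustrative check of Set A in table A1 (billions of rubles); names are hypothetical.

agri_set_a = {"employment": 110.1, "capital": 6.9, "farm land": 22.4, "livestock": 7.0}
nonagri_set_a = {"employment": 69.8, "capital": 39.3}

def labor_share(items):
    """Employment as a percentage of the sector total."""
    return 100.0 * items["employment"] / sum(items.values())

agri_total = round(sum(agri_set_a.values()), 1)
agri_labor = round(labor_share(agri_set_a))        # employment share, agriculture
agri_nonlabor = 100 - agri_labor                   # assumed: capital + land + livestock
nonagri_labor = round(labor_share(nonagri_set_a))  # employment share, nonagriculture

# Second footnote: 15 percent of total capital (46.2) is assigned to agriculture.
agri_capital = round(0.15 * 46.2, 1)

if __name__ == "__main__":
    print(agri_total, agri_labor, agri_nonlabor, nonagri_labor, agri_capital)
```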


Appendix B 

Based on Eason (1963), during the period 1928-40, 1 percent of labor released
by the agricultural sector would have contributed 2.5 percent increase in the
labor force of the nonagricultural sector. Based on Bergson (1963), the
agricultural labor share was 0.75 and the nonagricultural labor share was 0.70. Since
the data are crude, we considered a range of shares, rather than a point estimate.
The hypothetical growth rate of the nonagricultural sector for the period 1928-40
was calculated, as shown in table 3.

The hypothetical growth rate of agricultural output for 1928-40 is unchanged.
This is true because the agricultural production goal would have remained the
same for the year 1940 under our hypothesis. But, replacing the change in
disembodied technology amounting to 0.10, -0.10, and -0.30 by 1 percent change,
we obtained 0.9, 1.1, or 1.3 alternative hypothetical disembodied technological
changes. This would have enabled the agricultural sector to release 1.06, 1.47, or
2.00 percent per annum of its labor force during 1928-40.









Next, the annual growth rates of agricultural output for the postwar periods 
were obtained by replacing the actual percentage changes in disembodied 
technology with our 2 percent hypothetical change, as follows. 


1951-54:

    5.6 - 5.63 + 2 = 1.97
    5.6 - 5.05 + 2 = 2.55
    5.6 - 4.47 + 2 = 3.13

1954-58:

    9.2 - 8.88 + 2 = 2.32
    9.2 - 8.20 + 2 = 3.00
    9.2 - 7.52 + 2 = 3.68

1958-63:

    -0.2 - (-1.42) + 2 = 3.22
    -0.2 - (-1.70) + 2 = 3.50
    -0.2 - (-1.98) + 2 = 3.78
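
The substitution described above can be sketched as follows. The code is illustrative: the names are hypothetical, the inputs are the figures printed in this appendix, and the final chaining step assumes that the first rate in each period belongs to the low-capital series of table 6 and that the index compounds annually within each period. The appendix does not state either assumption explicitly.

```python
# Illustrative sketch of the Appendix B substitution; names are hypothetical.

actual_output_growth = {"1951-54": 5.6, "1954-58": 9.2, "1958-63": -0.2}
actual_tech_change = {
    "1951-54": [5.63, 5.05, 4.47],
    "1954-58": [8.88, 8.20, 7.52],
    "1958-63": [-1.42, -1.70, -1.98],
}
HYPOTHETICAL_TECH_CHANGE = 2.0  # percent per year, as assumed in the text

def hypothetical_rates(period):
    """Replace the actual disembodied technical change with the hypothetical one."""
    growth = actual_output_growth[period]
    return [round(growth - tech + HYPOTHETICAL_TECH_CHANGE, 2)
            for tech in actual_tech_change[period]]

def chain_index(start, steps):
    """Compound an index through (annual rate in percent, number of years) steps."""
    level = start
    for rate, years in steps:
        level *= (1.0 + rate / 100.0) ** years
    return level

# Assumed reading: first rate in each period, chained from the 1951 low-capital index.
low_capital_1963 = chain_index(137.14, [
    (hypothetical_rates("1951-54")[0], 3),
    (hypothetical_rates("1954-58")[0], 4),
    (hypothetical_rates("1958-63")[0], 5),
])

if __name__ == "__main__":
    for period in actual_output_growth:
        print(period, hypothetical_rates(period))
    print(round(low_capital_1963, 2))
```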


References 

Baykov, Alexander. The Development of the Soviet Economic System. Cambridge,
Mass.: Harvard Univ. Press, 1947.

Bergson, Abram. “National Income.” In Economic Trends in the Soviet Union,
edited by A. Bergson and S. Kuznets. Cambridge, Mass.: Harvard Univ.
Press, 1963.

---. “Market Socialism Revisited.” J.P.E. 75, no. 4, suppl. (October 1967):
655-76.

Bukharin, N. I. “Notes of an Economist at the Beginning of a New Economic
Year.” Reprinted in Foundations of Soviet Strategy for Economic Growth: Selected
Soviet Essays, edited by Nicolas Spulber. Bloomington: Indiana Univ. Press,
1964.

Campbell, Robert W. Soviet Economic Power. Cambridge, Mass.: Riverside Press,
1960.

Dobb, Maurice. Soviet Economic Development since 1917. New York: n.p., 1948.

Drewnowski, Jan. “The Economic Theory of Socialism: A Suggestion for
Reconsideration.” J.P.E. 69, no. 4 (August 1961): 341-54.

Eason, Warren W. “Labor Force.” In Economic Trends in the Soviet Union, edited
by A. Bergson and S. Kuznets. Cambridge, Mass.: Harvard Univ. Press, 1963.

Erlich, Alexander. “Preobrazhenskii and the Economics of Soviet
Industrialization.” Q.J.E. 69 (February 1950): 57-88.

---. “Stalin on Economic Development.” In Continuity and Change in Russian
and Soviet Thought, edited by E. Simmons. Cambridge, Mass.: Harvard Univ.
Press, 1955.

---. The Soviet Industrialization Debate, 1924-1929. Cambridge, Mass.: Harvard
Univ. Press, 1960.

Gerschenkron, Alexander. “Problems and Patterns of Russian Economic
Development.” In Transformation of Russian Society, edited by Cyril Black. Cambridge,
Mass.: Harvard Univ. Press, 1960.

Griliches, Zvi. “The Sources of Measured Productivity Growth: United States
Agriculture.” J.P.E. 71, no. 4 (August 1963): 331-46.

Heady, Earl O. A Primer on Food, Agriculture, and Public Policy. New York: Random
House, 1967.

Jasny, Naum. The Socialized Agriculture of the U.S.S.R.: Plans and Performances.
Stanford, Calif.: Stanford Univ. Press, 1950.

Kaplan, Norman M. “Retardation in Soviet Growth.” Rev. Econ. and Statis. 50,
no. 3 (August 1968): 293-303.

Levine, Herbert S. “The Role of Central Planning.” In New Currents in Soviet-Type
Economics, edited by George R. Feiwel. Scranton, Pa.: International Textbook
Co., 1968.

Nove, Alec. “2^ Per Cent and All That.” Soviet Studies 16, no. 4 (July 1964): 28-54.

Nurkse, Ragnar. Problems of Capital Formation in Underdeveloped Countries. New
York: Oxford Univ. Press, 1957.

Preobrazhensky, Evgeny. The New Economics. Oxford: Clarendon Press, 1965.

Roberts, Paul Craig. “Drewnowski’s Economic Theory of Socialism.” J.P.E. 76,
no. 4 (July/August 1968): 645-50.

Schultz, Theodore W. The Economic Organization of Agriculture. New York:
McGraw-Hill, 1953.

Solow, Robert M. “Technical Change and the Aggregate Production Function.”
Rev. Econ. and Statis. 39, no. 3 (August 1957): 312-20.

Staller, George J. “Fluctuations in Economic Activity: Planned and Free Market
Economies, 1950-1960.” A.E.R. 39, no. 3 (June 1964): 312-20.

Weitzman, Martin L. “Soviet Postwar Economic Growth and Capital-Labor
Substitution.” A.E.R. 60, no. 4 (September 1970): 676-92.




Ghetto Employment and the 
Model Cities Program 


Bennett Harrison 

Massachusetts Institute of Technology


In 1971, the Model Cities Program subsidized 25,000 jobs in municipal
and nonprofit agencies. Both the original legislation and subsequent
administrative rulings require participating cities to give preference in
hiring to residents of the target areas—the urban ghetto. Yet, by the
spring of 1971, fewer than half the positions were held by ghetto residents.
Moreover, nonresidents were being paid significantly higher salaries than
residents. This paper attempts to identify the determinants of intercity
variation in “resident participation rates” and the causes of the
resident/nonresident differences in earnings. It was found that cities with highly
structured bureaucracies and/or strong mayors allocate a larger share
of Model Cities jobs to “outsiders” than is the case for other cities. On
the other hand, real or potential “black power” (as measured, for
example, by whether or not the city experienced a major civil disorder
in 1967) increases ghetto participation in the Model Cities Program.
Analysis of more than 14,000 individual employee records shows that
neither occupational “crowding” nor lack of educational credentials
sufficiently explains the salary differences between ghetto and
nonghetto Model Cities workers.


Introduction 

During the 1960s, the federal government launched a panoply of
antipoverty programs designed to ameliorate conditions in the ghettos of
American cities. Government planners, academic researchers, and
representatives of the organized poor themselves have focused their
attention on two particular aspects of these programs: what we might
call the “output side,” that is, the kinds of service these programs were
intended to deliver, and the extent to which the groups receiving the
services—the poor themselves—were permitted to participate in the
design and administration of the program.

The author is associate professor of economics and urban studies, Department of
Urban Studies and Planning, MIT. This paper was written under a grant from the
U.S. Office of Economic Opportunity to the Maryland Project on the Economics of
Discrimination, and was delivered to the 1972 annual meetings of the American Political
Science Association. Data and guidance in their use were provided by the U.S.
Department of Housing and Urban Development and by the National Civil Service League.
Bradley Schiller, Charles Levine, and Frank Thompson contributed helpful criticism of
an earlier draft. I am also grateful to All M. Parhixgan for his able and energetic research
and programming assistance. I alone am, of course, responsible for the final presentation.

Curiously, little attention has been paid to what I shall call the “input
side”—namely, direct, salaried employment in these activities. This
myopia is surely true for the Model Cities Program. One can find
numerous stories in the popular press about the local conflicts surrounding
the election of neighborhood boards, but one is hard pressed indeed to
find any discussion of the manpower aspects of Model Cities. Yet, if I am
correct in my estimate that—by the middle of 1971—there were perhaps
25,000 salaried employees in the Model Cities Program across the country,
then Model Cities as a source of jobs is now larger than New Careers,
Operation Mainstream, or the Job Corps. Even if Model Cities fails to
provide either high-quality services to the ghetto community or a vehicle
for increased political participation by the urban poor (and in saying
this, I do not mean to imply that such has been the case), it would still
be an important part of our overall antipoverty effort simply because
of the employment and wages it has generated.

There are three kinds of positions in the Model Cities Program:
(1) direct City Demonstration Agency (CDA) jobs for professionals,
paraprofessionals, clerical/office workers, technicians, laborers, and
operatives (e.g., truck drivers) on the central CDA staff itself; (2) similar
employment in the municipal or private nonprofit “delegate agencies”
(DAs) which are operating projects supported in whole or in part by
HUD-Model Cities funds; and (3) a share of the jobs generated by new
construction associated with the program (this category has produced
too few jobs to permit analysis). If all of the estimated 25,000 jobs in fact
went to the urban poor, then—at roughly $8,500 a job—ghetto income
would be increased by $213 million a year, a not-insignificant sum.
In fact, however, all of the jobs are not going to ghetto (i.e.,
“model-neighborhood”) residents. On April 30, 1969, after nearly 2 years of
“start-up,” only 44 percent of the salaried employees in the 77
“first-round” Model Cities projects were from the designated target
neighborhoods. The “resident participation rate” (RPR) was only 22 percent for
professionals (a not-unexpected result for a planning period), 68 percent
for subprofessionals, 49 percent for clerical workers, and 80 percent for
other categories. Moreover, nonresidents (of all races) received
substantially higher salaries than residents in the same occupations and with
similar demographic characteristics; these salary differentials ranged
from $800 to $1,300 per year (Harrison 1973).¹ A much more
comprehensive survey of 75 projects in the spring of 1971—the data base for the
present paper—indicates the absence of any improvement in the situation;
again, fewer than half of all the jobs in the program were held by ghetto
residents.² Professional RPRs were 23 percent for whites and 30 percent
for minorities. For subprofessionals, technicians, clerical workers, and
laborers, the respective RPRs were 63 percent and 74 percent, 57 percent
and 51 percent, 46 percent and 52 percent, and 56 and 64 percent.

¹ This study of the Model Cities planning period was conducted for the National Civil
Service League with questionnaires administered by HUD and processed by the Bureau
of Applied Social Research at Columbia University.

These employment practices violate both the intent of Congress and
administrative regulations of the Department of Housing and Urban
Development. Title I of the original Demonstration Cities and
Metropolitan Development Act of 1966 (PL 89-754), in authorizing HUD to
undertake the Model Cities Program, stipulated that “residents of the
model neighborhoods should also be given maximum opportunity for
jobs in the projects and activities of the program.” Section 103(a) of the
act requires that each CDA provide “maximum opportunities for
employing residents of the area in all phases of the program and enlarging
opportunities for work and training.” In 1970, the HUD assistant
secretary for community development issued “CDA Letter Eleven,”
requiring participating cities to give preferential treatment to
model-neighborhood residents as a necessary condition for retaining
HUD-Model Cities funds (to date, no city has been cut off).

The present paper has as its purpose the explanation of intercity
variation in total and occupation-specific RPRs, and the estimation of
the magnitude of wage discrimination against model-neighborhood
residents. Probably the most important institutional factor informing the
design of the econometric models that follow has been the lack of
independence of Model Cities projects from their local city governments.
Unlike (for example) the Community Action Program of OEO, “All
funds for a Model Cities Program are directed to a local City
Demonstration Agency (CDA) through the elected city council. This gives local
city councils final veto power over the program by ruling out any direct
funding relationships between the community and HUD. It is an
inescapable conclusion that the official intention was to make city hall
responsible for the participation of citizens and to prevent the
development of autonomous, government-financed citizen organizations, as
had occurred in the poverty program” (International City Managers’
Association 1972, pp. 12-13). This is why the specification of explanatory
variables measuring the structure of city government receives the greatest
priority in the following pages.³

² The second Model Cities Survey was conducted during the winter and spring of
1971 by the National Civil Service League, on a contract from HUD. The RPRs were
generally higher in DAs than in CDAs—quite the opposite of what one would expect
considering that CDAs are supposed to directly represent the interests of the model
neighborhood. For all occupations, the respective averages were 56 and 43 percent.

³ In the 1969 study, civil service and “potential black power” variables proved to be
especially significant determinants of resident employment in Model Cities.




As with the Community Action Program (CAP), the employment
aspect of Model Cities has a variety of manifest functions: to give
participants (model-neighborhood residents) direct economic benefits
and skills, to “encourage” city agencies to become more responsive to
the needs of the poor, and to challenge “credentialism” as a barrier to
the public employment of the disadvantaged.⁴ Moreover, the actual
delivery of services by city agencies is also involved. There is increasing
evidence that, in nonwhite urban communities in particular, “no delivery
of services may be possible unless there is participation” (Altshuler 1970,
p. 221).⁵

Again, as with CAP, there are also latent functions associated with the
employment activities of Model Cities. These include the creation of
“demonstration effects” within the community, the development of an
institution around which community organization is at least theoretically
possible, the creation of a “fifth column” for the ghetto inside city hall,
and the focusing of pressure on a system (public personnel) which is
already under attack from other groups such as the still predominantly
white unions.

Evaluations of CAP, while equivocal to pessimistic about the impact
of the program on local government behaviour, have been fairly optimistic
in their assessment of the direct employment impact. Dale Rogers
Marshall writes: “Poor people get jobs or positions in groups which
lead to jobs. They also exhibit increases in political education, information
about how the system works, and in organizational skills and feelings of
political efficacy. Thus participation serves as a training ground for
indigenous leaders” (Marshall 1971, p. 473).


The Correlates of Resident Participation 

The political science and sociology literatures include a number of
quantitative studies of intercity variation in the performance of poverty
programs. Marshall, for example, writes that “differences in the
implementation of participation are associated with differences among the
communities in which [Community Action Agencies] are located. Size
of the city, size of the minority population, and political factors seem to
make a difference” (Marshall 1971, p. 458). A Wisconsin Poverty
Institute study by Michael Aiken and Robert Alford regressed the local
per capita poverty grant in 1966 on five types of independent variables:
size and age of city as proxies for the degree of bureaucratization, the
present manifestations of bureaucratization (such as the presence of a
civil service personnel system), the degree of poverty (as a measure of
“need”), ethnic/cultural heterogeneity (which the federal government
might take to be a predictor of conflict over the grant), and the power of
the chief executive.

⁴ “Credentialism” is defined as the excessive (i.e., job task-unrelated) use of educational
credentials by employers as employee-selection devices, perhaps to facilitate the practice
of “statistical discrimination” (cf. Berg 1970; Ruchlin 1971; Harrison 1972, esp. chap. 1;
on statistical discrimination, see Piore 1971).

⁵ Thus, in its technical assistance efforts for HUD, the National Civil Service League
has been stressing the use of “selective certification” to achieve hiring preference for
model-neighborhood residents, on the grounds that cities cannot meet their legal and
constitutional obligations to provide an equal distribution of services to all communities
unless they use community residents to deliver services to their own neighborhoods.

For Aiken and Alford (1968), the object of study was the relationship
between the “ability to mobilize” (as measured by the five clusters of
variables) and one particular measure of the payoff to mobilization—
acquisition of a federal grant. In the present paper, the problem to be
explored is how the “ability to mobilize” affects the extent to which the
urban poor benefit (“participate”) from those grants in one particular
way—through direct, paid employment.


Specification


We define the resident participation rate as

\[
RPR = \left( \frac{\text{number of employees who are model-neighborhood residents}}{\text{total number of employees}} \right) \times 100 .
\]


Six different multiple regressions will be fitted, one for each of six RPRs
corresponding to (1) all employees, (2) professionals only, (3)
subprofessionals, (4) technicians and operatives, (5) clerical workers, and
(6) laborers. The mean values of these six variables were 48 percent,
26 percent, 66 percent, 55 percent, 48 percent, and 60 percent,
respectively.
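
The definition of the resident participation rate given above translates directly into a small function. The sketch below is illustrative only: the function name and the employee counts in the example are hypothetical and are not taken from the survey data.

```python
# Illustrative sketch of the RPR definition; names and counts are hypothetical.

def resident_participation_rate(resident_employees, total_employees):
    """Percentage of employees who are model-neighborhood residents."""
    if total_employees <= 0:
        raise ValueError("total_employees must be positive")
    if not 0 <= resident_employees <= total_employees:
        raise ValueError("resident_employees must lie between 0 and total_employees")
    return 100.0 * resident_employees / total_employees

# Hypothetical project: 12 of 25 salaried employees live in the model neighborhood.
example_rpr = resident_participation_rate(12, 25)

if __name__ == "__main__":
    print(example_rpr)
```

The same function would be applied separately to each occupational group to obtain the six dependent variables.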

The first class of explanatory variables to be examined measures
actual Model Cities inputs. One independent variable will be cumulative
funding as of December 31, 1970 (X_1). The mean funding level of the
64 projects for which sufficient data were available to permit a complete
analysis was $4.18 million. It is difficult to predict how this would vary
with RPR. An alternative formulation is per capita funding (X_2), that is,
total funding divided by the total number of project employees. The
mean value of (X_2) was $30,040.

The second class of independent variables measures the structure of
the municipal government. The variable (X_3), the age of the city (as an
indicator of its degree of bureaucratization), is measured by the last
two digits of the year (after 1900) when the city reached a population
size of 50,000, a threshold selected after examining the size distribution
of the 64 cities in the sample.⁶ The mean value of (X_3) is 27 (corresponding
to the year 1927). Following Aiken and Alford, we should expect that
the degree of bureaucratization increases with time, so that (as measured
here) age and degree of bureaucratization are inversely related. It is my
guess that city bureaucratization constitutes an obstacle to ghetto
residents; bureaucratization and RPRs are probably also inversely
related. Therefore, the maintained hypothesis is ∂RPR/∂X_3 > 0.

A more static variable is (X_4), the 1970 population level in the city
(mean = 314,562). To the extent that larger cities contain more workers
with whom ghetto dwellers must compete (and if they also contain larger
and/or more strongly entrenched bureaucracies), size and RPR are
likely to be negatively related.

⁶ Cities which attained this scale before 1900 were arbitrarily assigned values of “00.”
Cities which had not yet attained a scale of 50,000 by 1970 were arbitrarily assigned
values of “80.” Data availability dictated this procedure.
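
The coding rule for the city-age variable, including the two arbitrary assignments for cities that reached 50,000 before 1900 or had not reached it by 1970, can be sketched as follows. The function name and the use of None for a city that had not yet reached the threshold are assumptions made for illustration; how the boundary year 1900 itself was coded is not stated in the text, and it is treated here as "00".

```python
# Illustrative sketch of the city-age coding; names and conventions are hypothetical.

def city_age_code(year_reached_50000):
    """Two-digit code for the year a city first reached a population of 50,000.

    Pass None for a city that had not reached 50,000 by 1970.
    """
    if year_reached_50000 is None or year_reached_50000 > 1970:
        return 80  # arbitrary value assigned to cities below 50,000 in 1970
    if year_reached_50000 < 1900:
        return 0   # arbitrary value "00" for cities that reached 50,000 before 1900
    return year_reached_50000 - 1900

if __name__ == "__main__":
    print(city_age_code(1927), city_age_code(1885), city_age_code(None))
```

A larger code therefore marks a younger city, which is why a positive coefficient on this variable corresponds to the hypothesis that older, more bureaucratized cities have lower RPRs.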

There are three different ways to measure the size of the bureaucracy
directly: by the 1968 level of municipal employment (X_5); by the
percentage growth in municipal employment between 1962 and 1968,

\[
X_6 = \left( \frac{\text{municipal employment in 1968} - \text{municipal employment in 1962}}{\text{municipal employment in 1962}} \right) \times 100 ;
\]

and by the proportion of the working residents of the city who are
employed by the municipal government, estimated (somewhat crudely)
by

\[
X_7 = \left( \frac{\text{1962 municipal employment}}{\text{1960 employment of all workers living in the city}} \right) \times 100 .
\]


(Again, data availability forced this rather peculiar specification.
However, since public employment has been growing much more rapidly
than private employment in cities [Harrison 1971], the fact that [X_7] is
undoubtedly biased upward for 1960 makes it a better indicator for the
more recent past.) The effect of increases in (X_5)-(X_7) on RPR is
ambiguous. Larger (stronger?) city bureaucracies are more capable of
effectively blocking ghetto demands. On the other hand, the faster a
city’s payroll grows, the easier it ought to be to accommodate the poor—
especially in jobs subsidized by the federal government.⁷ The mean
values of (X_5)-(X_7) are 4,797, 12.8 percent, and 7.7 percent, respectively.

Still another way of measuring the extent of bureaucratization involves
indicators of the degree to which local personnel practices are
institutionalized. Variables now susceptible of measurement include the
presence or absence of a civil service/tenure system (X_8), whether or
not written examinations are normally administered in hiring for
unskilled vacancies (X_9), whether or not arrest records are used as selection
instruments (X_10), and whether or not the jurisdiction has a full-time
personnel director (X_11). These are all given dummy variable
specifications; their mean values across the sample (interpreted as proportions)
are 0.69, 0.29, 0.74, and 0.91, respectively. Since nearly half of the
Model Cities projects already operate within their local civil service
systems—and all are being urged by HUD to do so—we should expect
(X_8) and (X_10) to be inversely related to RPR (given what we know
about the relatively poor cognitive skills and illegal activities of—as well
as police attitudes toward—many ghetto residents; cf. Harrison 1972,
chaps. 1, 5). I would expect the same to be true for the other two variables
as well.

⁷ Piven (1972) believes that the rapid growth of city payrolls is to a great extent not a
cause but a result of (or response to) white demands for sufficient employment to
maintain “social distance” from blacks, who have unquestionably been increasing their
absolute participation in municipal employment through the poverty programs. Under
this hypothesis, the level and/or rate of growth of city employment would still be positively
correlated with RPR, although the direction of causation would be reversed.

In a study of the Model Cities and Community Action Programs in
Chicago and Detroit, Allan Rosenbaum (1970) showed that, especially
because of “the ambiguity of the federal intention,” both programs
have been redirected “towards the ends of the mayors of the two
cities. As a result the programs have, in most instances in these two cities,
been simply mirror-image reflections of the political-institutional
arrangements and environment in which they operated.” The financial plight
of big-city mayors has been well documented. Given this constant
pressure from influential constituents to locate external resources for
local uses, and assuming that pressure on the mayors from ghetto
organizations and individuals is relatively less pronounced, then the
presence in a city of a strong mayor (or city manager) might well correlate
negatively with RPR, as Model Cities funds are “leaked” to other
(nonghetto) constituents or are substituted for local revenues (which were
already being allocated for the employment of nonghetto residents).
Three indicators of the relative political power of mayors can be
quantified. Let

(X_12) = 1 if city has a mayor/council form of government,
         0 otherwise;

(X_13) = 1 if mayor has veto power over his/her council,
         0 otherwise;

(X_14) = 1 if mayor is in his/her second or greater term,
         0 otherwise.

The mean proportions for these three indicators of mayoral power are
0.71, 0.23, and 0.41, respectively. All three are expected to be inversely
related to RPR (the veto-power variable will actually be specified as
X_12 × X_13).




The third set of explanatory variables measures actual or potential
“black power,” that is, the relative importance of minorities (not only
blacks) in the city. A wide variety of indicators can be specified, including:

(X_15) = model neighborhood population in 1965 (mean = 32,873);

(X_16) = 1 if Citizens’ Advisory Board was elected by the community,
         0 otherwise (mean = 0.72);

(X_17) = 1 if the local CDA director is a member of a minority group,
         0 otherwise (mean = 0.48);

(X_18) = 1970 nonwhite city population (mean = 73,052);

(X_19) = 1970 nonwhite share of total city population (mean = 20 percent);

\[
X_{20} = \left( \frac{\text{1970 nonwhite population} - \text{1960 nonwhite population}}{\text{1960 nonwhite population}} \right) \times 100 \quad (\text{mean} = 53 \text{ percent});
\]

\[
X_{21} = \left( \frac{\text{1970 nonwhite share} - \text{1960 nonwhite share}}{\text{1960 nonwhite share}} \right) \times 100 \quad (\text{mean} = 35 \text{ percent});
\]

(X_22) = 1 if city had a major civil disorder in 1967,
         0 otherwise (mean = 0.42).
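
Several of the variables defined in this section (the growth in municipal employment, the growth in nonwhite population, and the growth in the nonwhite share) share the same percentage-change form. A minimal sketch follows; the function name and the population figures in the example are hypothetical, chosen only to illustrate the calculation.

```python
# Illustrative sketch of the percentage-change variables; the figures are hypothetical.

def percent_change(later, earlier):
    """Percentage growth from an earlier value to a later value."""
    if earlier == 0:
        raise ValueError("earlier value must be nonzero")
    return 100.0 * (later - earlier) / earlier

# Hypothetical city: nonwhite population of 50,000 in 1960 and 76,500 in 1970.
nonwhite_growth = percent_change(76_500, 50_000)

# Hypothetical city: nonwhite share of 16 percent in 1960 and 20 percent in 1970.
share_growth = percent_change(20.0, 16.0)

if __name__ == "__main__":
    print(nonwhite_growth, share_growth)
```

Note that the share-growth variable measures the proportional change in the share, not the change in percentage points.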


It seems reasonable to expect that a large model neighborhood will be
in a position to mobilize greater political support for a program such as
Model Cities than will a smaller target population. Similarly, the act of
holding periodic elections for the local Citizens’ Advisory Boards required
by HUD may both manifest and reinforce community concern for a
“piece of the action” (such concern may not produce results, of course,
but a negative correlation between [$X_{16}$] and RPR is most unlikely).*
The CDA director is appointed by the mayor or city manager; the
selection of a minority director probably manifests a mayoral commitment
to the equity objectives of the program. Thus, it is expected that
$\partial RPR / \partial X_{17} > 0$. Variables ($X_{18}$)-($X_{21}$) measure the level and growth
of relative potential minority voting power in the city; all are expected to


* Boards must be formed by law, but they need not be elected. As indicated in the text,
over a quarter of the boards are in fact appointed. For analyses of the composition and
activities of these Boards, see International City Managers’ Association (1971) and
Harrison (1973).




have a positive effect on RPR. Finally, ($X_{22}$) measures an important
negative sanction which ghetto dwellers are able (albeit often at great
personal cost to themselves) to impose on city hall. The Kerner Commission’s
index of major civil disorders in 1967 provides the source for
quantification of this “riot” variable. Many social scientists attribute
the expansion of antipoverty programs after 1967 to the political tension
generated by the riots; if so, then we should expect that $\partial RPR / \partial X_{22} > 0$.

One final variable deserves a word of explanation: there is

some evidence from earlier studies that unskilled urban workers
in general and minorities in particular tend to fare relatively
better in high wage than in low wage cities. A high wage city
generally contains an elaborate industrial base which consumes
considerable quantities of labor of all skill types. The large
pool of labor made possible by this industrial base provides an
“agglomeration economy” to new employers, in this case, the
CDA. In other words, we hypothesize that model neighborhood
workers will get more Model Cities jobs in high wage cities
because, in such cities, they are more likely to have had work
experience [Harrison 1973, p. 52].

In the 1969 study, this variable,

$X_{23} = \dfrac{\text{manufacturing wage bill in 1963}}{\text{manufacturing man-hours in 1963}}$
(mean = $2.49),

was indeed highly significant for the laborer category. As will be seen,
it was among the most important variables in the present study as well.



Analysis

Initial inspection of the results of fitting the six regression models,

$$RPR_{in} = b_0 + \sum_{j=1}^{23} b_j X_j + e,$$

$i = 1, \ldots, 6$ occupation groups,

$n = 1, \ldots, 64$ cities,

indicated the presence of substantial multicollinearity. With a linear
specification and multiple R’s of .30-.80, first-order (pairwise) correlation
coefficients of more than .30 crudely indicate intercorrelation of sufficient
magnitude to generate potentially serious multicollinearity. Experimentation
with different combinations of variables led to the decision to
delete variables ($X_2$), ($X_4$), ($X_5$), ($X_{12}$), ($X_{15}$), ($X_{18}$), and ($X_{19}$) (the
selection of a variable to represent a collinear set was determined by




ease of policy interpretation). Four theoretically important variables
(city age, the presence of a civil service system, growth of the black
population, and growth of the black population share) were retained
despite some collinearity by running separate pairs of regressions for each
of the six RPRs, including ($X_3$, $X_{20}$) and ($X_6$, $X_{21}$), respectively:


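$$RPR_{in} = b_0 + b_1 X_1 + b_3 X_3 + \sum_{j=9}^{11} b_j X_j + \sum_{j=13}^{14} b_j X_j + \sum_{j=16}^{17} b_j X_j + b_{20} X_{20} + \sum_{j=22}^{23} b_j X_j + e, \qquad i = 1, \ldots, 6; \quad n = 1, \ldots, 64 \tag{1}$$

$$RPR_{in} = b_0 + b_1 X_1 + b_6 X_6 + \sum_{j=7}^{11} b_j X_j + \sum_{j=13}^{14} b_j X_j + \sum_{j=16}^{17} b_j X_j + \sum_{j=21}^{23} b_j X_j + e, \qquad i = 1, \ldots, 6; \quad n = 1, \ldots, 64 \tag{2}$$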
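The pairwise screening rule used in arriving at these specifications (flagging first-order correlations above .30 in absolute value) can be sketched as follows. The helper names `pearson` and `collinear_pairs` and the five-city data are illustrative assumptions; the paper reports neither its correlation matrix nor the software used.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def collinear_pairs(columns, threshold=0.30):
    """Return (name_a, name_b, r) for every pair with |r| above the threshold."""
    flagged = []
    for (name_a, a), (name_b, b) in combinations(columns.items(), 2):
        r = pearson(a, b)
        if abs(r) > threshold:
            flagged.append((name_a, name_b, round(r, 3)))
    return flagged


# Made-up values for five hypothetical cities (not the survey data).
data = {
    "X18": [10, 20, 30, 40, 50],   # nonwhite population, thousands
    "X19": [12, 19, 33, 38, 52],   # nonwhite share, percent
    "X22": [1, 0, 0, 1, 0],        # civil disorder dummy
}

for pair in collinear_pairs(data):
    print(pair)
```

A screen of this kind only identifies candidate collinear sets; choosing which member of a set to keep remains a judgment call, made here on grounds of policy interpretation.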


The results are displayed in table 1. It is immediately apparent that
employees in the 64 Model Cities projects for which complete data were
available fall into two very distinct groups, in terms of the relationships
under study. The proportions of professional and subprofessional hires
drawn from the ghetto are much less strongly related to the political-economic
structure of the city than are the corresponding RPRs for
technicians/operatives, clerical workers, and laborers, as indicated by
the large differences in regression coefficients. This dual stratification of
the Model Cities labor force was confirmed by pooling the professional-subprofessional
observations and the technician-clerical-labor observations,
fitting regressions to each data set, and conducting Chow tests
on the F-statistics for these regressions and one fit to the entire sample.
The null hypothesis that the two structures come from the same population
was rejected at the 99 percent level of confidence. For this reason, the
all-occupation results have been deleted from table 1.
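For readers unfamiliar with the procedure, the textbook form of the Chow statistic compares the residual sum of squares from a single pooled regression with the combined residual sums of squares from the two subsample regressions. The sketch below assumes that standard form, since the paper does not print its formula; the function name `chow_f` and every number passed to it are hypothetical.

```python
def chow_f(ssr_pooled, ssr_group1, ssr_group2, n1, n2, k):
    """Chow F statistic for equal coefficients across two subsamples.

    k is the number of estimated parameters per regression, intercept included.
    Under the null, the statistic is distributed F(k, n1 + n2 - 2k).
    """
    ssr_unrestricted = ssr_group1 + ssr_group2
    numerator = (ssr_pooled - ssr_unrestricted) / k
    denominator = ssr_unrestricted / (n1 + n2 - 2 * k)
    return numerator / denominator


# Hypothetical inputs: two occupation groups pooled over 64 cities (n1 = 128),
# three occupation groups pooled over 64 cities (n2 = 192), 15 parameters each.
f_stat = chow_f(ssr_pooled=500.0, ssr_group1=150.0, ssr_group2=200.0,
                n1=128, n2=192, k=15)
print(f"F(15, {128 + 192 - 2 * 15}) = {f_stat:.2f}")
```

A large value relative to the critical F leads to rejection of the hypothesis that one set of coefficients fits both groups, which is the sense in which the two structures were found to differ.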

These differences are worth exploring further. It is true that the
participation of ghetto residents in all five occupation groups is positively
promoted by the rate of growth and relative importance of municipal




employment (variables and Xf).* Moreover, subprofessionals at least
share with the nonskilled labor categories the circumstance of being
relatively disadvantaged by the presence in a city of a civil service/tenure
system ($X_6$) and a strong mayor, as measured by whether or not the
latter has a veto over his or her council. On the other hand,
professional ghetto residents gain in relative employment from civil
service ($X_6$) and written exam ({X^)) propensities in the municipality; the
other groups lose. This is especially true for laborers, for whom a city’s
decision to use written exams for unskilled jobs is associated with a
decrease of fully 5 percent in their resident participation rate. Moreover,
while a number of the “black power” variables, including the race of
the CDA director ($X_{17}$), the growth of the black population level ($X_{20}$)
and share ($X_{21}$), and whether or not the city experienced a riot in 1967
($X_{22}$), positively affected the RPRs of the second group of ghetto workers,
they were either unrelated to the RPRs of professionals and subprofessionals
or else, in the case of ($X_{21}$), had a negative effect, implying
that the increasing relative importance of blacks in a city’s population
endangers the relative opportunities of the middle class in the ghetto.

Within the second group of occupations, the funding variable (Jf,) is
positively associated with RPR for technicians/operatives and laborers
(two categories whose members are nearly all males). Apparently, it is
the projects which are the best funded that most intensively employ
model-neighborhood unskilled or semiskilled (“working-class”) men.
These are also the two categories of sample workers who are the most
likely to have acquired police arrest records. For them, a jurisdiction’s
propensity to use such records as screening devices reduces their chances
of employment in the program, as expected. The relationship between
($X_{10}$) and the technician/operative and laborer RPRs is negative and
highly significant.

For the three nonskilled occupations, the government-structure
variables behave quite consistently (although age of city [$X_3$] was never
statistically significant, perhaps because of its restricted range). Both
“mayoral power” variables ($X_{13}$, $X_{14}$) contribute to reduced resident
participation, as hypothesized. Whether or not the city had a full-time
personnel officer was of no consequence.

As indicated before, all three nonskilled occupational groups benefit
from increases in actual or latent “black power.” The “riot” variable
($X_{22}$) displays larger t-statistics than any other variable in the model,

* Still, the professional and subprofessional coefficients tend to be rather larger than
those for the other occupations; the respective ranges are 0 %2-2.371 and 0.593-1.113
for {Xf) and 1.372-1.451 and 0.056-0 OM for {X^). As the municipal demand for labor
grows, ghetto professionals and subprofessionals perhaps face relatively less competition
from “outsiders” than do workers in the other occupations because of the relative shortages
of skilled (or specialized) labor residing in (or available to) the city as a whole.
These shortages do not in general obtain for the other occupational categories.



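TABLE 1

Coefficients Estimated from Resident Participation Regressions

[Table body not recoverable from the source; only the column group heading “Black Power Variables” and a fragment of the significance note survive.]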





and its quantitative effect on the RPR of nonskilled ghetto workers ranges
between 5.6 and 6.3 percent. It would appear highly likely that those
city halls which experienced major civil disorders in 1967 have been more
responsive to the employment demands of unskilled ghetto workers.
Finally, the manufacturing wage variable ($X_{23}$) again displays a
positive and significant relationship to resident participation for nonskilled
workers. This is consistent with the hypothesis that high-wage
cities tend to contain a large “secondary labor market” which employs
large numbers of unskilled and clerical workers (Gordon 1972; Harrison
1972, chap. 5). To the extent that (even unstable) work experience
enhances a ghetto dweller’s chances for employment in the Model Cities
Program (whether because of the recruiting practices of the CDA, job
search practices of the resident workers, or both), we should expect to
find a positive relationship between RPR and a proxy for the degree
of work experience of the unskilled ghetto labor force. The average wage
rate may be such a proxy.


Wage Discrimination against Model-Neighborhood Residents

The survey of 75 Model Cities projects indicated that both part-time and
full-time nonmodel-neighborhood residents earn higher incomes than
residents of the target neighborhoods. For whites, the ratio of median
resident/nonresident annual earnings ranged (across all occupations or
job categories) from .76 to .97. The range for minority employees (blacks,
Puerto Ricans, chicanos, American Indians, orientals, and “others”)
was .57-1.06. The first-order correlations between residence (= 1 if MN,
0 otherwise) and earnings were -0.623 for whites and -0.647 for
minorities; both coefficients were significantly different from zero at the
99 percent level of confidence.

Multiple regression analysis was employed in order to explore several
possible explanations for this apparent wage discrimination against
model-neighborhood residents. Differences in earnings might be attributable
to the relatively lower incidence of possession by ghetto residents
of educational credentials required for “admission” to municipal jobs.
A complementary explanation, proposed most recently by Barbara
Bergmann (1971), suggests that neighborhood residents are “crowded”
disproportionately into the lowest-paying jobs. This is likely to be
especially prevalent for female employees.

Two linear microregressions were estimated for the 10,176 white and
two for the 4,131 minority employees. The results are displayed in table
2. Clearly, model-neighborhood residents earn systematically lower

Separate models for examining the relationship between earnings and education
on the one hand and earnings and occupation on the other were necessitated by the high
correlation between education and occupation.



TABLE 2

Employee Wage Regressions

Dependent Variable: Annual Earnings ($)

                                                          Whites                    Minorities
Independent Variable                                 (1)          (2)          (1)          (2)

X1  = Model-neighborhood resident (0-1)           -1237.70*    -1179.36*    -1445.65*    -1311.26*
                                                    (20.07)      (22.76)      (67.34)      (69.62)
X2  = Male (0-1)                                    524.26*      516.05*      929.64*      860.79*
                                                    (20.64)      (23.13)      (69.60)      (52.98)
X3  = Age (years)                                   232.65*      228.47*      189.23*      163.84*
                                                    (18.78)      (19.70)      (23.46)      (23.23)
X4  = Age squared                                    -6.14*       -7.68*       -4.23*       -3.19*
                                                     (0.25)       (0.26)       (0.30)       (0.29)
X5  = Full-time worker (usually >= 35
      hours/week) (0-1)                            1049.91*     1045.06*      622.05*      638.03*
                                                    (21.25)      (22.92)      (95.24)     (104.15)
X6  = Civil service/tenure coverage (0-1)           -17.93        -9.30      -102.14      -148.33
                                                    (25.13)      (26.54)      (80.31)      (79.54)
X7  = Completed exactly 12 years of
      school (0-1)                                   80.80         ...        -34.43         ...
                                                    (54.73)                  (270.02)
X8  = Completed exactly 13-15 years of
      school (0-1)                                  -64.61         ...       1030.85*        ...
                                                    (46.38)                  (271.58)
X9  = Completed 16 or more years of
      school (0-1)                                 2551.21*        ...       1826.55*        ...
                                                   (237.38)                  (258.35)
X10 = Professional (0-1)                              ...       2287.68*        ...       1441.13*
                                                                 (41.03)                  (128.75)
X11 = Subprofessional (0-1)                           ...        820.07*        ...       1021.12*
                                                                 (39.22)                  (122.33)
X12 = Technician/operative (0-1)                      ...        -37.50         ...       -404.94
                                                                 (67.50)                  (209.99)
X13 = Clerical worker (0-1)                           ...        114.56*        ...        248.35
                                                                 (15.23)                  (142.79)
X14 = Employed directly by the CDA (0-1)           2716.07*     2813.60*     3266.46*     3609.09*
                                                    (58.20)      (61.31)     (156.23)     (135.81)
X15 = Employed by a delegate agency
      producing education services (0-1)             83.98*       77.90*      579.28*      550.83*
                                                    (35.20)      (35.03)     (151.05)     (142.79)
X16 = Employed by some other type of
      delegate agency (a) (0-1)                      21.71        22.34       573.41*      532.17*
                                                    (32.71)      (34.72)     (136.82)     (131.43)

Degrees of freedom                               12, 10163    13, 10162     12, 4118     13, 4117
R-squared                                           0.846        0.828        0.518        0.530

Source: HUD-NCSL Model Cities Survey.
Note: Standard errors are in parentheses.

(a) To avoid singularity in the [X'X] matrix, a category of miscellaneous delegate agencies employing 12% of all Model Cities workers was removed from the set of agencies reflected in variable X16.

* Significantly different from zero at the .05 level.



TABLE 3

Expected Annual Earnings for Two Representative Model Cities Worker Types
(in Dollars)

                   Nonresident      Resident
                   Accountant       Teacher’s Aide

White:
  Eq. (1)            $13,651          $6,706
  Eq. (2)             13,263           7,364
Minority:
  Eq. (1)             12,194           4,805
  Eq. (2)             11,379           5,729
Difference:
  Eq. (1)              1,457           1,901
  Eq. (2)              1,884           1,635


salaries than nonresidents, even after the effects of differential education,
occupation, and “industry” (CDA, education-oriented delegate agency,
or other delegate agency) have been removed. Among whites, the
expected differential in earnings attributable to residence is $1,179-$1,238
per year; for minorities, the differential is even larger: $1,311-$1,446
per year. Neither occupational “crowding” nor differences in years of
schooling provide sufficient causal explanations; in fact, experiments
with interaction terms such as ($X_1 X_7$), ($X_1 X_8$), and ($X_1 X_9$) showed the
marginal effect of education on earnings to be rather insensitive to
residence.

This wage discrimination against model-neighborhood residents is
compounded by additional wage discrimination against minorities.*
Consider, for example, two representative worker types: (a) a 30-year-old,
nonresident, college-educated, male professional employed full-time by
the CDA (say, as an accountant), and (b) a 30-year-old resident female
subprofessional with only a high school diploma, working full-time for a
local school board (say, as a teacher’s aide). Substitution of the appropriate
values into the fitted regressions yields the expected earnings
estimates displayed in table 3. The differences between annual earnings
by race are remarkably similar for two such disparate worker types:

* The raw data permit the identification of 23 specific occupations (e.g., “subprofessional-law
enforcement”) and 23 employing agencies (e.g., “fire department”).
Analysis of regressions specified at this level of detail shows that earnings vary nearly as
much within as among occupations and agencies, suggesting that the crowding hypothesis
as a theory of wage determination may not apply to the public sector. For regressions
including (a) neither occupations nor agencies, (b) occupations only, (c) agencies only,
and (d) occupations and agencies, the respective coefficients of variation were .17, .15,
.15, and .12.

The mean white wage was $9,023; the mean minority wage was $6,912. The sample
of neighborhood, i.e., ghetto, residents is not biased toward minorities. Forty-nine percent
of the 10,176 whites lived in the target areas, while 51 percent of the 4,131 minority
employees were model-neighborhood residents.





about $1,500-$1,900 for the nonghetto accountant and $1,600-$1,900
for the ghetto teacher’s aide. It is, of course, theoretically possible that
these differences are in fact attributable to some factor(s) not specified
in the model, rather than to race per se. However, with between 52 and
85 percent of the wage variation explained by the specified models
(a very high degree of “explanatory power” for cross-section regression
analysis), it is likely that at least part of these estimated differences are
indeed attributable to discrimination.
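The substitution exercise behind table 3 can be partly retraced from the coefficients in table 2. Because the intercepts are not reported, only the gap between the two worker types can be recomputed, not the levels. The sketch below uses the equation (2) coefficients as printed; the dictionary keys and the function name `earnings_gap` are hypothetical labels, and two assumptions are made: that the local school board counts as a delegate agency producing education services, and that civil service coverage is the same for both workers (age and full-time status are identical by construction and cancel).

```python
# Equation (2) coefficients from table 2 that differ between the two worker types.
WHITE_EQ2 = {
    "resident": -1179.36,
    "male": 516.05,
    "professional": 2287.68,
    "subprofessional": 820.07,
    "cda": 2813.60,
    "education_agency": 77.90,
}
MINORITY_EQ2 = {
    "resident": -1311.26,
    "male": 860.79,
    "professional": 1441.13,
    "subprofessional": 1021.12,
    "cda": 3609.09,
    "education_agency": 550.83,
}

# Dummy-variable profiles for the two representative workers.
accountant = {"resident": 0, "male": 1, "professional": 1,
              "subprofessional": 0, "cda": 1, "education_agency": 0}
teachers_aide = {"resident": 1, "male": 0, "professional": 0,
                 "subprofessional": 1, "cda": 0, "education_agency": 1}


def earnings_gap(coefs, worker_a, worker_b):
    """Difference in fitted earnings, worker_a minus worker_b (shared terms cancel)."""
    return sum(b * (worker_a[name] - worker_b[name]) for name, b in coefs.items())


gap_white = earnings_gap(WHITE_EQ2, accountant, teachers_aide)
gap_minority = earnings_gap(MINORITY_EQ2, accountant, teachers_aide)
print(round(gap_white, 2), round(gap_minority, 2))
```

Under these assumptions the recomputed gaps agree, to within a dollar, with the equation (2) rows of table 3: $13,263 less $7,364 for whites and $11,379 less $5,729 for minorities.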

Summary and Conclusions 

The Model Cities Program of the U.S. Department of Housing and
Urban Development, although it is principally concerned with the
delivery of services to the urban ghetto, has in fact become one of the
largest manpower programs operating in urban labor markets. In 1971,
HUD subsidized some 25,000 budgeted positions in 903 municipal and
nonprofit agencies, at an average salary of nearly $8,500. Two-thirds
of the participating agencies provided on-the-job and other forms of
training to Model Cities employees.

Both the original legislation and subsequent administrative rulings by
HUD require each participating city to give preference in hiring to
residents of the target areas (the urban ghetto). Yet, by the spring of 1971,
fewer than half of all the jobs in the program were in fact held by ghetto
residents. Moreover, nonresidents were being paid significantly higher
salaries than residents for performing essentially the same tasks.

This paper has attempted to identify the determinants of intercity
variation in “resident participation rates” (the share of jobs held by
ghetto residents) and the causes of the resident/nonresident differences
in earnings.

In the resident-participation regressions, particular attention was
addressed to variables reflecting the political structure of the cities in the
program, including degree of bureaucratization (e.g., civil service
variables) and mayoral political strength (as measured by, e.g., number
of terms and veto powers), and actual or latent “black power” (e.g.,
whether or not the city experienced a major civil disorder in 1967).
In general, cities with highly structured bureaucracies, strong mayors,
and formal civil service systems and procedures allocate a larger share
of Model Cities jobs to “outsiders” (in violation of the federal guidelines)
than is the case for other cities. On the other hand, real or potential
black political power increases ghetto participation in the Model Cities
Program, as does the presence of strong private-sector demand for
unskilled labor (the latter effect is probably related to the propensities
of Model Cities agencies to use previous work experience as an employee-selection
criterion).





Analysis of 14,307 individual employee records shows that neither
occupational “crowding” nor lack of educational credentials sufficiently
explains the salary differences between ghetto and nonghetto Model
Cities workers, after controlling for sex, age, employment status, civil
service coverage, and type of employing municipal or nonprofit agency.
Moreover, we were able to identify what appears to be an independent
discriminatory effect by race.

I must conclude that, in terms of the legislative mandate for giving
priority in Model Cities employment to the residents of the target areas
themselves, these findings are disheartening. The conclusion of my
earlier study is no less valid here: “Once again, the urban poor have not
received as large a share as might have been possible of the benefits
associated with a government program ostensibly directed to improving
their welfare. This ‘leakage’ of public anti-poverty resources into the
hands of individuals at least some of whom are of doubtful standing as
proper recipients has characterized other federal programs. It is unfortunate
that Model Cities appears to have begun its career in a similar
way” (Harrison 1973, p. 54).

These results also bear on another, and much larger, issue of public
policy: the question of decentralization of manpower programs. The
research on which this paper reports suggests (although it of course does
not prove) that local governments with the political power to do so
(those with strong bureaucracies and powerful chief executives) can and
will distort the equity objectives of federal programs. Findings such as
these therefore have a useful role to play in future discussions about the
reorganization of manpower and antipoverty efforts.


References 

Aiken, Michael, and Alford, Robert R. “Community Structure and Mobilization:
The Case of the War on Poverty.” Discussion Paper no. 29-68, Univ. Wisconsin,
Inst. Res. Poverty, October 1968.

Altshuler, Alan. Community Control: The Black Demand for Participation in Large
American Cities. New York: Pegasus, 1970.

Berg, Ivar. Education and Jobs: The Great Training Robbery. New York: Praeger,
1970.

Bergmann, Barbara R. “The Effect on White Incomes of Discrimination in
Employment.” J.P.E. 79 (March/April 1971): 294-313.

Gordon, David M. Theories of Poverty and Underemployment. Lexington, Mass.:
Heath-Lexington, 1972.

Harrison, Bennett. Public Employment and Urban Poverty. Washington: Urban Inst.,
1971.

---. Education, Training, and the Urban Ghetto. Baltimore: Johns Hopkins, 1972.

---. “The Participation of Ghetto Residents in the Model Cities Program.”
J. American Inst. Planners 39 (January 1973): 43-55.




International City Managers’ Association. “Citizens and Model Cities: Major
Issues and Relationships.” In 1971 Municipal Year Book. Chicago: Internat.
City Managers’ Assoc., 1972.

Marshall, Dale Rogers. “Public Participation and the Politics of Poverty.” In
Race, Change, and Society, edited by Peter Orleans and William Russell Ellis, Jr.
Los Angeles: Sage, 1971.

Piore, Michael J. “The Dual Labor Market.” In Problems in Political Economy: An
Urban Perspective, edited by David M. Gordon. Lexington, Mass.: Heath, 1971.

Piven, Frances Fox. “Cutting Up the City Pie.” New Republic, February 5, 1972,
pp. 17-22.

Rosenbaum, Allan. “Participation, Programs, and Politics: The Federal Impact
on the Metropolis.” Paper delivered to the 1970 meetings of the American
Political Science Association, Los Angeles.

Ruchlin, Hirsch S. “Education as a Labor Market Variable.” Indus. Relations
10 (October 1971): 287-300.




Optimum City Size: The External
Diseconomy Question


J. V. Henderson

Queen’s University, Canada


The paper discusses whether market-achieved city size is greater or less
than optimum city size when externalities such as pollution are present.
Optimally taxing or controlling pollution may or may not lead to a
decline in production of polluting goods. Moreover, taxation reorders
production and consumption patterns in the city so that the minimum
net effect on welfare of city inhabitants is the decline in pollution. Since
pollution taxation increases the welfare of city inhabitants, city size rises
with immigration from the rest of the economy.


A popularly held idea is that cities, in particular big cities, are too large
and that optimum size cities would be smaller.¹ Nonoptimality arises
because of unpriced production or consumption externalities such as
noise, air, or water pollution and congestion, which if properly controlled
would lead to a reduction in city sizes. This paper disputes this conclusion
for numerous reasons. For illustrative purposes, the paper concentrates
on an analysis of air and water pollution.

The first task is to formulate the problem in a simple but appropriate
context. Firms produce an export good in a city. Due to a production
input such as sulphur-bearing coal or petroleum, the production of the
export good results in air- or water-borne emissions such as sulphur
dioxide, carbon by-products, and phenols.² This pollution may affect
production efficiency of firms or households, or it may just enter utility


The theme and the general approach to a few of the problems discussed in the paper
were inspired by an unpublished paper by George Tolley. The author is further indebted
to Oded Israeli and anonymous referees for helpful written comments on the paper and
is indebted to members of the Urban Economics Workshop at the University of Chicago
for verbal comments on the June 1972 presentation of this paper.

¹ For example, see Sandquist (1970). See Mills and de Ferranti (1970) for a discussion
of the subject. This second article reaches no firm conclusions on this subject.

² See Kneese (1971) for a description of the multiple forms and effects of pollution.





functions of people living in the city. Concerning this, one might hypothesize
that since many work places are air-conditioned and purified
primarily to stabilize seasonal fluctuations in temperature, air pollution
in general does not enter production functions or affect efficiency
significantly. Similarly, water pollution little affects the efficiency of
water use in industrial processes. However, pollution certainly affects
consumption activity, either directly entering the utility function or
affecting efficiency of household production. That is, it is part of the
amenity income from living in a city.

What happens if these externalities are priced to equate social and
private marginal costs (assuming they can be priced)?³ The usual analysis
states that, if production of a good results in external diseconomies,
optimally taxing this good will result in a decline in its production and an
increase in production of other goods. The inference is then made that if
production of the polluting good falls, the size of cities producing that
good falls. In contrast, my analysis leads to the following conclusion.
Optimally pricing pollution will lead to a reordering of production
priorities in cities and the economy away from goods responsible for
pollution. Within the city, the adverse effect of reduced production of an
export good will be more than offset by the increase in output of other
goods and the decline in pollution. Welfare of city residents will rise, and
immigration to the city from the rest of the economy will then occur.

Before proceeding to the main analysis, the concept of a city and of a
system of cities in an economy must be examined. In discussing optimum
city size, it is very important to understand what a city is and does and
how it relates to the total economy.

Cities produce a bundle of goods. As characterized by Mills (1967),
in even the most simple model, cities produce an export good or bundle of
goods, housing, and a commuting good. Agglomeration in cities occurs
because of scale economies in producing, say, the export good. Workers
work in the city center producing the export good and commute from
their home site to the city center and back each day. As city size increases,
per person commuting costs rise due to increasing spatial distances that
workers must commute and rising congestion on the roads. As expressed
in Mills (1967) and in Henderson (1972), these rising commuting costs
with agglomeration may offset the benefits of agglomeration or scale
economy exploitation in export good production. That is, there is an
efficient city size for production and consumption.

Extending this analysis, Henderson (1972) then shows that if there is an
efficient city size where net benefits of agglomeration are maximized,⁴

³ See Baumol and Oates (1971) and Bohm (1971) for a description of pricing schemes
and other methods of pollution control.

⁴ Henderson does not state that the market achieves this efficient city size. His analysis
and any reasons for nonoptimality are independent of the analysis in this paper or
external diseconomies.




there may be multiple cities producing the same bundle of goods.⁵
Further, as has been asserted by numerous people (e.g., Lösch 1954 and
Beckmann 1968), Henderson also shows there are different types of
cities in the economy. Each type of city specializes in the production of a
different good or bundle of export goods.⁶

These propositions indicate there is a system of cities in an economy
with different types of cities producing different bundles of export goods.
Within each type of city, there may be one or multiple cities depending on
economy or world demand for the export goods of that city type, their
costs of transportation in trade (the Löschian [1954] limitation on city
size), and their production technology, in particular, the degree of
scale economies (the extent of city agglomeration economies). In discussing
the effect upon city size of taxing pollution in cities producing
polluting goods, later in the paper we will take into account the relationship
between these cities and other types of cities in the economy.


Externalities in Consumption 

In this section, we formally specify the nature of pollution and examine
what happens to city size if pollution is optimally priced. Our analysis is
carried out in terms of a representative city in the economy of the type
that produces a polluting good. Implications of our analysis for the country
as a whole rather than just the representative city are examined later in
the paper.

Our representative city produces X, an export good, and Y, a nontraded
good or bundle of goods such as housing and retail services. The
production functions for the X and Y industry are, respectively,

$$X = X(L_x, N), \tag{1}$$

$$Y = Y(L_y), \tag{2}$$

where $L_x$ and $L_y$ are labor inputs and $N$ is the input of an imported
natural resource. These functions are assumed to be locally homogeneous
of degree one. As explained above, to have urban agglomeration, either

⁵ This result can be derived in a Löschian (1954) world where efficient city size is
limited by the increasing transport costs of supplying larger and larger rural areas as
city size increases.

⁶ Henderson’s (1972) reasoning is as follows. If the production of two export goods or
bundles of export goods involves no benefits from a common location such as utilizing a
common specialized labor force or locationally fixed intermediate input, then it may be
beneficial to locate the production of these goods in different cities. Locating their
production together increases the labor force that must be housed and that adds to average
commuting costs in the city while not increasing efficiency in the production of the traded
goods. Separating the production of the two goods into different cities lowers the increase
in per person commuting costs relative to scale economy exploitation per unit of traded-good
output, as traded-good production and city size rise. Separating the two industries
of course involves further transportation expenditures due to intercity trade, and a
complete analysis would have to account for these costs.




X or Y would globally have to have increasing returns to scale. Local
linear homogeneity could imply (dis)economies of scale are exhausted,
at least at current city size.⁷ Note that urban scale economies are also
needed to explain why people would cluster together in a city where there
is pollution which they could avoid by spatially moving apart. We will
comment on these scale economy problems below.

The fact that firms in the X industry use an input N, such as coal or
petroleum, results in pollution such as sulphur dioxide or carbon emissions.
Firms using N all contribute flows of emissions that result in a “stock” of
pollution. This stock then enters people’s utility functions or firm production
functions as an unpriced public good (bad). The pollution
production function is $P = P(N_1, N_2, \ldots, N_n)$, where $N_i$ are N inputs
of the ith firm in the X industry and where $\partial P / \partial N_i \gtrless P / N_i$. Note that
firms’ marginal contributions to the stock of pollution, $\partial P / \partial N_i$, may differ
depending on relative spatial locations of firms and prevailing winds or on
individual firm production functions. In this section we assume all
$\partial P / \partial N_i$’s are equal or pollution is a function of the industry use of N:

$$P = P(N). \tag{3}$$

The production function of P is not homogeneous of degree one. At low
levels of X output, additions of N add little to pollution. At higher levels
the natural absorption of pollution by the atmosphere or water due to
chemical breakdown or spatial distribution by wind or water currents is
slowed, and additions of N contribute significantly to pollution. This
formulation of pollution being a “stock” created by emission flows from
individual firms is due to Tolley (1969).

Pollution is currently unpriced and enters people's utility functions as a
negative amenity. Since pollution is caused by use of N, this means N is
improperly priced; that is, units of N should be taxed according to their
indirect effect through pollution on people's utility. What happens to
output and city size when units of N and pollution are taxed? Recall that N
is an imported resource. The social marginal cost of N lies above its
import price. By taxing N and reducing its use, one feels intuitively that
people in the city should be made better off, since we are reducing the
use of an input for which social cost exceeds expenditures or perceived
private benefits (marginal product). Just how city inhabitants are made
better off is examined rigorously below, both to emphasize the interactions

⁷ Why is it, for example, that major cities such as New York, Boston, Chicago, Los
Angeles, etc. keep growing at a steady rate comparable to the national population growth
rate? Since these cities expand neither more rapidly nor more slowly than the average,
neither can their scale diseconomies be overwhelming nor can their scale
economies be unfulfilled. I am suggesting they have reached a stage where city output
as a whole has constant returns to scale. Just how that breaks down for different sectors is
another question.




involved and to prove the point. In particular, we are curious as to how
production and consumption patterns change, how wages change, and
what the effect of returning pollution tax proceeds to city inhabitants is.

In setting up the problem, we assume city inhabitants have equal utility
levels with inhabitants in other cities or parts of the economy. We assume
initially, following the pollution tax, that city inhabitants remain in the
city. It is the usage of N and output and consumption patterns that adjust
first to the pollution tax. Following this adjustment, if the welfare of city
inhabitants rises, then migration to this city from the rest of the economy
should occur over time. Essentially we are solving a problem that is a
"partial equilibrium" problem with respect to the rest of the economy.
To solve rigorously for an economy-wide general equilibrium solution, one
would want to have a total model of an economy solving for all city sizes
and factor usages where pollution is initially unpriced and then priced
in those cities where there is pollution. Algebraically this is not feasible,⁸
but I believe the arguments in this paper will indicate that the directional
changes in city size in my model would persist in a full solution.

To examine the adjustment of the city to the pollution tax, we set up a
simple general equilibrium model of a city. The production functions
for the city are given in equations (1), (2), and (3). On the consumption
side, it is assumed all consumers have identical tastes and incomes. City
population, L, is equated to city labor force, $L_X + L_Y$, for simplicity.
Per person utility is defined by

$$U_i = U_i(X^c/L,\ Y/L,\ Z^c/L,\ P), \qquad i = 1, \ldots, m. \tag{4}$$

For X and Z, superscripts, c, refer to goods consumed in the city as
opposed to those produced in the city. The city exports $X - X^c$ and
imports $Z^c$ and the input N. The balance of trade equation for the city is

$$p_X(X - X^c) = p_Z Z^c + p_N N, \tag{5}$$

or city income (which equals $p_L L$ before pollution taxes) equals

$$I = p_X X + p_Y Y - p_N N, \tag{6}$$

where p's refer to prices. The $p_X$ and $p_Z$ are determined in national or
international markets and are fixed for the city (see discussion below).
When N is taxed for pollution, the tax proceeds are redistributed to
laborers in the city. Then per laborer income, I/L, is

$$\frac{I}{L} = p_X\frac{X^c}{L} + p_Y\frac{Y}{L} + p_Z\frac{Z^c}{L} + \frac{p_p}{L}P, \qquad p_p < 0, \tag{7}$$

⁸ Henderson (1972) solves for a general equilibrium model of an economy with cities.
That solution is complicated enough without adding in externalities and taxation
solutions to externalities. The model simply becomes unsolvable.




where $p_p$ is the price per unit of pollution and $(p_p/L)P$ is the negative per
person head tax. If we maximize utility in equation (4) subject to the
budget constraint in equation (7), we can calculate the optimal pollution
tax or tax on N, which we will call t. From the constrained maximization
problem, the Samuelson public-good condition says

$$p_p/p_X = \sum_{i=1}^{m} U_P/U_X, \tag{8}$$

where $U_X = \partial U/\partial X$, $U_P = \partial U/\partial P < 0$. Equation (8) states that the
price ratio equals the sum of the marginal utilities of the public good
normalized by the marginal utility of a private good. To determine the
relationship of the pollution price to the tax on N, we first examine the
relationship between head taxes (pollution prices) and taxes on N and
then use the production relationship for P in equation (3). If P and N
were the original quantities of pollution and N, then

$$p_p(P - \Delta P) = -t(N - \Delta N), \qquad t > 0, \tag{9}$$

which says that proceeds from the post-tax employment of N equal
subsidies (or negative head tax) paid to laborers, where Δ's refer to changes.
Rearranging and using equations (3), (8), and (9), we get the tax

$$t = -p_X \sum_{i=1}^{m}\frac{U_P}{U_X}\,(1 - N^*)^{-1}\left(\frac{P}{N} - P_N N^*\right) > 0,^{9} \tag{10}$$

where an asterisk refers to rates of change and $P_N = dP/dN$.

We can now turn to examining the effect of the tax, t, upon equilibrium
in the city. To determine the change in equilibrium, we employ equations
of change from Jones's (1965) paper on general equilibrium models.
The production side of the model is given by Jones's equations of full
employment and equations indicating that price equals marginal (average)
cost, or that there is perfect competition:

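$$\begin{bmatrix} a_{XL} & a_{YL} \\ a_{XN} & 0 \end{bmatrix}\begin{bmatrix} X \\ Y \end{bmatrix} = \begin{bmatrix} L \\ N \end{bmatrix}, \tag{11}$$

$$\begin{bmatrix} a_{XL} & a_{XN} \\ a_{YL} & 0 \end{bmatrix}\begin{bmatrix} p_L \\ p_N \end{bmatrix} = \begin{bmatrix} p_X \\ p_Y \end{bmatrix}, \tag{12}$$

where $a_{ij}$ from activity analysis refers to the amount of the jth input used
in a unit of the ith output.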

From equation (12), we want first to obtain price changes that may
occur on the production side of the model. We assume that the price of X
is fixed in national or international markets and that the import price of N
is constant. The purchase price of N rises by the amount of the tax, t.
If the purchase price of N rises and $p_X$ is fixed, then $p_L$, the price of
labor, must fall to keep the cost of producing X from rising. If $p_L$ falls, then
$p_Y$, the price of the nontraded good, must fall in perfect competition since
labor is the only factor of production in Y. These results may be obtained
formally by differentiating equation (12) and solving to yield¹⁰

$$p_Y^* = p_L^* = -\frac{\theta_{XN}}{\theta_{XL}}\,p_N^*, \tag{13}$$

where asterisks refer to rates of change (e.g., $p_L^* = dp_L/p_L$), $\theta_{ij}$ is the jth
factor's share in the revenue of the ith good, and $p_N^* = t/p_N$.
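
As a numerical illustration of equation (13), the following Python sketch computes the proportional price changes for hypothetical parameter values. The function name `price_changes` and all numbers are assumptions made for illustration; only the zero-profit relationship between the factor shares and the price changes is taken from the text.

```python
def price_changes(theta_xn: float, tax: float, p_n: float) -> dict:
    """Proportional changes in p_N, p_L, and p_Y after a tax on N, with p_X fixed.

    theta_xn is N's share in the revenue of X; labor's share is the remainder.
    """
    theta_xl = 1.0 - theta_xn          # factor shares in X sum to one
    p_n_star = tax / p_n               # rate of change of the purchase price of N
    # Zero profits in X with p_X fixed: theta_xl * p_L* + theta_xn * p_N* = 0
    p_l_star = -(theta_xn / theta_xl) * p_n_star
    # Labor is the only input in Y, so p_Y moves one for one with the wage
    p_y_star = p_l_star
    return {"p_N": p_n_star, "p_L": p_l_star, "p_Y": p_y_star}


# Hypothetical values: N earns 20 percent of X revenue; the tax is 5 percent of p_N
changes = price_changes(theta_xn=0.2, tax=0.5, p_n=10.0)
for name, value in changes.items():
    print(f"{name}* = {value:+.4f}")
```

The larger the share of N in the revenue of X, the larger the fall in the wage and in the price of the nontraded good that is needed to keep the unit cost of X unchanged.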

The price changes in equation (13) induce changes in factor inputs, N,
$L_X$ or $a_{XL}X$, and $L_Y$ or $a_{YL}Y$ in equation (11). Using an elasticity of
substitution defined as $\sigma_X = (a_{XL}^* - a_{XN}^*)/(p_N^* - p_L^*)$ and the relationships
in equation (13), we differentiate equation (11) and simplify to get

$$\lambda_{XL}X^* + \lambda_{YL}Y^* = L^* - \lambda_{XL}\frac{\theta_{XN}}{\theta_{XL}}\sigma_X\,p_N^*, \qquad X^* - N^* = \sigma_X\,p_N^*, \tag{14}$$

where $\lambda_{ij}$ is the fraction of the jth factor used in production of the ith
good (note $\lambda_{XN} = 1$ and $\lambda_{YN} = 0$). Initially $L^* = 0$ by assumption.

Changes in output are determined by changes in factor inputs suggested
in equation (14) and by demand conditions. Demands for $X^c$, Y, and $Z^c$
are determined by income and prices. Since $p_X$ and $p_Z$ (the price of the
import good) are fixed in national markets, to solve for changes in equilibrium
output (not consumption) we only need one demand equation,
$Y(I, p_X, p_Y, p_Z)$. We differentiate this equation, substituting in
$p_X^* = p_Z^* = 0$ and substituting for I from equation (6), where it is assumed
that all pollution tax proceeds are returned to city inhabitants. This
yields¹¹

$$Y^*(1 - k_Y\eta_{Y,I}) = \eta_{Y,I}\frac{p_X X}{I}X^* - \eta_{Y,I}\frac{p_N N}{I}N^* + \bar\eta_{Y,p_Y}\,p_Y^*, \tag{15}$$

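¹⁰ Differentiating equation (12) yields

$$\begin{bmatrix}\theta_{XL} & \theta_{XN} \\ \theta_{YL} & \theta_{YN}\end{bmatrix}\begin{bmatrix}p_L^* \\ p_N^*\end{bmatrix} = \begin{bmatrix}p_X^* \\ p_Y^*\end{bmatrix} \quad\text{or}\quad \begin{bmatrix}\theta_{XL} & \theta_{XN} \\ 1 & 0\end{bmatrix}\begin{bmatrix}p_L^* \\ p_N^*\end{bmatrix} = \begin{bmatrix}0 \\ p_Y^*\end{bmatrix},$$

where, since N is only used in X production, $\theta_{YN} = 0$, $\theta_{YL} = 1$, and $p_X^* = 0$ by
assumption. Jones (1965) further reasons that $\theta_{XL}a_{XL}^* + \theta_{XN}a_{XN}^* = 0$ by principles of
cost minimization. Given that, solving the equation yields equation (13).

¹¹ To get equation (15) we use the relationships $Y^* = \eta_{Y,I}I^* + \eta_{Y,p_Y}p_Y^*$,
$\eta_{Y,p_Y} = \bar\eta_{Y,p_Y} - k_Y\eta_{Y,I}$, $k_Y = p_Y Y/p_L L = p_L L_Y/p_L L = \lambda_{YL}$, and
$I^* = -(p_N N/I)N^* + (p_X X/I)X^* + (p_Y Y/I)(Y^* + p_Y^*)$.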



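where $\eta_{Y,I}$ and $\bar\eta_{Y,p_Y}$ are income and compensated price elasticities of
demand for Y and $k_Y$ is the fraction of consumer income spent on Y.
Combining equations (15) and (14) and using $p_Y^* = -(\theta_{XN}/\theta_{XL})p_N^*$
yields the equations determining output changes:

$$\begin{bmatrix} \lambda_{XL} & \lambda_{YL} & 0 \\ 1 & 0 & -1 \\ -\eta_{Y,I}\dfrac{p_X X}{I} & (1 - k_Y\eta_{Y,I}) & \eta_{Y,I}\dfrac{p_N N}{I} \end{bmatrix} \begin{bmatrix} X^* \\ Y^* \\ N^* \end{bmatrix} = \begin{bmatrix} -\lambda_{XL}(\theta_{XN}/\theta_{XL})\sigma_X \\ \sigma_X \\ -\bar\eta_{Y,p_Y}(\theta_{XN}/\theta_{XL}) \end{bmatrix}\frac{t}{p_N}. \tag{16}$$

Solving using Cramer's rule we get

$$X^* = \frac{\theta_{XN}}{\theta_{XL}}\left(\frac{\lambda_{YL}}{\lambda_{XL}}\bar\eta_{Y,p_Y} - \sigma_X\right)\frac{t}{p_N} < 0, \quad\text{since } \sigma_X > 0,\ \bar\eta_{Y,p_Y} < 0, \tag{17}$$

$$Y^* = -\frac{\theta_{XN}}{\theta_{XL}}\bar\eta_{Y,p_Y}\frac{t}{p_N} > 0, \tag{18}$$

$$N^* = \left(\frac{\theta_{XN}\lambda_{YL}}{\theta_{XL}\lambda_{XL}}\bar\eta_{Y,p_Y} - \frac{\sigma_X}{\theta_{XL}}\right)\frac{t}{p_N} < 0. \tag{19}$$

Since pollution, P, equals P(N) from equation (3), for $\varepsilon_N = (dP/dN)(N/P)$,

$$P^* = \varepsilon_N N^* < 0. \tag{20}$$

Note all our results are in terms of the variable $(t/p_N)$, whose positive
directional sign we know.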
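
The comparative statics above can be checked numerically. The sketch below is a minimal illustration, not the author's computation: it builds a three-equation system in the output changes of X, Y, and N (full employment of labor, the N-use relation, and the demand equation for Y), solves it by Cramer's rule, and compares the result with closed forms in which only the elasticity of substitution and the compensated demand elasticity appear. The parameter values are hypothetical, and the share identities used ($k_Y = \lambda_{YL}$, $p_X X/I = \lambda_{XL}/\theta_{XL}$, $p_N N/I = \lambda_{XL}\theta_{XN}/\theta_{XL}$) are assumptions that follow from city income equaling $p_L L$ before the tax.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as a list of rows."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def cramer3(m, rhs):
    """Solve a 3x3 linear system by Cramer's rule."""
    base = det3(m)
    solution = []
    for col in range(3):
        replaced = [row[:] for row in m]
        for r in range(3):
            replaced[r][col] = rhs[r]   # swap the column for the right-hand side
        solution.append(det3(replaced) / base)
    return solution


def output_changes(lam_xl, theta_xn, eta_inc, eta_comp, sigma_x, p_n_star):
    """Rates of change (X*, Y*, N*) from the three-equation system."""
    theta_xl = 1.0 - theta_xn
    lam_yl = 1.0 - lam_xl
    ratio = theta_xn / theta_xl
    k_y = lam_yl                   # assumed: income equals the wage bill
    x_share = lam_xl / theta_xl    # assumed: p_X X / I
    n_share = lam_xl * ratio       # assumed: p_N N / I
    m = [
        [lam_xl, lam_yl, 0.0],
        [1.0, 0.0, -1.0],
        [-eta_inc * x_share, 1.0 - k_y * eta_inc, eta_inc * n_share],
    ]
    rhs = [
        -lam_xl * ratio * sigma_x * p_n_star,
        sigma_x * p_n_star,
        -eta_comp * ratio * p_n_star,
    ]
    return cramer3(m, rhs)


def closed_form(lam_xl, theta_xn, eta_comp, sigma_x, p_n_star):
    """Closed-form rates of change; note the income elasticity drops out."""
    theta_xl = 1.0 - theta_xn
    lam_ratio = (1.0 - lam_xl) / lam_xl
    ratio = theta_xn / theta_xl
    x_star = ratio * (lam_ratio * eta_comp - sigma_x) * p_n_star
    y_star = -ratio * eta_comp * p_n_star
    n_star = (ratio * lam_ratio * eta_comp - sigma_x / theta_xl) * p_n_star
    return [x_star, y_star, n_star]


# Hypothetical parameters: 40 percent of labor in X, N earns 25 percent of X revenue
params = dict(lam_xl=0.4, theta_xn=0.25, eta_comp=-0.5, sigma_x=1.0, p_n_star=0.1)
numeric = output_changes(eta_inc=1.0, **params)
analytic = closed_form(**params)
print("Cramer's rule:", [round(v, 6) for v in numeric])
print("Closed form:  ", [round(v, 6) for v in analytic])
```

Changing `eta_inc` leaves the solution unchanged, which is the numerical counterpart of the statement that returning the tax proceeds removes income effects from the output changes.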

The decline in X production and N usage is expected since the production
price of N rises. The increase in Y production (equals consumption)
is expected since $p_Y$ declines. Note, however, these changes are only a
function of the elasticity of substitution in production and the compensated
elasticity of demand of Y. The former indicates the degree to which N
declines in X production with an increase in $p_N$ and hence indicates the
degree to which pollution may be reduced by substitution in production.
The second indicates only substitution effects are present in consumption
of market goods. We have taxed N which, given a fixed $p_X$, means $p_L$
and hence $p_Y$ must decline. But the tax proceeds from N offset the income
effect of the price and wage changes to laborers, leaving only pure substitution
effects in consumption of market goods.

To solve for changes in utility levels of city residents we first must
solve for consumption changes of $X^c$ and $Z^c$. We differentiate the balance-of-trade
equation (5) [$p_X(X - X^c) = p_Z Z^c + p_N N$], and substitute in
for $X^*$, $N^*$, and then $Z^{c*}$ by differentiating the demand equation for
$Z^c = Z(I, p_X, p_Y, p_Z)$. We solve first for $X^{c*}$ and then for $Z^{c*}$:

$$X^{c*} = -\frac{\theta_{XN}}{\theta_{XL}}\bar\eta_{X,p_Y}\frac{t}{p_N} < 0, \qquad \bar\eta_{X,p_Y} > 0, \tag{21}$$

$$Z^{c*} = -\frac{\theta_{XN}}{\theta_{XL}}\bar\eta_{Z,p_Y}\frac{t}{p_N} < 0, \qquad \bar\eta_{Z,p_Y} > 0. \tag{22}$$

Not surprisingly, $X^c$ and $Z^c$ decline with the fall in the price of Y, providing
all goods are substitutes in consumption.

To solve for the change in per person utility we differentiate equation (4)
to get

$$dU = X^c\frac{\partial U}{\partial X^c}X^{c*} + Y\frac{\partial U}{\partial Y}Y^* + Z^c\frac{\partial U}{\partial Z^c}Z^{c*} + P\frac{\partial U}{\partial P}P^*.$$

We then substitute in the first-order conditions for a maximum (for
example, $\partial U/\partial X^c = \lambda p_X$, where λ is the marginal utility of income) and
substitute in for $X^{c*}$, $Y^*$, $Z^{c*}$, and $P^*$ from equations (21), (18), (22), and (20).
This yields



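$$\frac{dU}{\lambda} = -\frac{\theta_{XN}}{\theta_{XL}}\frac{t}{p_N}\left(p_X X^c\bar\eta_{X,p_Y} + p_Y Y\bar\eta_{Y,p_Y} + p_Z Z^c\bar\eta_{Z,p_Y}\right) + P\frac{\partial U}{\partial P}\frac{1}{\lambda}\,\varepsilon_N\left(\frac{\theta_{XN}\lambda_{YL}}{\theta_{XL}\lambda_{XL}}\bar\eta_{Y,p_Y} - \frac{\sigma_X}{\theta_{XL}}\right)\frac{t}{p_N}.$$

But $p_X X^c\bar\eta_{X,p_Y} + p_Y Y\bar\eta_{Y,p_Y} + p_Z Z^c\bar\eta_{Z,p_Y} = 0$, so

$$\frac{dU}{\lambda} = P\frac{\partial U}{\partial P}\frac{1}{\lambda}P^* > 0, \quad\text{since } \frac{\partial U}{\partial P} < 0 \text{ and } P^* < 0. \tag{23}$$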


Therefore, the effects on utility of reduced $X^c$ and $Z^c$ are offset by the
increased Y. That is, in terms of market goods, the effects on utility net
out to zero. Since P is the only other argument in the utility function, as
indicated in equation (23), the dollar value of utility increases by the
dollar value of the reduction in pollution.

As mentioned at the beginning, intuitively this makes sense. The
basic city resource is labor, which produces goods for its own consumption
and to trade for other goods. We tax the use of an input the city imports.
The effect of the tax is to reorder production priorities in the city away
from X, where marginal social cost exceeds price, to Y. Since the tax
proceeds are returned to labor, only substitution effects in market consumption
are present, and we showed that they net out in terms of affecting
utility. The net result is a Pareto-optimal reduction in the imported
input and the accompanying positive benefits from the optimal reduction
in pollution. Given that utility levels in the city rise, city size should increase
over time as workers from other parts of the economy migrate to the
city.

Three comments on the above analysis are in order. First, allowing $p_X$
to vary would not affect our results. As X exports from the city fall, $p_X$
could rise in national markets relative to its initial level. This will affect
the change in consumption and production patterns in the city, but not
the basic effect on utility of a Pareto-optimal reduction in pollution.
Second, we could solve our model with import controls on N as the
method of pollution control. Again, production and consumption patterns
will differ from the tax solution, but the net effect on utility will only be a
fall in pollution.

Our third comment is more serious in nature. In setting up the problem,
we assumed the X and Y production functions were locally linear homogeneous.
If we were to consider situations where $X(L_X, N)$ and $Y(L_Y)$
were not locally homogeneous of degree one, our result could be weakened.
If the production of X were to have industry increasing returns to scale
and Y to have decreasing returns to scale, then the reduction of X and
increase in Y would have adverse effects on labor productivity and the
production potential of the city. For example, if one postulates an export
base theory for the existence of cities, the fall in X output could have
quite adverse effects on city size. On the other hand, if Y or the service
sector of the city were responsible for the urban scale economies, the
result of rising utility levels would be reinforced as production priorities
were reordered to the nonpolluting service sector. Moreover, an interesting
situation arises in the case where there are production externalities
or where pollution affects firm or household production efficiency. In
that case production of both goods, X and Y, may rise following the
imposition of taxes. We now turn to this situation.


Externalities in Production 

As before, our representative city produces an export good and a nontraded
good or bundle of goods such as housing, retail services, and household
produced goods (leisure, meals, child rearing, etc.). Firms in the X
industry utilize an input that results in pollution which affects production
efficiency in the X and Y industries. Specifically, a firm's production
function in the X industry is

$$x_i = g_i(P)f_i(L_i, N_i), \qquad i = 1, \ldots, n, \tag{24}$$

$$\partial f_i/\partial N_i > 0,\quad \partial f_i/\partial L_i > 0;\quad (\partial g_i/\partial P)(\partial P/\partial N_j) < 0,\quad i, j = 1, \ldots, n,$$

where $x_i$ is the firm's output; $L_i$ and $N_i$ are inputs of labor and the imported
resource N; and P is pollution. The $f_i$ is the (locally) linear
homogeneous production function of the firm.¹² The $g_i(P)$ is a production
function shifter¹³ that affects firms' efficiency in a Hicks neutral fashion
with respect to factor inputs; for example, pollution corrodes machinery
as well as irritating workers. Our specification is essentially one of industry
scale diseconomies external to the firm.

The industry production function for Y is

$$Y = G(P)F(L_Y), \tag{25}$$

$$dF/dL_Y > 0,\quad (dG/dP)(\partial P/\partial N_i) < 0,\quad i = 1, \ldots, n.$$

The G(P), caused by usage of N in X production, is also a production function
shifter. The pollution production function is as before:

$$P = P(N_1, \ldots, N_n), \tag{26}$$

$$\partial P/\partial N_i \gtrless \partial P/\partial N_j,\quad i \neq j.$$

To examine the economic effects of pollution on production and output
in the city, we analyze optimum versus equilibrium factor pricing in the
city. It is assumed readers are familiar with the two constrained optimization
problems, of firms maximizing profits versus an economy maximizing
a social welfare function, that determine equilibrium versus optimum
pricing of factors. We turn directly to the results. In competitive equilibrium,
factors are paid the value of their perceived marginal products.
That is,

$$p_L = p_X g_i(P)\,\partial f_i/\partial L_i, \qquad i = 1, 2, \ldots, n, \tag{27}$$

$$p_N = p_X g_i(P)\,\partial f_i/\partial N_i + p_X(\partial g_i/\partial P)(\partial P/\partial N_i)f_i, \qquad i = 1, \ldots, n, \tag{28}$$

where $f_i$ is an abbreviation for $f_i(L_i, N_i)$. From equation (28) we can see
that the marginal product of $N_i$ to the firm is the sum of the normal
first term plus the effect of using $N_i$ upon pollution that then affects
(negatively) the firm's own production. Of course the firm cannot distinguish
between the two effects and simply sees their sum as the marginal

¹² See above discussion on the local linear homogeneity assumption.

¹³ Our production specification is chosen for analytic convenience. In the alternative
specification, P is an unpriced input in the $f_i$ and F functions in equations (24) and (25).
Then in addition to the externality distortion caused by the X industry emitting pollutants,
we have an unpriced input distortion in the X and Y industries, since P would
then be an unpriced input in the $f_i$ and F production functions. This would mean that P
might earn a negative rent which, in perfect competition in output and factor markets,
would have to be "assigned" to the earnings of one or both of the other factors, creating
further distortions such as analyzed by Worcester (1969) and Meade (1952). There has
been some debate on how to deal with the rent to an unpriced factor (see, for example,
Mishan 1965), and we avoid this debate by using the production function specification
where the only distortion is the externality itself.



product of N.¹⁴ For Y, from equation (25),

$$p_L = p_Y G(P)\,dF/dL_Y. \tag{29}$$

Equations (27) and (29) also describe the value of social marginal
products for labor. However, the values of social marginal products for N
are different from equation (28). The $N_i$ affects production efficiency of
firms in the X and Y industries by contributing to pollution, which in
turn acts as a production function shifter in these industries. For firms
j and k the social marginal products of N are

firm j:

$$SMP_j = p_X\left(g_j(P)\frac{\partial f_j}{\partial N_j} + \frac{\partial g_j}{\partial P}\frac{\partial P}{\partial N_j}f_j\right) + \left[p_X\sum_{i \neq j}\frac{\partial g_i}{\partial P}\frac{\partial P}{\partial N_j}f_i + p_Y\frac{dG}{dP}\frac{\partial P}{\partial N_j}F\right], \tag{30}$$

firm k:

$$SMP_k = p_X\left(g_k(P)\frac{\partial f_k}{\partial N_k} + \frac{\partial g_k}{\partial P}\frac{\partial P}{\partial N_k}f_k\right) + \left[p_X\sum_{i \neq k}\frac{\partial g_i}{\partial P}\frac{\partial P}{\partial N_k}f_i + p_Y\frac{dG}{dP}\frac{\partial P}{\partial N_k}F\right], \tag{31}$$

where $f_i$ and F are abbreviations for $f_i(L_i, N_i)$ and $F(L_Y)$. The first
term in the square brackets indicates the negative effect, in the form of a
production function shift, of employing additional N on other firms in the
X industry, and the second term represents the negative effect in the Y
industry. In each case, per unit use of N should be taxed by the amount in
the square brackets to equate both factor price with social marginal
product and social and private marginal costs. The standard result
following the imposition of the tax is that, with the increase in $p_N$, the use
of N, the use of L in X (for $p_X$ fixed), and output of X fall.

What does this indicate with respect to city size? To answer this
question, I assume as above that the city labor force remains initially
fixed and ask what happens to the welfare of city inhabitants following
the imposition of the tax. If the welfare rises, the presumption is that
people from other parts of the country will migrate to the city over time.
Although the imposition of the tax leads to a fall in X output, the fall
in N usage means the city has to spend less of its income on importing N.
In fact, since the value of social marginal product of N (eqs. [30] or [31])
is less than the perceived value of private marginal product of N, which
equals the price of N (eq. [28]), reductions in N mean that the reduction

¹⁴ The adding-up problem is avoided by assuming that $p_X(\partial g_i/\partial P)(\partial P/\partial N_i)f_i$ is
negligible. Then factor payments will exhaust firm revenue (see Chipman 1970).



in city income spent on N ($p_N$ multiplied by the change in N) should
exceed the fall in the value of output lost by reductions in N. This relationship
should persist as long as the price of N exceeds the value of the social
marginal product of N.¹⁵ Therefore income and welfare of city inhabitants
will rise following taxation.

As mentioned in the previous section, the proposition that, because
pollution taxation corrects divergences between private and social costs,
optimum city size will rise with pollution taxation may be simplistic.
For the case of externalities in consumption, we demonstrated that
pollution taxation reorders production priorities and that, providing taxation
proceeds are returned to city inhabitants, immigration to the city should
occur. The same type of analysis applies to externalities in production.
However, the same qualifications remain. Pollution taxation may lead
to a fall in the output of the good responsible for urban scale economies or,
from an export base theory of city existence, the good responsible for city
existence. This would have adverse effects on city size if the assumption of
local linear homogeneity is inappropriate.

However, there are situations with externalities in production where
no qualification is necessary. In these situations, when N is taxed and its
usage falls, output of both X and Y may rise. City inhabitants of course
are better off, but also the export base and the service base of the city
expand. There are three pertinent situations.

1. Pollution in a city may conceivably be so extensive that additional
use of N may actually reduce total output. Although $\partial f_i/\partial N_i > 0$,
additions (reductions) of N may add (remove) so much to (from) pollution and
hence so affect production efficiency through large marginal reductions
in the $g_i$ and G terms in equations (24) and (25), that, in net, total output
will fall (rise). In other words, in competitive equilibrium, due to the
terms in the square brackets, equations (30) and (31) for the social marginal
product of N may be negative. If N is optimally priced or taxed, the
importation and use of N will fall in the city. For initial reductions in N
total productivity will rise, because the true social marginal product of
N is negative.¹⁶ In the social optimum, total output may rise and of
course city income (value of output minus imports of N) will rise. (Recall
total output is maximized when the social marginal product of N is
zero. The social marginal product of N moves from an initial negative
value to a final positive value, so it cannot be definitely concluded that
total output rises.) Whether X and Y both may rise or only Y rises depends,
in addition to demand conditions, on whether the social marginal product

¹⁵ Tolley (1969) essentially makes a similar point. To quote Tolley: "A manifestation
of the gain from reducing the externality is likely to be that the production function
shift lowering costs, is greater than the effect of the tax as a cost-raising item."

¹⁶ See Gordon (1954) for a similar point involving fishing grounds. If the social
marginal products are originally negative, then the original equilibrium would be
inefficient (we are interior to the transformation curve) as well as nonoptimal (market
pricing is incorrect).




of N in X production alone is originally negative, that is, on whether the first
term in the square brackets in equation (31) outweighs the private
marginal product of N (the expression in parentheses in eq. [31]). Then
both X and Y output may rise.
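
A small numerical sketch may help fix the first situation. The functional forms below are hypothetical and chosen only for illustration: pollution is taken to be convex in N, the shifter g(P) is an exponential decay, and f(L, N) is a linear homogeneous square-root form. None of these forms come from the text; they simply produce a case where the social marginal product of N is negative at the initial level of use.

```python
import math


def x_output(n: float, labor: float = 1.0, damage: float = 0.5) -> float:
    """Industry output x = g(P) * f(L, N) under assumed functional forms."""
    pollution = n ** 2                        # assumed convex P(N)
    shifter = math.exp(-damage * pollution)   # assumed g(P), decreasing in P
    return shifter * math.sqrt(labor * n)     # assumed f(L, N), linear homogeneous


def social_marginal_product(n: float, step: float = 1e-6) -> float:
    """Central-difference estimate of d(output)/dN, pollution feedback included."""
    return (x_output(n + step) - x_output(n - step)) / (2 * step)


initial_n, reduced_n = 1.0, 0.8   # hypothetical use of N before and after the tax
print("SMP of N at initial use:", round(social_marginal_product(initial_n), 4))
print("output before:", round(x_output(initial_n), 4))
print("output after: ", round(x_output(reduced_n), 4))
```

With these assumed forms the marginal product of N in f is positive throughout, yet the pollution feedback makes total output fall as N rises past roughly 0.71, so cutting N from 1.0 to 0.8 raises output with labor unchanged.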

2. A simple example will illustrate our second situation. There are two
firms initially producing equal amounts of X, and both experience equal
amounts of pollution. However, only one firm creates pollution, due to
differences in production functions or spatial locations; for example, the
"nonpolluting" firm may be downwind on the edge of the city. If all
production were shifted to the nonpolluting firm, for the same inputs,
total output of X and Y would rise.

In other words, firms may not contribute equally to pollution at the
margin given either their production technology or relative spatial
locations, that is, $\partial P/\partial N_j \neq \partial P/\partial N_k$. Alternatively, $\partial g_j/\partial P \neq \partial g_k/\partial P$, or
pollution affects efficiency of firms differently for the same reasons. If
either is the case, equations (30) and (31) for social marginal product will
not be equal in competitive equilibrium. (Of course, private marginal
products will be equal.) Suppose the social marginal product of N in
firm k is greater than in firm j. If we held employment in Y and city labor
force fixed, and moved some of $N_j$ to firm k, output of X would rise for the
same inputs.
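
The two-firm case can be sketched numerically. In the hypothetical example below both firms suffer the same pollution shifter, but only firm j emits; the emission coefficients, the exponential shifter, and the square-root production function are all assumptions made for illustration, and the starting allocation is simply an equal split rather than a computed competitive equilibrium.

```python
import math


def total_x(n_j: float, n_k: float, labor: float = 1.0,
            emission_j: float = 0.5, emission_k: float = 0.0) -> float:
    """Total X output of two firms sharing one pollution shifter g(P)."""
    pollution = emission_j * n_j + emission_k * n_k   # only firm j pollutes here
    shifter = math.exp(-pollution)                    # assumed g(P), same for both
    # Each firm employs the same (fixed) labor; f is an assumed square-root form
    return shifter * (math.sqrt(labor * n_j) + math.sqrt(labor * n_k))


equal_split = total_x(1.0, 1.0)   # both firms use one unit of N
shifted = total_x(0.8, 1.2)       # same total N, some moved to the clean firm
print("equal split:", round(equal_split, 4))
print("shifted:    ", round(shifted, 4))
```

Total use of N and of labor is identical in the two allocations, so the higher output in the second one reflects only the rearrangement of N toward the firm with the smaller marginal contribution to pollution.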

As shown by Schall (1971), all this implies that we are not on the
Pareto-efficient transformation surface in competitive equilibrium, where
equations (28) but not (30) and (31) are equalized. If we impose a
Pareto-optimal tax on the use of N, we move from the private competitive
equilibrium interior to the transformation curve to the social optimum on
the transformation curve for the city. By taxing N in the production of X,
we (a) rearrange production in the X industry efficiently, shifting us to the
transformation curve, and (b) eliminate the divergence between private
and social marginal costs caused by the X industry polluting itself and the
Y industry. Usually only (b) is considered, which would indicate a fall in
X production and a rise in Y production. By including (a), where we
rearrange N usage and X production between firms to those polluting less
or being less affected by pollution, X output may rise despite the tax
on N and its fall in total usage.

3. Our third situation is a more economically technical one and our
discussion is brief. As Plott (1966) points out, because pollution is caused
by the use of an input, taxing the input N may still lead neither to a fall in
X output nor to a fall in the use of labor. As shown in Bear (1965), this occurs if N is
an "inferior" input or if N and L are "substitutes" in production.¹⁷
Taxing and reducing the use of N and raising its price can lead to an
increase in X output for $p_X$ held constant.


¹⁷ Inputs are substitutes in production if the addition of one input to production
holding the second input constant reduces the marginal product of the second input.




The Representative City and the Economy 

The result of taxing pollution in the representative city analyzed above
was to make the existing workers there better off. Given that, we argued
that over time immigration would occur to the city. Then city output
of Y would rise even further and output of X would probably rise relative
to its initial post-tax level. If pollution is optimally taxed in all cities of the
type that produce X, then sizes of all these cities should increase over time,
even though economy output of X declines due to the rise in price of N.
These cities now have simply reordered production priorities from X
to Y, the nontraded good such as retail services, housing, and household
produced goods; optimum city size has risen because these cities are now
more efficient production-consumption units.

It seems even possible that X production in these cities may rise with
immigration, with the appropriate changes in production techniques to
less pollution-intensive methods of production.¹⁸ Of course, unless one of
the three situations depicted in the production externalities section
prevails, X output in the economy will still decline with the change in
price of N. For X production in some cities to rise and X output of the
economy to fall, there must be a decline in the number of cities producing
X. That is, given initial multiple cities of the type that produce X, the
optimum size of that type of city or that type of production-consumption
unit has risen, and hence the numbers of that type of city will decline.
Just what solution will prevail could only be determined in a model of a
system of cities far more complex than any that currently exist.


Conclusions 

My conclusion is that optimally taxing an externality will make people
in a city better off at the existing size of the city. That will induce
immigration to the city and increase city size. The analysis in the paper
is specific to pollution but can be extended.

Immediate parallels for other types of externalities spring to mind,
in particular for congestion. Is the effect of optimally pricing congestion
just to raise the price of peak-time travel and make it more expensive to
commute? Or is the effect of pricing congestion to allocate resources
efficiently, reordering travel times and modes? Moreover, switching to a
time-variable congestion price would most likely reduce off-peak travel
times relative to current gasoline tolls and raise peak travel times. Also,
evaluation of optimal investment in roads and financing of roads would
change with optimal pricing (see Mohring 1970).

Parallels to the situations in the production externality case in the
paper also arise. Walters (1961) has shown that when traffic density

¹⁸ That is, the N/L ratio will fall as more of the nonresource input is used for either
pollution control or less polluting methods of utilizing the resource input.




becomes very high, traffic flow will decrease (this occurs at about 11
miles an hour). Optimal congestion taxes will lower traffic densities by
reordering times when people travel and increase traffic flows or output!
Furthermore, by taxing vehicles differentially according to their contribution
to congestion, traffic composition may be changed at peak hours,
increasing automobiles relative to trucks on intracity roads during rush
hour. That is, truckers may then choose to travel at nonpeak times and to
travel the periphery of the city, rather than through the city.


References 

Baumol, W. J., and Oates, Wallace E. "The Use of Standards and Prices for
Environment Protection." Swedish J. Econ., no. 1 (Summer 1971), pp. 42-54.
Bear, D. V. T. "Inferior Inputs and the Theory of the Firm." J.P.E. 73 (June
1965): 287-89.
Beckmann, M. Location Theory. New York: Random House, 1968.
Bohm, Peter. "An Approach to the Problem." Swedish J. Econ., no. 1 (Summer
1971), pp. 55-66.
Chipman, J. S. "External Economies of Scale and Competitive Equilibrium."
Q.J.E. 84, no. 3 (August 1970): 347-85.
Gordon, H. S. "The Economic Theory of a Common-Property Resource: The
Fishery." J.P.E. 62, no. 2 (April 1954): 124-42.
Henderson, J. V. "The Types and Sizes of Cities: A General Equilibrium
Model." Ph.D. dissertation, Univ. Chicago, 1972.
Jones, R. W. "The Structure of Simple General Equilibrium Models." J.P.E.
73, no. 6 (December 1965): 557-72.
Kneese, A. V. "Background for the Economic Analysis of Environmental Pollution."
Swedish J. Econ., no. 1 (Summer 1971), pp. 1-24.
Lösch, A. The Economics of Location. New Haven, Conn.: Yale Univ. Press, 1954.
Meade, J. E. "External Economies and Diseconomies in a Competitive Situation."
Econ. J. 62, no. 245 (March 1952): 54-67.
Mills, E. S. "An Aggregative Model of Resource Allocation in a Metropolitan
Area." A.E.R. 57, no. 2 (May 1967): 197-210.
Mills, E. S., and de Ferranti, D. "Market Choices and Optimum City Size."
Papers and Proc. American Econ. Assoc. (1970).
Mishan, E. J. "Reflections on Recent Developments in the Concept of External
Effects." Canadian J. Econ. 21 (February 1965): 3-34.
Mohring, H. "The Peak Load Problem with Increasing Returns and Pricing
Constraints." A.E.R. 60, no. 4 (September 1970): 693-705.
Plott, C. R. "Externalities and Collective Taxes." Economica 33 (February 1966):
84-87.
Sundquist, J. "Where Shall They Live?" Public Interest, no. 18 (1970), pp. 88-100.
Schall, L. D. "Technological Externalities and Resource Allocation." J.P.E. 79,
no. 5 (September-October 1971): 983-1001.
Tolley, G. S. "The Welfare Economics of City Bigness." Urban Econ. Report
no. 31, Univ. Chicago, 1969.
Walters, A. "The Theory and Measurement of Private and Social Cost of Highway
Construction." Econometrica 29 (October 1961): 679-99.
Worcester, D. A., Jr. "Pecuniary and Technological Externality, Factor Rents,
and Social Costs." A.E.R. 59, no. 5 (December 1969): 873-85.



Communications 


A Conditional Logit Model of 
Occupational Choice 

Michael J. Boskin
Stanford University


I. Introduction

The theory of human capital¹ provides a convenient framework for
analyzing the choice of occupation by individual workers. The present
paper is devoted to an investigation of the implications of this framework.
Section II briefly discusses the human capital approach to occupational
choice. Section III reviews the conditional logit statistical model
elaborated in McFadden (1968). Section IV discusses the generation of
the data. Section V presents the empirical results of this study, that is,
tests of hypotheses about the variables influencing occupational choice
and estimates of the relative weights given to variables in selecting an
occupation. Section VI offers a brief conclusion.


II. The Human Capital Approach to Analyzing Occupational
Choice

The application of the theory of human capital to occupational choice is
simple and straightforward.² In choosing among occupations (defined
broadly enough so workers in different occupations are not perfect

I would like to thank Daniel McFadden both for introducing me to conditional logit
analysis and for the use of his computer program. I also received valuable advice on this
paper from Mel Reder, John Pencavel, Michael Hurd, Arthur Goldberger, Takeshi
Amemiya, and the referee. Thomas Moore provided computational assistance. Finally,
I am pleased to acknowledge the support of the Manpower Administration, Department
of Labor, through research grant 91-06-72-26. I am solely responsible for what
remains.

¹ See Schultz (1962) and Becker (1964).

² See, for instance, Fleisher (1970).




substitutes), a potential worker will weigh the benefits (potential earnings
and nonpecuniary returns) against the costs (training, foregone earnings, etc.).
The worker will invest in changing occupations only if the returns are
sufficiently large to make the particular change of occupation the most
profitable use of his limited resources.

In this simplest formulation, we would always expect to find workers
in those occupations with the highest discounted present value of future
potential earnings.³ However, given imperfect capital markets, resources
for investing in oneself will not be equally accessible to all workers. The
wealth position of an individual will partially determine his capability
of making any particular profitable investment in himself.

It is thus clear that decisions on occupation choice will be governed
by the returns (primarily expected potential full-time earnings) and
costs (primarily training and foregone potential earnings) relative to
the wealth position of the individual worker in alternative occupations.
That is, the probability that a worker, i, will enter a particular occupation,
j, will be a function of the relative present values of potential
postinvestment lifetime earnings, E, training costs and foregone earnings
relative to wealth, T/W, and the present value of expected income
foregone due to unemployment, U, in alternative occupations,⁴ that is,

$$P_{ij} = f\left(E_{i1}, \ldots, E_{in};\ U_{i1}, \ldots, U_{in};\ T_{i1}/W_i, \ldots, T_{in}/W_i\right). \tag{1}$$

We turn now to a discussion of the estimation of such a relationship.

III. A Conditional Logit Model of Occupational Choice⁵

If we assume that preferences are such that all occupations have a
positive selection probability for each individual, that the odds that a
particular occupation will be chosen over another are independent of the
presence of other possible occupations, and that the determination of the
odds of choosing alternative occupations is additively separable in its
arguments representing the occupation effect and the individual worker
effect, we may invoke certain theorems proved by McFadden (1968).

³ Abstracting for the moment from any differential nonpecuniary costs and benefits
among occupations. More rigorously, the worker must choose simultaneously (1) the
optimal number of occupations over his lifetime, (2) the optimal length of stay in each
occupation, and (3) the optimal sequence of occupations. That is, we have an integer
dynamic programming problem. Therefore, our worker will always be in an occupation
which enables him to maximize present value considering all possible occupational shifts.

⁴ This formulation may be motivated by an appeal to stochastic choice theory. For
example, following Block and Marschak (1960), we may adopt a model whereby the
probability of each outcome is proportional to the utility derived from the choice.

⁵ The discussion that follows is based on McFadden (1968). This estimator has also
been used to study choice of college by Radner and Miller (1970).


These theorems essentially allow us to write the selection probability in
the form

$$P_{ij} = \frac{1}{\sum_{k=1}^{n} \exp[b(k) - b(j)]}. \tag{2}$$

The weight $b(k)$ assigned to an occupation is a function of the attributes
of the occupation. The individual observes a vector of variables, $X_k$,
such as wage rates, unemployment rates, training costs, etc., for each
occupation. The $b(k)$ which determine the selection probabilities are
given by

$$b(k) = B(X_k, \theta), \tag{3}$$

where $\theta$ is a vector of unknown parameters specifying the functional
form of $B$. For each individual, define a variable $f_k = 1$ if occupation k
is selected and $f_k = 0$ otherwise. Then

$$\sum_{k=1}^{n} f_k = 1.$$

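To estimate $\theta$, we observe that the likelihood of a given sample is then

$$\mathcal{L} = \prod_{i=1}^{m} \prod_{k=1}^{n} P_{ik}^{f_{ik}}. \tag{4}$$

Then,

$$\log \mathcal{L} = -\sum_{i=1}^{m} \sum_{k=1}^{n} f_{ik} \log \left\{ \sum_{l=1}^{n} \exp[B(X_l, \theta) - B(X_k, \theta)] \right\}. \tag{5}$$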

The method of maximum likelihood can be applied to (5) to obtain an
estimator for $\theta$ with optimal asymptotic properties. McFadden (1968)
shows that the estimator for the case where $X_k$ and $\theta$ are K × 1 vectors
and $B$ is the linear function $B(X_k, \theta) = \theta' X_k$ is consistent and
asymptotically normal. This result is used to construct approximate large
sample confidence bounds for the estimates.
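
To illustrate how such an estimate can be obtained, here is a small sketch that maximizes the log likelihood by plain gradient ascent. The optimizer, step size, and toy data are assumptions made for illustration; the text does not describe the algorithm used in the original computer program.

```python
import math

def probabilities(theta, X):
    """Selection probabilities under the linear form b(k) = theta'X_k."""
    b = [sum(t * x for t, x in zip(theta, row)) for row in X]
    top = max(b)  # subtract the maximum for numerical stability
    e = [math.exp(v - top) for v in b]
    s = sum(e)
    return [v / s for v in e]

def score(theta, sample):
    """Gradient of the log likelihood: sum_i (X_chosen - sum_k P_ik X_k)."""
    g = [0.0] * len(theta)
    for X, chosen in sample:
        p = probabilities(theta, X)
        for d in range(len(theta)):
            expected = sum(pk * row[d] for pk, row in zip(p, X))
            g[d] += X[chosen][d] - expected
    return g

def fit(sample, n_params, step=0.5, iterations=500):
    """Gradient ascent on the average log likelihood (an assumed, simple optimizer)."""
    theta = [0.0] * n_params
    n = len(sample)
    for _ in range(iterations):
        g = score(theta, sample)
        theta = [t + step * gd / n for t, gd in zip(theta, g)]
    return theta

# Hypothetical data: one attribute, three occupations, ten workers.
X_toy = [[0.0], [1.0], [2.0]]
sample_toy = [(X_toy, 0)] * 2 + [(X_toy, 1)] * 3 + [(X_toy, 2)] * 5
theta_hat = fit(sample_toy, n_params=1)
```

At the maximum the score is zero, so the attribute value expected under the fitted probabilities equals the average attribute value of the occupations actually chosen.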
We thus have for our case of a linear function of occupation attributes
that the ratio of the odds of choosing occupation j over occupation k is
given by

$$\frac{P_{ij}}{P_{ik}} = \exp[\theta'(X_j - X_k)]. \tag{6}$$

Taking logarithms yields

$$\log \frac{P_{ij}}{P_{ik}} = \theta'(X_j - X_k); \tag{7}$$

the log of the odds that occupation j will be chosen over occupation k
is a linear function of the attributes of the occupation. Thus, we have the





multinomial extension of traditional logit analysis (see Theil [1969] for
another discussion of this statistical problem).
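
The two properties used in this section, log odds that are linear in attribute differences and odds that do not depend on the presence of other occupations, can be checked numerically. The sketch below uses hypothetical weights and attribute values chosen only for illustration.

```python
import math

def probabilities(theta, X):
    """Selection probabilities under the linear form b(k) = theta'X_k."""
    b = [sum(t * x for t, x in zip(theta, row)) for row in X]
    s = sum(math.exp(v) for v in b)
    return [math.exp(v) / s for v in b]

theta = [1.0, -0.5]                       # hypothetical weights
X = [[1.0, 0.2], [1.5, 0.6], [0.8, 0.1]]  # hypothetical occupation attributes

# Log odds of the first occupation over the second, computed two ways.
p = probabilities(theta, X)
log_odds = math.log(p[0] / p[1])
linear = sum(t * (xj - xk) for t, xj, xk in zip(theta, X[0], X[1]))

# Dropping the third occupation leaves the odds of the first over the second unchanged.
p_two = probabilities(theta, X[:2])
log_odds_two = math.log(p_two[0] / p_two[1])
```

The three quantities `log_odds`, `linear`, and `log_odds_two` agree up to floating-point rounding, which is the content of equation (7) and of the independence assumption stated at the start of the section.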


IV. The Data 

We begin this discussion of the empirical results with an explanation of
the generation of the data. We estimate the present value of expected
lifetime whole income (expected wages × hours available for work,
assumed = 2,000) for individuals in our sample for various occupations
they might enter. These observations are taken from the 1967 Survey of
Economic Opportunity, a detailed set of 30,000 household interviews;
for a discussion of these data, see Boskin (1973) or Hall (1973). First, we
estimate the expected real hourly wage rate facing each individual as a
function of his personal characteristics. This formulation is suggested by
recent advances in the hedonic method of price measurement.⁶ We
employ here an extended version of the particular type of wage equation
proposed by Hall (1973); that is, we adopt an analysis-of-variance
regression model,

$$\log (w/p) = \gamma' Z, \tag{8}$$

where the Z's represent personal characteristics such as race, sex, age,
location, education, health, union membership, occupation, family
status, etc. We run separate regressions for each race-sex-occupation
group, thereby allowing a complete set of interactions between these and
all other effects. In addition, we allow interactions between union
membership and location, age and education, etc.

These results are not without interest in themselves; the interested
reader should consult Boskin (1972) and Hall (1973). For our purposes,
however, they are important because they give us, for each individual,
a method of estimating the wage he/she faces in each of 11 broad
occupational classes. In addition, we can estimate how that wage rate
varies with age. We thus can predict the course of potential lifetime
earnings for each individual in each of several occupations he might choose
to enter. Of course, several refinements have to be made. We must at
least attempt to allow for productivity growth: current 20-year-old
workers will be working with an improved technology when they are
40, so we must estimate this productivity growth and adjust the wages
of current 40-year-old workers accordingly to estimate wage rates facing
20-year-old workers 20 years from now. We estimate a constant rate of
productivity growth from the average annual rate for 1960-70. We also
assume that future potential earnings are discounted at a 5 percent rate


⁶ See Hall (1973).





TABLE 1

Sample Calculation of the Present Value of Full-Time Earnings by Occupation
for a Representative Individual*

                             Present Value of Full-Time Earnings
Occupation                         (Thousands of Dollars)
----------------------------------------------------------------
Professional/technical                      134
Farmer                                       76
Manager                                     141
Clerical                                     99
Sales                                        97
Craftsman                                   109
Operative                                    95
Private household                            67
Service                                      86
Farm laborer                                 59
Laborer                                      83
----------------------------------------------------------------
* White male high school graduate, age 40.


of interest; modest variations in the discount rate do not affect the
results.⁷

We thus estimate the present value of expected lifetime potential
earnings as

$$PV_i = \sum_{t=t_0}^{65} \frac{w_t H_t (1 + \gamma)^{t - t_0}}{(1 + r)^{t - t_0}}, \tag{9}$$

where $PV_i$ = the present value of potential work time remaining between
age at $t_0$ and age 65, $w_t$ = expected wage rate in year t, $H_t$ = hours
available for work in year t (= 2,000), $\gamma$ = expected rate of productivity
increase, assumed to be constant at the average annual rate from 1960
to 1970, and $r$ = discount rate, assumed to be 5 percent.
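
The present-value calculation can be sketched in a few lines. This is an illustration under stated assumptions: the 2 percent growth rate and the flat $3.00 wage profile are hypothetical placeholders (the text does not report the productivity growth rate used), and the sum is taken to include both the starting age and age 65.

```python
def present_value(wage_at_age, start_age, hours=2000, growth=0.02,
                  discount=0.05, last_age=65):
    """Sum of w_t * H_t * (1 + growth)^(t - t0) / (1 + discount)^(t - t0), t = t0..last_age.

    `growth` is a hypothetical placeholder; `hours` and `discount` follow the text.
    """
    total = 0.0
    for age in range(start_age, last_age + 1):
        n = age - start_age  # years elapsed since the starting age
        total += wage_at_age(age) * hours * (1 + growth) ** n / (1 + discount) ** n
    return total

# Hypothetical flat wage profile of $3.00 per hour for a 40-year-old.
pv_flat = present_value(lambda age: 3.0, start_age=40)
```

Passing the wage as a function of age mirrors the text's use of estimated age-wage profiles: any fitted profile can be substituted for the flat one used here.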

The second variable used in the study is the ratio of an index of training
costs to current net worth.⁸ We use the data derived by Scoville (1966)
on specific training requirements by occupation. Out-of-pocket expenses
are assumed to be one-third of foregone earnings. Foregone earnings are
computed by multiplying the present wage rate by necessary work time
foregone in retraining. We then take the ratio of training cost to net
worth, the assumption being that the worker finances his retooling out
of his own resources.
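
A worked sketch of this ratio follows. It assumes that the training-cost index is the sum of foregone earnings and out-of-pocket expenses, which the text does not state explicitly; the wage, hours, and net worth figures are hypothetical.

```python
def training_cost_ratio(wage, hours_foregone, net_worth):
    """Training cost relative to net worth, under the assumptions described above."""
    foregone_earnings = wage * hours_foregone   # present wage x work time foregone
    out_of_pocket = foregone_earnings / 3.0     # assumed one-third of foregone earnings
    return (foregone_earnings + out_of_pocket) / net_worth

# Hypothetical worker: $3.00 per hour, 1,000 hours of retraining, $8,000 net worth.
ratio = training_cost_ratio(wage=3.0, hours_foregone=1000, net_worth=8000.0)
```

Dividing by net worth makes the same dollar cost of retraining weigh more heavily on a worker with fewer resources, which is how the variable captures imperfect capital markets.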

The final variable we examine in this study is expected lifetime earnings
lost due to unemployment. If time spent unemployed were completely
unproductive, that is, contained no element of leisure or investment in
search activity, the worker would be indifferent, ceteris paribus, between
two occupations, one offering $1 more in the present value of future


⁷ A sample calculation for a representative individual is presented in table 1.
⁸ Net worth is defined as cashable net worth, including the value of consumer durables
and excluding the value of human capital.








full-time earnings, the other offering $1 less in expected earnings foregone
due to unemployment. We could then subtract expected earnings foregone
due to unemployment (net of unemployment insurance) from expected
full-time earnings and use expected wage income as the focus of study.
If, however, the measured unemployment includes a component of
leisure or search, the time spent unemployed is valuable and the worker
will require less than a $1 decrease in relative expected earnings foregone
due to unemployment to be indifferent to an occupation with a $1 larger
lifetime full-time earnings potential. We have therefore separated these
two components of expected income in order to attempt to test this
hypothesis. We estimate the expected duration of unemployment in a
manner analogous to our procedure to estimate wages.⁹ We estimate
analysis-of-variance hedonic unemployment equations of the Hall type
(see Hall 1970) by regressing time unemployed on a set of personal
characteristics. We thus get an estimate of the expected unemployment
facing a potential worker in each occupation and how this unemployment
varies by age. Following the procedure described above for potential
earnings, we estimate the present value of potential earnings lost due to
unemployment by using a formula similar to (9) in all but two respects.
We replace $H_t$, hours available for work in year t, with $u_t$, expected hours
lost due to unemployment, and we replace $w_t$, the wage, by $(w_t - I_t)$,
the wage net of the hourly equivalent of unemployment insurance.¹⁰


V. Empirical Results 

Tables 2 and 3 present the empirical results, disaggregated by race and
sex. Table 2 presents occupation characteristics such as discounted
present value of potential future earnings, discounted present value of
expected lifetime earnings foregone due to unemployment, and our
estimate of the ratio of training costs to net worth. Since real wage and
unemployment rates, even within race-sex groups (holding other variables,
such as education, constant), differ substantially from occupation to
occupation,¹¹ the substantial variation in our variables is hardly
surprising. These variables summarize the information (potentially) available
to the worker in choosing an occupation.¹²

⁹ 1966 was a year of relatively full employment. Projecting unemployment over the
life cycle based on these data is the most reasonable procedure available to us but could
result in misestimation, as relative unemployment by occupation varies over the business
cycle.

¹⁰ Unemployment insurance is based on 1966 figures and is assumed to grow at the
rate for the 1960-70 period.

¹¹ See Reder (1955) for a discussion of these differentials.

¹² We do not claim that they are the only ones which conceivably could affect
occupational choice; rather, we assume that the influence of other variables, e.g., those
measuring tastes for certain types of work, is small relative to the variables considered
here.



TABLE 2

Occupation Characteristics for Population Subgroups
(Thousands of Dollars)

                                    Standard        Mean Deviation for
                    Mean for        Deviation       Nonadopted Occupations
                    Adopted         for Adopted     (Nonadopted - Adopted)
                    Occupations     Occupations
---------------------------------------------------------------------------
Present Value of Lifetime Whole Earnings
  Total               63              56               36
  White males         97              63                0.7
  Black males         42              64                1.4
  White females       52              48               -5.1
  Black females       16              26              -12.8

Ratio of Estimated Training Costs to Net Worth
  Total                0.20            1.8              0.25
  White males          0.38           29                0.08
  Black males          0.19            1.6              0.70
  White females        0.19            1.6              0.28
  Black females        0.13            1.5              1.02

Present Value of Lifetime Earnings Lost Due to Unemployment
  Total                1.2            36                0.16
  White males          1.2            20                0.15
  Black males         26              82               -1.60
  White females        0.8            60               -1.93
  Black females        0.3            86              -528
---------------------------------------------------------------------------
Source: Computed from 1967 Survey of Economic Opportunity.


TABLE 3

Relative Weights of Potential Earnings, Training Costs, and Foregone Earnings
Due to Unemployment in the Conditional Logit Decision Model

Estimate No., Population Group,         Estimate of θ   Standard      Likelihood L
and Variables Included                  (x 10-')        Deviation*    (x 10»)
                                                        (x 10-')
-----------------------------------------------------------------------------------
1) White males:
   Present value of potential earnings     1.084           0.075        -54.15
   Training cost/net worth                -0.001           0.001
   Present value of time unemployed       -0.051           0.090
2) Black males:
   Present value of potential earnings     0.072           0.013        -44.64
   Training cost/net worth                -0.010           0.001
   Present value of time unemployed       -4.35            0.567
3) White females:
   Present value of potential earnings     0.875           0.076        -47.07
   Training cost/net worth                -0.005           0.002
   Present value of time unemployed      -18.74            1.609
4) Black females:
   Present value of potential earnings     0.378           0.132        -33.40
   Training cost/net worth                -0.012           0.002
   Present value of time unemployed      -20.78            1.82
-----------------------------------------------------------------------------------
Source: Based on 2,500 randomly selected observations from each population group from the 1967
Survey of Economic Opportunity.

* Convergence was obtained after approximately 20 iterations; standard deviations reported are from the
last iteration.



The data offer some interesting insights into the workings of labor
markets. The most obvious is the fact that opportunities and outcomes
vary markedly among individuals, both within and across population
subgroups, as evidenced by the large standard deviation of all variables.
It is also interesting to draw some inferences across population subgroups.
Relative to other population subgroups, white males enter occupations
with far higher training costs (the higher foregone earnings included in
the numerator are mostly offset by larger net worth in the denominator).
In addition, the mean deviation for the nonadopted occupations for all
three occupation characteristics is much smaller for white males than
for the other groups. Given age, education, and the like, occupation is
thus of less relative importance for white males than the other groups.
Finally, we note that blacks (male and female) have a much larger mean
deviation of the ratio of training costs to net worth; this implies that the
nonadopted occupations are relatively more expensive for blacks than
for whites.

Table 3 presents the evidence on the relative weights given to earnings,
unemployment costs, and training costs. The results for all population
subgroups confirm our a priori expectations from human capital theory
that (1) workers will tend to choose those occupations with the highest
discounted present value of potential future earnings; (2) workers will
tend to choose those occupations where retraining costs, in relation to
net worth, are lowest; and (3) workers will tend to choose those occupations
where, ceteris paribus, the discounted present value of expected
earnings foregone due to unemployment is lowest.¹³

All effects have the expected sign and almost all are measured quite
precisely.

The most striking result is that white males tend to weight training
costs and expected income foregone due to unemployment relative to
expected full-time earnings much less heavily than the other groups.
This is consistent with the hypotheses of differential access to financing
of training and education costs and of differential risk aversion.

It is instructive to examine the ratio of the coefficients for expected
full-time earnings and expected earnings foregone due to unemployment.
This figure varies markedly by race and sex. The expected earnings
foregone due to unemployment do not appear to exert much of an influence
on white males. The price ratio of trading $1 in full-time earnings for $20
in decreased earnings foregone due to unemployment suggests (perhaps)


¹³ The predicted probabilities for adopted occupations ranged up to three times the
predicted probabilities if random behavior was observed (probabilities equal to the
percentages of jobs in each occupation).

¹⁴ The results are similar when the training cost variable is left out of the equation.
These results are available upon request from the author. It also should be pointed out
that the likelihood ratio method may be used to test hypotheses about the coefficients.




that measured unemployment for white males includes a not insignificant
amount of voluntarily enjoyed productive activity, for example, leisure
or job search. For the other population subgroups (females, many of
whom work part-time and/or on and off throughout their lifetime, and
black males) the price ratio is reversed. For example, white females
appear to be willing to trade $1 less in foregone earnings due to
unemployment for $20 of full-time earnings. This strongly hints of risk aversion or
nonpecuniary costs involved in white female unemployment.
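
The price ratios quoted above follow from dividing the earnings coefficient by the absolute value of the unemployment coefficient for each group. The sketch below uses the coefficients reported in table 3 and assumes that both coefficients within a group share the same scale factor, so that the factor cancels in the ratio; the function name is illustrative.

```python
# Coefficients as reported in table 3 (assumed to share a common scale factor).
coef = {
    "white males":   {"earnings": 1.084, "unemployment": -0.051},
    "black males":   {"earnings": 0.072, "unemployment": -4.35},
    "white females": {"earnings": 0.875, "unemployment": -18.74},
    "black females": {"earnings": 0.378, "unemployment": -20.78},
}

def tradeoff(group):
    """Dollars of reduced unemployment loss that offset $1 of full-time earnings."""
    c = coef[group]
    return c["earnings"] / abs(c["unemployment"])

ratios = {group: tradeoff(group) for group in coef}
```

For white males the ratio is a little over 20, while for white females it is the reciprocal that is a little over 20, which is the reversal described in the text.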


VI. Summary and Conclusion

We have applied the conditional logit decision model to the choice of
occupation by individual workers to test some important implications
of the theory of human capital. The empirical results are quite consistent
with the human capital hypothesis that workers choose occupations to
maximize the discounted present value of potential lifetime work time.
Allowing for imperfect capital markets by including training costs
relative to wealth and for unemployment by including the discounted
present value of expected earnings foregone due to unemployment also
yielded results consistent with a priori expectations.

The conclusions apply to all four major race-sex population subgroups.
The apparent differences among subgroups are consistent with well-known
labor market phenomena.


References

Becker, G. Human Capital. New York: Nat. Bur. Econ. Res., 1964.
Block, H., and Marschak, J. “Random Orderings and Stochastic Theories of
Responses.” In Contributions to Probability and Statistics: Essays in Honor of Harold
Hotelling, edited by I. Olkin et al. Stanford, Calif.: Stanford Univ. Press, 1960.
Boskin, M. “Unions and Relative Real Wages.” A.E.R. 62 (June 1972): 466-72.
-. “The Economics of the Labor Supply.” In Labor Supply and Income
Maintenance, edited by G. Cain and H. Watts. Chicago: Markham, 1973.
Fleisher, B. Labor Economics: Theory and Evidence. Englewood Cliffs, N.J.:
Prentice-Hall, 1970.
Hall, R. “Why Is the Unemployment Rate So High at Full Employment?”
Brookings Papers Econ. Activity 3 (1970): 369-402.
-. “Wages, Income, and Hours of Work in the U.S. Labor Force.” In
Labor Supply and Income Maintenance, edited by G. Cain and H. Watts. Chicago:
Markham, 1973.
McFadden, D. The Revealed Preferences of a Government Bureaucracy. Economic
Growth Project, Technical Report no. 17. Berkeley: Univ. California, 1968.
Radner, R., and Miller, L. “Demand and Supply in U.S. Higher Education: A
Progress Report.” A.E.R. 60 (May 1970): 326-40.
Reder, M. “The Theory of Occupational Wage Differentials.” A.E.R. 45
(December 1955): 833-52.
Schultz, T., ed. “Reflections on Investment in Man.” J.P.E. 70, no. 5, suppl.
(October 1962): 1-8.
Scoville, J. “Education and Training Requirements for Occupations.” Rev. Econ.
and Statis. 46 (November 1966): 387-94.
Theil, H. “A Multinomial Extension of the Linear Logit Model.” Internat.
Econ. Rev. 10 (October 1969): 251-59.



Occupational Licensing and the 
Public Interest 

Alex Maurizi 

Office of Research, Assistant Secretary for Policy, Evaluation and Research, U.S. Department of Labor


Introduction 

Occupational licensing has been justifiable in the view of legislatures on
the grounds that it protects the public interest; often, however, it is the
producers of the good or service who present this argument to the state
legislatures. This is hardly surprising, since the typical consumer is
likely to suffer too small a wealth loss (in the form of higher prices) in the
licensing of one more occupation, as has already been pointed out by
others (Friedman 1962, p. 143). The end result is the promotion of the
interests of the producer group rather than those of the public. Though
this is what we economists would expect to happen, the principal difficulty
in making progress with our case has been in trying to convince legislatures
that the occupations receiving licenses do use their power to
promote their own wealth at the expense of the consumer. It is the purpose
of this paper to present evidence on this point in order that the ground
rules for discussing the problem with state legislators can be altered.

It will first be necessary to present the theoretical apparatus with which
I will inspect and evaluate the data. It should be noted at the outset that
some of the ingredients of my model were mentioned in one very interesting
earlier study of the purpose of licensing. In particular, Moore (1961,
p. 112) expressed the proposition that “the more demand is growing for
the occupation, the greater the return from licensing.” Moore concluded
that the evidence indicated the tendency to license was not correlated
with its monetary return (1961, p. 113). I have formulated the hypothesis
somewhat more precisely and have used appendix data from the same
study utilized by Moore but which appendix data he neglected to
analyze (Council of State Governments 1952, p. 92); my investigation
tends to support the same basic hypothesis which his investigation failed
to support.

A Simple Model of Licensing 

The usual short-run and long-run supply curves are presented in figure 1
together with a market demand curve; we assume the initial position to
be one of short-run competitive equilibrium. Licensing initially raises





entry costs by imposing at least a fee and usually also training
requirements,¹ and shifts the short-run and long-run supply curves upward and

¹ In addition to requiring payment of fees (usually nominal, but amounting to $1,500
for glaziers in Chicago and $500 for truckers in Seattle in 1955) and completion of formal
training in approved institutions, there are numerous ways in which the licensing agency
or body may increase the cost and inconvenience of attempting to enter the occupation.
For example, by requiring that examinations be conducted at only one location in the
state, the agency increases travel costs for some applicants. By sometimes requiring
applicants to bring all their own tools and equipment to the examining station, the agency
adds a considerable cost to some applicants. Some cosmetologists, for example, must
bring their own human model; some truck drivers must bring their own trucks.
Furthermore, in some occupations knowledge about what applicants must do to pass the
examination is difficult to obtain; plumbers' examinations are so unpredictable that applicants
often must take the exam once (figuring to fail it) in order to find out what it is like.
And, of course, most occupations require applicants to be of “good moral character”;
even the (otherwise) best qualified applicants have been turned down on these grounds
sometimes, as the experience of one top-ranking law school graduate who refused to sign
a loyalty oath testifies. For details regarding these and numerous other similar cases,
see Gellhorn (1968, pp. 105-52). For simplicity, I shall summarize these various elements
of cost to the applicant under the category of “fees and training requirements.” There is a
distinction to be made, to be sure, between the effects of requiring an exam and charging
a fee for examinees and the establishment of training requirements. The former usually
are nominal requirements, though the above discussion clearly indicates there are
exceptions. Though some people are probably turned away even by these requirements, their
numbers are probably negligible in comparison to the numbers turned away by the
formal training requirements. In most occupations applicants are required not only to
complete a specified number of years of high school or college education, but also to
attend an approved school designed to impart the necessary skills. Though I shall speak
of “fees and training requirements” in connection with licensing, it should be understood
that the principal entry restriction occurs as a result of required training.






to the left. Under these conditions, after licensing is imposed there will
be a drop in entrants and an income gain distributed to members of the
profession. Once the magnitude of these long-run gains is realized and
becomes widely known, there will be an increase in entrants once again.
In the absence of other changes, prices (incomes) will decline in the long
run to position C. The entry restriction will prevent position D from being
reached. Arriving at this long-run equilibrium position may take
considerable time, during the early years of which period, of course,
established practitioners are enjoying higher incomes.

Increases in fees and training requirements do occur over time in an
occupation, of course; one need only compare the length of training
required to become a physician today with that required less than a
century ago: “The training legally required of medical practitioners has
thus been extended since the beginning of the century from three or four
years of professional education preceded by one or two years of high
school to six or seven years preceded by graduation from high school”
(Friedman and Kuznets 1954, p. 10). Changes of this nature occur
slowly over time and are not likely to be a major instrument through which
licensed occupations make alterations in their wealth position from year
to year. For this latter purpose it is more likely that licensed occupations
will utilize the pass rate on the yearly examinations. The extent to which
the pass rate is altered will depend on, among other things, the extent
of excess demand or supply existing at the time. The greater the extent of
excess demand² generated by a shift in D (initially evidenced by actual,


² My discussion will be in terms of a positive excess demand, though the same model
applies to a negative excess demand (excess supply). Hicks has pointed out that economic
expansion followed by stagnation provided the historical early trading centers around
1400 with an even greater incentive to enter into arrangements in restraint of
competition: “The growth of guilds and similar associations is well known.” “As opportunities
in general close in, or seem to close in, the fields in which it becomes tempting to protect
oneself by agreements with one's competitors become more extensive.” See Hicks (1969,
p. 57). More recently, the depression of the 1930s had its impact on the admission of
students into medical school. Following annual increases from 1929, from 1934 to 1939
there was a decline in the number of and percentage of applicants accepted. A comment
made then by the New York State Board of Medical Examiners' secretary is of interest
here: “The trend is still in an early stage, but in law, medicine, dentistry, and other
professions under control of state licensure, the signs are apparent. Without intention
or design, the far-reaching steps taken by the physicians to raise educational standards
during the past twenty-five years has resulted in limiting the number of students. Now
[1933], realizing the advantages of this unplanned restriction, leaders are taking definite
steps to cut down the professional class” (Friedman and Kuznets 1954, pp. 12-14). If a
decline in the number of applicants should prove inadequate during periods of large
excess supply, the licensing agency must seek an alternative instrument. It is likely at this
point more thought will be devoted to identifying and eliminating “quacks” or
incompetents. During the 1930s in medicine, graduates of foreign medical schools faced
increasingly difficult entry requirements (Friedman and Kuznets 1954, pp. 17-20). At
some point the licensing agency might even offer practitioners in the area of excess supply
a monetary incentive to leave. Though I could find no evidence of this happening, it is
interesting to note that labor unions sometimes pay some of their members to cease seeking
employment in the industry. The Amalgamated Clothing Workers at the Chicago plant






temporary shortages at initially prevailing prices), the more rapidly will
prices (hence incomes) rise, the more rapidly will entry rise, and thus the
greater will be the incentive for the licensing agency to lower the pass
rate.³ The reasoning here may best be illustrated with the aid of figure 2.
The nature of the price adjustment process⁴ is described in figure 2.
Curves I (A’g) and (A’g) assume no excess demand and a short-run
equilibrium position initially above the long-run position; with no shifts
in D or S, price will eventually fall to its long-run level, higher for the


of Hart, Schaffner, and Marx had experienced a change in cutting methods which
created an excess supply of cutters. The union took 125,000 from their unemployment
insurance fund to help compensate any dismissed worker, who took the money with “the
understanding that he was to leave the industry altogether and go into some other line
of work.” See “American Trade Unions and the Problem of Unemployment” (1928,
p. 483); see also Slichter (1941, p. 131). I am indebted to Melvin W. Reder for this
reference.

* The relation between the extent of aggregate excess demand and the rate of change
of labor earnings (wage rates) has been investigated empirically by Lipsey in relation to
the Phillips curve, of course. Using the inverse of the unemployment rate as an indicator
of excess demand, Lipsey (1960) found that wage rate changes from 1862 to 1957 were
well explained by the level of excess demand and the rate of change of the level of excess
demand.

* See Reder (1960) for a discussion of the adjustment process in connection with the
union influence on wages: “The ethics of the queue affect the adjustment process of
labor markets, but only rarely their equilibrium values.” As Reder points out, to raise the
long-run equilibrium price level would require entry restriction activity on the part of
the union or, in our case, licensed occupations (see Reder 1960, pp. 351-54).






licensed occupation than the nonlicensed one. Curve $II_1$ ($X_1$) begins at
time zero with the short-run equilibrium value of P, which is lower than
for the licensed occupation indicated by $II_2$ ($X_1$); both are above their
long-run equilibrium values. Curve $II_1$ ($X_1$) corresponds to the price
response to a small amount of excess demand ($\dot{P}$, the slope of the curve, is
initially relatively low) in a competitive situation with unrestricted
entry, while $III_1$ ($X_2$) corresponds to the price response to a larger amount
of excess demand with unrestricted entry. Note that $III_1$ ($X_2$) rises faster
to a higher peak value and then falls to P faster than $II_1$ ($X_1$); this rapid
falling off, we suppose, is due to the strength of the signal to potential
entrants generated by the rapid price increase.^ In a competitive situation
with entry restriction, however, the price will rise to a higher peak value
than $II_1$ ($X_1$) (given the same excess demand) and will fall off much more
slowly, as indicated by $II_2$ ($X_1$). But with a greater excess demand ($\dot{P}$ is
greater) entry will be restricted more, prices will rise to a higher peak
value and trail off more slowly, as indicated by $III_2$ ($X_2$) compared to
$II_2$ ($X_1$). Thus, even if the increased entry restrictions do not alter the
long-run equilibrium price level, there is a substantial difference in the
nature of the adjustment process.*

Our primary task now is to measure the empirical counterparts of 
these two theoretical variables. 


The Data 

Though several studies have collected data on licensed occupations by
state (Council of State Governments 1952, 1968; Vaughan 1963; Shimberg

’ Alchian has recently pointed out that even in a competitive situation, downward
shifts in demand would cause downward adjustments in price, notably the price of labor
services, to occur with a considerable lag. Workers must discover by searching that such
a demand shift has occurred. Obviously, the greater the downward shift, the sooner the
price adjustment occurs. Once the price shift has occurred, it will in itself serve to
communicate what has happened. In this study I am concerned with the change in price
itself once it has occurred. The same principle is at work, however. I assume that
information about the permanence of any shift in demand is costly and the extent of the
price rise is one indicator of this permanence to potential entrants. See Alchian (1969).

* If we suppose that the demand and supply curves are linear, then the following is
true: $D = -ap + c$, $S = bp + d$, $\dot{p} = k(D - S) = kX$, where
$X = D - S = -ap + c - (bp + d) = -(a + b)p + (c - d)$; $dX/dp = -(a + b)$. Let $p^*$ be
the short-run equilibrium price and $p$ the actual price. Then $p^*$ satisfies
$X = 0 = -(a + b)p^* + (c - d)$. Multiplying by $k$,

$$0 = -k(a + b)p^* + k(c - d). \tag{1}$$

But since $\dot{p} = kX$,

$$\dot{p} = -k(a + b)(p - p^*) = k(a + b)(p^* - p). \tag{2}$$

That is, the rate of increase in prices will initially be larger, the larger the shift in $D$;
as $p$ approaches $p^*$, the rate of increase in $p$ will gradually decrease until it reaches zero
when the new short-run equilibrium price is reached. See Arrow and Capron (1959).
Thus, the peaks of the curves drawn in fig. 2 correspond to the positions of new short-run
equilibrium where $\dot{p}$ is momentarily zero. It should be pointed out that the adjustment
process from the new short-run equilibrium position to the final long-run equilibrium
position has not been previously investigated.
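
The adjustment rule in this note can be traced numerically. The sketch below is only an illustration, not part of the original analysis: the parameter values, the starting price, and the function name `simulate_price_path` are all hypothetical, and a simple Euler step stands in for the continuous-time rule. It shows the price rising quickly at first and then ever more slowly as it approaches the short-run equilibrium price.

```python
def simulate_price_path(a, b, c, d, k, p0, dt=0.01, steps=2000):
    """Euler-integrate dp/dt = k * (D - S), with D = -a*p + c and S = b*p + d."""
    p_star = (c - d) / (a + b)  # price at which excess demand X is zero
    path = [p0]
    p = p0
    for _ in range(steps):
        excess_demand = (-a * p + c) - (b * p + d)  # X = D - S
        p += dt * k * excess_demand
        path.append(p)
    return p_star, path


# Hypothetical parameter values, chosen only for illustration.
p_star, path = simulate_price_path(a=1.0, b=1.0, c=12.0, d=2.0, k=0.5, p0=3.0)
first_step = path[1] - path[0]
later_step = path[101] - path[100]
print(p_star, path[-1], first_step, later_step)
```

With these assumed values the increments shrink as the gap between the actual and equilibrium price closes, which is the qualitative pattern the note describes.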




1968; Greene 1969; Shimberg, Esser, and Kruger 1973), only one previous
study (Council of State Governments 1952) has collected and published
the kind of data needed for our study. For approximately 24 occupations
licensed in various states, the Council of State Governments collected
and published by state the number of practitioners licensed in the state,
the number of applicants for licensure, and the number of new licenses
granted (1952, appendix tables 6, 7) for 1940 and 1950. For each
occupation in each year I chose the number of states for which data were
available for all three of these concepts. Most of the occupations chosen
were licensed in nearly all of the states, but the data were usually available
for only a small fraction of them. The data do present some difficulties,
however. First, there is seldom uniform agreement among the statutes
regarding the definition of what constitutes a licensed occupation, so that
some caution must be used in interpreting the results. Second, the number
of new licenses granted is sometimes greater than the number of new
applicants. This is because new licenses granted by reciprocity are
included in the former category while applicants for licensure by
reciprocity are not included in the latter. I attempt to make an adjustment
for this possible bias in the data in my study.


The Variables of the Hypothesis 

As an indicator, X, of excess demand, I choose the ratio of new applicants
in the state to the number of licensed practitioners already in the state.
The greater this ratio, the greater I assume excess demand for that
occupation in the state to be in that year. Using figure 1 we can see that
X is essentially $(q_1 - q_0)/q_0$. Given $q_0$, obviously the greater is $q_1$, the
greater is X. It seems quite reasonable to suppose that the number of
new people who would like to enter the occupation (and indicate as much
by applying for a license) is a good approximation to $(q_1 - q_0)$.

As an indicator of the leftward shift in the supply curve caused by entry
restriction, I choose the pass rate, P, or the ratio of the number of licenses
granted to the number of applicants.^ The pass rate is clearly a variable
under the control of the licensing board. Decisions regarding the pass rate
normally would be made after all the applications had been received. If
the license exams are held, say, once a year, then the board members
clearly have knowledge of X before the exams are given. The same holds
if the exams are given periodically during the year, except that applicant
rates for shorter periods must be examined for possible trends in yearly
data. Since in most cases the licensing board is not required even to

^ In a previous study it was assumed that P was an exogenous variable, though it was
recognized that this assumption might be faulty. In this study P is not exogenous; it is
the variable whose behavior we wish to explain. See Benham, Maurizi, and Reder (1968).




declare the reasons for failure, it is clearly possible for the board to alter
P in response to X.

My hypothesis can now be stated as follows:

$$P = A + BX, \qquad B < 0. \tag{3}$$

(To test this hypothesis I ran regressions for each occupation by state for
1940 and 1950.)

As mentioned above, the data are such that my estimate of P is greater
than one for some observations for which P is actually less than one. To
correct for this I have run the regressions with estimated P's greater than
one eliminated.

Although the number of observations eliminated in this way was quite
small (only 12 of 238 observations were eliminated in 1940, only 29 of
370 in 1950, 17 of which were among pharmacists), I investigated the
possible bias introduced by running the regressions without eliminating
these observations. The results were not substantially altered.*
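
The construction of the two variables, and the elimination of observations with an estimated P above one, can be sketched as follows. This is a minimal illustration under stated assumptions: the record layout, the field names, and the three state records are hypothetical and do not come from the Council of State Governments tables.

```python
from dataclasses import dataclass


@dataclass
class StateRecord:
    state: str
    licensed_stock: int    # practitioners already licensed in the state
    applicants: int        # new applicants for licensure
    licenses_granted: int  # new licenses granted (may include reciprocity)


def excess_demand_indicator(record):
    """X: ratio of new applicants to the stock of licensed practitioners."""
    return record.applicants / record.licensed_stock


def pass_rate(record):
    """P: ratio of licenses granted to applicants."""
    return record.licenses_granted / record.applicants


def usable_observations(records):
    """Return (X, P) pairs, dropping records whose estimated P exceeds one."""
    pairs = []
    for record in records:
        p = pass_rate(record)
        if p <= 1.0:
            pairs.append((excess_demand_indicator(record), p))
    return pairs


# Hypothetical records, for illustration only.
records = [
    StateRecord("State A", licensed_stock=1000, applicants=100, licenses_granted=80),
    StateRecord("State B", licensed_stock=500, applicants=100, licenses_granted=60),
    StateRecord("State C", licensed_stock=800, applicants=40, licenses_granted=50),
]
pairs = usable_observations(records)
print(pairs)
```

The third record is dropped because licenses granted exceed applicants, the situation the text attributes to licenses granted by reciprocity.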


The Results 

The results of my investigation are reported for 1940 in table 1 and for
1950 in table 2. Regression coefficients with their t-values in parentheses
below are recorded, as well as corrected $R^2$ and the degrees of freedom.
For each regression I have also calculated and recorded the elasticity of
P with respect to X evaluated at the point of means.
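
The quantities reported for each regression can be computed as in the sketch below. It is an illustration only: the five (X, P) observations are hypothetical, the function name `bivariate_ols` is an assumption, and degrees of freedom are taken as n - 2, which may not match the convention used in the tables.

```python
import math


def bivariate_ols(xs, ps):
    """Least-squares fit of P = A + B*X with t-values, corrected R^2,
    degrees of freedom, and the elasticity of P with respect to X at the means."""
    n = len(xs)
    x_bar = sum(xs) / n
    p_bar = sum(ps) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxp = sum((x - x_bar) * (p - p_bar) for x, p in zip(xs, ps))
    b = sxp / sxx
    a = p_bar - b * x_bar
    rss = sum((p - (a + b * x)) ** 2 for x, p in zip(xs, ps))
    tss = sum((p - p_bar) ** 2 for p in ps)
    df = n - 2  # assumed convention: observations minus estimated coefficients
    sigma2 = rss / df
    se_b = math.sqrt(sigma2 / sxx)
    se_a = math.sqrt(sigma2 * (1.0 / n + x_bar ** 2 / sxx))
    r2 = 1.0 - rss / tss
    return {
        "A": a,
        "B": b,
        "t_A": a / se_a,
        "t_B": b / se_b,
        "df": df,
        "corrected_R2": 1.0 - (1.0 - r2) * (n - 1) / df,
        "elasticity": b * x_bar / p_bar,  # evaluated at the point of means
    }


# Hypothetical observations, for illustration only.
xs = [0.05, 0.10, 0.15, 0.20, 0.25]
ps = [0.95, 0.85, 0.80, 0.70, 0.65]
fit = bivariate_ols(xs, ps)
print(fit)
```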

The reader should note that of 36 cases, A is significantly positive in all
36 and B is negative for 27 of them, and of these B is significant 17 times.
The amount of variation explained in most of the significant cases is
quite high for a simple bivariate cross-section relationship: it is over
.1 in 16 of the significant cases, over .2 in 12 of the significant cases, and
over .3 in nine of the 17 significant cases. The elasticity is less than 1.0 in
absolute value in all but one of the significant cases and varies primarily
between -0.10


' The bivariate regression coefficient remained negative and at least as significant in
nine out of 10 cases, actually increasing negatively in six of these nine for 1950; in one
case the sign changed from an insignificant positive to an insignificant negative one. The
$R^2$ did drop somewhat in two of these nine cases (from .28 to .17 and from .28 to .21)
and quite substantially in one (from .17 to .04), that one being pharmacists, for which
the number of observations was nearly doubled. The drop in $R^2$ in the other six cases
was slight. The results for 1940 were similar. The bivariate regression coefficient became
larger negatively and remained as significant in six of 11 cases; in one case the negative
coefficient became significant; in three of 11 cases the sign of the coefficient went from
(insignificant) positive to (insignificant) negative; in one case (physicians) it went from
significant negative to insignificant positive. The $R^2$ was only slightly less in all but this
one case, where it decreased from .61 to .05. In other words, except for physicians in
1940 and pharmacists in 1950, excluding all observations where P > 1 does not alter the
results.





TABLE 1

P = A + BX, 1940

Occupation                       A (T(A))          B (T(B))               df    R^2     E
01. Accountants                  0.521 (3.51)      -0.844 (-1.39)*         12    .07   -.52
02. Architects                   0.766 (5.23)      -0.665 (-2.38)**         9    .32   -.31
03. Attorneys                    1.225 (7.70)      -8.5[?] (-3.04)***       9    .45   -.69
04. Barbers                      0.776 (6.88)       1.357 (0.38)           12    .00    .05
05. Beauticians                  1.023 (27.65)     -0.896 (-7.63)***       10    .84   -.20
06. Chiropodists                 0.558 (3.96)       2.334 (1.70)*          13    .12    .23
07. Chiropractors                0.940 (24.48)     -0.056 (-0.49)           9    .00   -.01
08. Dentists                     1.016 (16.62)     -5.305 (-3.75)***       22    .36   -.26
09. Embalmers                    0.837 (15.87)     -0.847 (-4.36)***       11    .60   -.13
10. Engineers, professional      0.655 (9.98)       0.996 (1.46)           15    .07    [?]
11. Funeral directors            1.067 (6.05)      -5.488 (-1.76)           3    .34   -.33
12. Nurses, registered           0.906 (8.59)      -0.205 (-0.17)          11    .00   -.02
13. Optometrists                 0.877 (5.53)      -2.469 (-1.31)          12    .05   -.30
14. Osteopaths                   0.850 (10.79)      0.091 (0.29)           14    .00    .01
15. Pharmacists                  0.829 (15.46)     -0.049 (-0.22)          11    .00   -.01
16. Physicians                   1.002 (21.17)     -0.611 (-5.28)***       16    .61   -.15
17. Real estate brokers and
    salesmen                     1.017 (15.81)     -0.828 (-7.30)***        5    .90   -.47
18. Veterinarians                0.928 (21.18)      0.047 (0.34)            9    .00    .01
All occupations combined         0.83 (43.50)***   -0.460 (-6.10)***      237    .13   -.08

Note: t-values are in parentheses; E is the elasticity of P with respect to X at the point of means. [?] marks a digit or entry that is illegible in the source.
* Significant at the 10% level
** Significant at the 5% level
*** Significant at the 1% level


and -0.50. That is, a 10 percent increase in excess demand will generate
a fall in the pass rate of between 1.0 percent and 5 percent.* In general,

* Since my dependent variable is of the form $x/y$, and my independent variable, of the
form $y/z$, the questions arise as to whether there is spurious correlation present and if so
whether it tends to generate a negative relationship. It should be noted, however, that
the question of spurious correlation generally arises when a hypothesis pertains to a set
of undeflated variables and the data are adjusted by dividing through by another variable.
Thus, if an initial hypothesis suggests $x = f(y, z)$ and the data for $x$, $y$, and $z$ are all
divided by $w$ so that the regression run is of the form $x/w = f(y/w, z/w)$, spurious
correlation may arise. See Kuh and Meyer (1955) for a detailed discussion. To the best of
my knowledge, the extent of spurious correlation when the regression run is of the form
$x/y = f(y/z)$ has not been investigated. Therefore, I have made a simulation test to
determine the extent of any negative bias, if any. The values for $x$, $y$, and $z$ were picked






TABLE 2

P = A + BX, 1950

Occupation                       A (T(A))          B (T(B))               df    R^2     E
01. Accountants                  0.486 (6.45)      -0.329 (-2.63)***       19    .23   -.49
02. Architects                   0.713 (9.79)      -0.139 (-1.04)          22    .00   -.74
03. Attorneys                    0.662 (7.50)       0.096 (0.52)            5    .00    .04
04. Barbers                      0.860 (11.31)     -1.251 (-1.08)          22    .01   -.10
05. Beauticians                  0.916 (11.12)     -0.909 (-0.86)          14    .00   -.07
06. Chiropodists                 0.917 (9.56)      -0.557 (-0.72)          19    .00   -.07
07. Chiropractors                0.822 (11.33)     -0.135 (-2.08)**        17    .16   -.06
08. Dentists                     0.955 (19.55)     -1.650 (-2.36)**        26    .14   -.12
09. Embalmers                    0.839 (12.05)     -0.881 (-2.87)***       18    .28   -.13
10. Engineers, professional      0.693 (16.17)     -0.001 (-0.01)          25    .00   -.00
11. Funeral directors            0.866 (8.31)      -0.749 (-2.23)**       [?]    .28   -.13
12. Nurses, registered           0.786 (7.08)      -0.023 (-0.02)          16    .00   -.00
13. Optometrists                 0.853 (11.19)     -1.095 (-2.49)***       25    .17   -.24
14. Osteopaths                   0.731 (7.15)       0.347 (0.717)          17    .00    .04
15. Pharmacists                  0.849 (17.30)     -0.515 (-2.60)***       28    .17   -.09
16. Physicians                   0.890 (23.45)      0.131 (0.67)           17    .00    .01
17. Real estate brokers and
    salesmen                     0.938 (12.99)     -0.636 (-5.81)***       19    .62   -.54
18. Veterinarians                0.917 (18.49)     -0.028 (-0.15)          17    .00   -.00
All occupations combined         0.80 (54.10)***   -0.29 (-7.90)***       369    .14   -.07

Note: t-values are in parentheses; E is the elasticity of P with respect to X at the point of means. [?] marks an entry that is illegible in the source.
** Significant at the 5% level
*** Significant at the 1% level


at random from a normal distribution with unit mean and variance. This choice was
made because the data indicate that the coefficient of variation (the ratio of the standard
deviation to the mean) for each of the three variables varies around unity; table 3
presents the coefficients of variation for 1940 only, since the results for 1950 are similar. As
is clear from table 3, the variation in Y is sometimes greater than that in X or Z. In six
cases the variation in Y is greater, but in seven cases it is less than that in either X or
Z; in the remaining cases it is in between and nearly the same as that for either X or Z.
The numbers in all three cases appear to be scattered above and below, but generally
close to unity. The simulation run consisted of 20 regressions whose sample size varied
from 34 to 64. The mean regression coefficient equaled -.027; the t-test on this mean
equaled -3.07, significant at the 5 percent level. In other words, there is a negative bias
present in the regression coefficient. The size of the bias, however, is no more than
one-fifth to one-tenth, often only one-hundredth, the size of the regression coefficients
presented in the tables. The bias will cause an overestimation of the effect we are measuring
but is not likely to cause a change in the sign of the coefficient itself.
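
A simulation of the kind described in this note might be set up as in the sketch below. It is an assumption-laden illustration, not a reproduction of the original test: the seed, the helper names, and the handling of draws are hypothetical, and the note does not say how draws near zero or below zero were treated. Because ratios of normal variates can take extreme values, results from a sketch like this are sensitive to the seed and should not be expected to match the figures reported above.

```python
import math
import random
import statistics


def ols_slope(xs, ys):
    """Slope of the least-squares line of ys on xs."""
    x_bar = statistics.mean(xs)
    y_bar = statistics.mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    return sxy / sxx


def simulate_ratio_bias(n_regressions=20, min_n=34, max_n=64, seed=0):
    """Regress x/y on y/z for independent draws of x, y, z from a normal
    distribution with unit mean and unit variance, repeated n_regressions times."""
    rng = random.Random(seed)  # hypothetical seed, for repeatability only
    slopes = []
    for _ in range(n_regressions):
        n = rng.randint(min_n, max_n)
        dependent, independent = [], []
        for _ in range(n):
            x, y, z = (rng.gauss(1.0, 1.0) for _ in range(3))
            dependent.append(x / y)    # form of the pass rate
            independent.append(y / z)  # form of the applicant-stock ratio
        slopes.append(ols_slope(independent, dependent))
    mean_slope = statistics.mean(slopes)
    t_stat = mean_slope / (statistics.stdev(slopes) / math.sqrt(len(slopes)))
    return slopes, mean_slope, t_stat


slopes, mean_slope, t_stat = simulate_ratio_bias()
print(mean_slope, t_stat)
```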







TABLE 3

COEFFICIENTS OF VARIATION, 1940

Occupation    No. Passing (X)    No. Applying (Y)    Stock (Z)
01                 0.96               1.08             0.93
02                 1.32               1.06             1.17
03                 0.63               0.76             0.71
04                 0.89               0.89             0.78
05                 1.05               1.05             1.08
06                 2.79               2.85             1.99
07                 2.19               2.17             1.86
08                 1.02               0.96             1.10
09                 1.05               1.08             0.95
10                 0.82               0.76             1.03
11                 0.66               0.72             0.53
12                 1.24               1.26             0.93
13                 1.19               1.00             0.95
14                 1.23               1.22             1.33
15                 0.81               0.72             0.80
16                 3.21               1.38             1.34
17                 0.80               1.94             1.14
18                 0.80               0.77             0.91


for significant cases in 1950 the corrected $R^2$ is less than in 1940. With
only two points it is difficult to determine a trend, but if these points are
indicative of such a trend, it would indicate that my model has less
explanatory power in 1950. That may be due to changes in some missing
explanatory variable; one obvious candidate for inclusion is the level of
practitioner incomes.

The addition of practitioner incomes along with the applicant-stock
ratio as independent variables raises some question regarding the
possibility of multicollinearity. Though the income variable pertains to the
level of income while the applicant-stock ratio is derived from and is an
indicator of the rate of rise in the level of income, there is nonetheless a
question as to whether high-income states and rising-income states are
coincident. In my pooled sample, the correlation matrix indicates mostly
a small, positive correlation between the two variables; the zero-order
correlations equal .12, .38, .19, -.02, .05, -.16, and .01, respectively,
for the occupations in my pooled sample. The problem of multicollinearity
does appear to be of minor importance in this case.
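
The zero-order correlation used as the multicollinearity check above is the ordinary Pearson correlation between the two regressors. A minimal sketch follows; the function name and the sample values are hypothetical and serve only to show the calculation.

```python
import math


def zero_order_correlation(xs, ys):
    """Pearson correlation between two equally long lists of observations."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical applicant-stock ratios and mean incomes (in dollars) by state.
applicant_stock_ratio = [0.05, 0.10, 0.15, 0.20, 0.25, 0.30]
mean_income = [3000.0, 5000.0, 4000.0, 6000.0, 3500.0, 5500.0]
r = zero_order_correlation(applicant_stock_ratio, mean_income)
print(round(r, 2))
```

A value of r near zero, as in most of the occupations listed above, suggests the two regressors carry largely separate information.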

If in two states an occupation is experiencing the same increase in
excess demand, but the average income of practitioners in one state is
higher, then the capitalized gain from entry restriction is greater in the
higher-income state. I conclude that the tendency to restrict would
therefore be greater in the higher-income state. In order to test this
hypothesis I have introduced one additional explanatory variable, Y,
the mean income of practitioners in the state in the years 1940 and 1950.
My hypothesis now is $P = A + BX + CY$, $B < 0$, $C < 0$. Since mean
incomes for some occupations are not available in these years, my sample









of occupations is considerably reduced. Of the original 18 occupations
only eight are left in 1940 and 10 in 1950. The regression results are
reported in table 4 for 1940 and table 5 for 1950.
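
The extended hypothesis, P = A + BX + CY, can be fitted by solving the least-squares normal equations. The sketch below is an illustration under stated assumptions: the observations are hypothetical and constructed to lie exactly on a known plane, income is measured in thousands of dollars purely to keep the arithmetic well scaled, and the helper names are not from the original study.

```python
def solve_linear_system(matrix, rhs):
    """Solve a small square linear system by Gaussian elimination with pivoting."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    solution = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(aug[i][j] * solution[j] for j in range(i + 1, n))
        solution[i] = (aug[i][n] - tail) / aug[i][i]
    return solution


def fit_pass_rate(xs, ys, ps):
    """Least-squares estimates (A, B, C) for P = A + B*X + C*Y."""
    rows = [[1.0, x, y] for x, y in zip(xs, ys)]
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xtp = [sum(r[i] * p for r, p in zip(rows, ps)) for i in range(3)]
    return solve_linear_system(xtx, xtp)


# Hypothetical data lying exactly on P = 1.0 - 2.0*X - 0.01*Y.
xs = [0.05, 0.10, 0.15, 0.20, 0.25, 0.30]
ys = [3.0, 5.0, 4.0, 6.0, 3.5, 5.5]  # mean income, thousands of dollars (assumed unit)
ps = [1.0 - 2.0 * x - 0.01 * y for x, y in zip(xs, ys)]
A, B, C = fit_pass_rate(xs, ys, ps)
print(A, B, C)
```

Both slope estimates are negative here by construction, which is the sign pattern the hypothesis predicts.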

Of the eight cases in 1940, X is significantly, negatively related to P
four times and positively but not significantly related to P four times.
The variable Y is negatively related to P five times and significantly
twice, while being positively related to P three times, once significantly.
Corrected $R^2$ is increased in only two cases by adding Y. The elasticity
estimates with respect to X are quite similar to those reported in table 1
except for attorneys. The result in table 4 indicates a very large, negative
elasticity of -5.87 for attorneys in 1940 with Y present, compared to
-0.69 in 1940 with Y absent. Both variables are significantly, negatively
related to P, and corrected $R^2$ is quite high (.78) and significant. This
result would indicate that for two states in which attorneys were
experiencing the same average incomes, the one in which the excess demand
was 10 percent greater would exhibit a pass rate nearly 60 percent lower.
For all other occupations in 1940 (and 1950), this elasticity was less than
0.5 in absolute value, indicating that (holding average incomes constant) a
10 percent increase in excess demand would result in at most a 5 percent
decrease in the pass rate. The elasticity of the pass rate with respect to Y
was less than 1.0 in absolute value in 1940 (and 1950). For occupations in
two states experiencing the same excess demand, the pass rate would fall by
10 percent (at most) for every 10 percent increase in average incomes.
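
The percentage statements in this paragraph follow from treating each elasticity as a constant over the change considered, which is a linear approximation. A minimal sketch of that arithmetic, using the two attorney elasticities quoted in the text (the function name is hypothetical):

```python
def implied_pass_rate_change(elasticity, pct_change_in_x):
    """Approximate percent change in the pass rate implied by an elasticity."""
    return elasticity * pct_change_in_x


with_income = implied_pass_rate_change(-5.87, 10.0)     # Y present
without_income = implied_pass_rate_change(-0.69, 10.0)  # Y absent
print(with_income, without_income)
```

For attorneys in 1940 the first figure is the "nearly 60 percent" fall cited above; the second is the much smaller fall implied by the regression that omits income.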

The results reported in table 5 for 1950 are similar to those in table 4.
The variable X is negatively, significantly related to P in five out of 10
cases, while positively in five out of 10 cases but only once significantly.
The variable Y is negatively related to P in eight out of 10 cases, three of
them significantly, while positively related twice, once significantly.
Corrected $R^2$ is increased in six out of 10 cases and decreased in only two
cases. The elasticity of P with respect to X is quite similar when Y is
present. The elasticity with respect to Y varies mainly between -0.1
and -0.5, as in 1940.

The improvements from including Y are somewhat better in 1950 than
in 1940. Since unemployment was much more widespread in 1940, it is
understandable that reported incomes would not be an accurate indicator
of permanent incomes and might therefore produce less noticeable results.
On the whole, however, it appears that introducing the variable Y was
worthwhile. The results are generally as expected.

Finally, to enlarge the sample size I have pooled the results for 1940
and 1950 occupations whenever data are available for each. The results
are presented in table 6. The regression coefficient for the applicant-stock
ratio is significant in only two of seven cases, but both those times it is
negative. The $R^2$ for these two (dentists, and real estate brokers and
salesmen) is .44 and .65, respectively, and highly significant. The size of the




TABLE 5

P = A + BX + CY, 1950

[Table body not recoverable from the source.]


TABLE 6

COMBINED 1940 AND 1950 RESULTS: P = A + BX + CY

Occupation                   A (T(A))          B (T(B))           C (T(C))             df   R^2   F          E_PX
02. Architects               0.662 (4.07)***   -0.249 (-1.18)     -0.00000 (0.30)      14   .00    0.79      -0.13
03. Attorneys                0.793 (12.38)***   0.113 (0.73)      -0.00000 (-1.45)*    15   .00    1.06       0.02
08. Dentists                 0.987 (23.84)***  -4.787 (-5.53)***   0.00001 (2.34)***   41   .41   16.21***   -0.23
10. Professional engineers   0.858 (8.04)***    0.006 (0.72)      -0.00005 (-1.63)**   35   .03    1.60       0.01
15. Pharmacists              0.833 (13.66)***   0.022 (0.14)      -0.00000 (0.08)      32   .00    0.01       0.00
16. Physicians               0.965 (21.80)***   0.038 (0.25)      -0.00001 (-1.18)     23   .02    0.72       0.00
17. Real estate brokers and
    salesmen                 0.911 (7.78)***   -0.627 (-6.04)***   0.00001 (0.26)      20   .61   18.23***   -0.40

Note: P = pass rate; X = (applicant-stock ratio) excess demand indicator; Y = average income of practitioners
in a state. Regression coefficients are reported with t-values in parentheses.
* Significant at 10% level
** Significant at 5% level
*** Significant at 1% level


regression coefficients in these two cases is approximately the same as
before. The income variable is negatively significant twice (for attorneys
and professional engineers) and is positively significant once (for dentists).
The elasticity estimate for dentists and for real estate brokers and
salesmen is -.23 and -.40, respectively, similar to previous findings.


Conclusion 

With a rather simple model of the short-term behavior of licensing boards
we are able to explain a substantial part of the variation in the pass rate of
licensing boards across states for nearly half of all the cases studied in 1940
and 1950. The findings indicate that a 10 percent increase in excess demand
generates a decrease in the pass rate varying primarily from 1 percent to
10 percent depending on the occupation, and that a 10 percent increase
in average practitioner incomes produces up to a 10 percent decrease in
the pass rate. This evidence tends to confirm the notion that the power of
licensing boards is often used to prolong the period of higher incomes
resulting from increases in excess demand for the services of the occupation
in question and that the instrument then used to accomplish this purpose
is alteration of the pass rate on the licensing examination.

It is disturbing, however, that my model has no explanatory power at
all in approximately half of all the cases considered. I have made only a
first step in gathering evidence to support the notion that licensing of









occupations is not in the public interest. My hope is that this avenue of
investigation can be pursued further when better, more reliable, and
more comprehensive data become available. If this study speeds the
accumulation, publication, and analysis of such data, it will have
accomplished a great deal.


References 

Alchian, A. A. “Information Costs, Pricing, and Resource Unemployment.”
Western Econ. J. (June 1969), pp. 109-28.

“American Trade Unions and the Problem of Unemployment.” Monthly Labor
Rev. 26, no. 3 (March 1928): 483.

Arrow, K. J., and Capron, W. M. “Dynamic Shortages and Price Rises: The
Engineer-Scientist Case.” Q.J.E. 73, no. 2 (May 1959): 292-308.

Benham, L.; Maurizi, A.; and Reder, M. W. “Location, Migration, and
Remuneration of Medical Personnel: Physicians and Dentists.” Rev. Econ.
and Statis. 50, no. 3 (August 1968): 332-47.

Council of State Governments. Occupational Licensing Legislation in the States.
Chicago, 1952.

———. Occupations and Professions Licensed by the States, Puerto Rico, and the Virgin
Islands. Chicago, 1968.

Friedman, M. “Occupational Licensure.” In Capitalism and Freedom. Chicago:
Univ. Chicago Press, 1962.

Friedman, M., and Kuznets, S. Income from Independent Professional Practice. New
York: Nat. Bur. Econ. Res., 1954.

Gellhorn, Walter. “The Right to Make a Living.” In Individual Freedom and
Governmental Restraints. New York: Greenwood, 1968.

Greene, Karen. Occupational Licensing and the Supply of Nonprofessional Manpower.
Washington: Department of Labor, 1969.

Hicks, J. R. A Theory of Economic History. London: Oxford Univ. Press, 1969.

Kuh, Edwin, and Meyer, John. “Correlation and Regression Estimates When the
Data Are Ratios.” Econometrica 23 (October 1955): 400-416.

Lipsey, Richard G. “The Relation between Unemployment and the Rate of
Change of Money Wage Rates in the U.K., 1862-1957.” Economica 28, no. 105
(1960): 3-31.

Moore, Thomas G. “The Purpose of Licensing.” J. Law and Econ. (October 1961),
pp. 93-117.

Reder, M. W. “Job Scarcity and the Nature of Union Power.” Indus. and Labor
Relations Rev. 13, no. 3 (April 1960): 349-62.

Shimberg, Benjamin. A Pilot Study to Determine the Feasibility of Investigating
Nationally the Impact of Licensing Practices on the Availability and Mobility of
Nonprofessional Manpower in Occupations where Skill Shortages Exist. Princeton, N.J.:
Educ. Testing Service, 1968.

Shimberg, B.; Esser, B.; and Kruger, D. Occupational Licensing: Practices and
Policies. Washington: Public Affairs Press, 1973.

Slichter, Sumner H. Union Policies and Industrial Management. Washington:
Brookings Inst., 1941.

Vaughan, M. R. Occupational Licensing in Wisconsin. Madison: Wisconsin
Legislative Reference Bur., 1963.



On Equalizing the Distribution of Political 
Income 


Dennis C. Mueller
Cornell University

Robert D. Tollison
Texas A&M University

Thomas D. Willett
U.S. Treasury


I. The Problem 

Individuals obtain utility from the consumption of both public and private
goods. There has been considerable interest recently in how money
income, the power to purchase private goods, should be distributed
(Fair 1971; Thurow 1971). In this paper we explore the complementary
question of how political income, or voting rights, should be distributed
to obtain a desirable distribution of individuals’ utility from public goods.

It is important to distinguish two concepts of equality in the discussion
of public policy which we will employ to discuss the distribution of political
income.¹ Ex ante equity refers to fairness in the sense that risks or
opportunities before the fact are equalized. Ex post equity is fairness in the
sense of actual outcomes of social processes involving risk and uncertainty.
In most cases ex post equity is probably of greater importance than ex
ante equity, although, as we shall discuss, equalitarians have curiously
failed to extend considerations of equity in the distribution of political
income to an ex post setting.² There are, however, cases where ex ante
equity would be more important. For instance, this would be the case
where equality of opportunity is the objective or where it may be more
desirable to substitute a mechanism to achieve ex ante equity (or some
combination of the two concepts) because ex post equity is impossible or
We are grateful to James M. Buchanan for useful comments. Any remaining errors are
our responsibility.

¹ For an extended treatment of the two concepts of equity, see Pauly and Willett (1972)
and the references cited there.

² Note, however, the following quotation from a paper by Kolm (1971, p. 8), translated
from the French: “Americans are not interested in social justice, at least in the sense in
which we understand it in France (Léon Walras already wrote: ‘Justice is a Greco-Latin
and French idea’). In particular, the notion of egalitarianism in results, rather than in
opportunities, has much less appeal for them than for us.”



too costly to achieve. We will argue that the lack of consideration of
establishing equality in the ex post distribution of political income cannot be
explained by either the impossibility of achieving such an outcome or the
high costs of pursuing such a goal.

Political equality is generally thought to be more feasible and desirable
than economic equality. Economic equality, for example in Lerner’s
sense of complete ex post equality of utility distribution, is typically
expected to generate serious excess burden and disincentive effects, since
the generation of private income takes a large part of an individual’s time,
that is, the elasticity of supply is higher.³ Thus, emphasis in achieving
greater equality of the private income distribution focuses on a mix of
mechanisms which trade off some degree of ex post inequality in private
incomes against the disincentive effects on the total amount of private
income generated. For example, mechanisms such as fair employment
laws can be viewed as generating greater ex ante equity, or equality of
opportunity, while a progressive income tax can be viewed as generating
greater ex post equality.

Discussion of the analogous problem of the distribution of political
income has been characterized primarily by the discussion of generating
mechanisms to achieve ex ante equality of opportunity to participate
(e.g., one man, one vote). Perhaps the primary reason that this form of
ex ante equality has been stressed in the literature and institutions of
democracy is that it is considered a basic sort of right, or value judgment,
which goes hand in hand with political democracy (Vlastos 1962; Tobin
1970).⁴ A second important reason for one man, one vote could be that
there are only minor disincentive effects associated with guaranteeing this
form of political equality.⁵

Given the relative lack of disincentive effects associated with
mechanisms for reallocating voting rights, it is interesting that the discussion

³ For Lerner’s proposal and some discussion of it, see Lerner (1935, 1970), Breit and
Culbertson (1970, 1972), Morrison and Higgins (1971), McCain (1972), and McManus,
Walton, and Coffman (1972).

⁴ Tobin (1970, p. 269) also argues that “obviously an equalitarian distribution of the
vote can be enforced without any loss of efficiency. The aggregate supply of votes is
intrinsically inelastic.” This is an overstatement of the issue, since although voting can be
enforced or subsidized and in the extreme all would vote, this does not guarantee that
votes are of any given level of informational quality. We have discussed elsewhere some
ways out of this dilemma. See Mueller, Tollison, and Willett (1971, 1972a, in press).

⁵ For example, an alternative distribution of the right to participate in the political
process might be to give votes proportional to years of formal education, i.e., some sort
of scheme of weighted voting rights (Vlastos 1962). Counteracting the need for this sort
of system, however, are such factors as the effect of education on one’s perception of his
“obligation” (elasticity) to participate in politics, which tend to lead to very few observed
disincentive effects for educated voters under a system of full, ex ante political equality
with everyone voting and weighted equally. Thus, on practical grounds political equality
of this sort could be defended. For empirical evidence on this point, see Milbrath (1965).
For a discussion of the economic theory of voting behavior, see Tollison and Willett
(in press).




of political equality has never seriously considered establishing an ex post
equality of the distribution of political income.⁶ In markets for private
goods, each voter-consumer allocates his income so that the expected
marginal utility from each dollar vote on a good is equal. Planned
consumption at equilibrium prices is identical with actual consumption. In
the absence of mistakes due to lack of sufficient information, this assures
equality between expected and actual utility. These conditions are not
likely to hold where public goods are allocated via one-man, one-vote
majority rule. The utilities a voter expects from each vote may vary widely.
And there can exist a large divergence between what he expects (hopes?)
to receive in public goods as indicated by his voting decisions and what
he actually receives after all votes are cast and counted.

These two characteristics of voting have been typically described as
giving rise to problems of “minority rights” or the “tyranny of the
majority.” It is important to note, however, that these characteristics
create two separate problems, and thus require separate remedies. Let us
illustrate their differences with an example.

Suppose a community plans to build a town hall. A referendum is held
on whether the building should be of Gothic or Greek architecture.
Fifty-one percent of the voters have a slight preference for the Gothic style; the
other 49 percent are strongly opposed. On a straight majority vote, the
Gothics win even though their utility gain is far outweighed by the
minority’s loss. We can describe this as a situation violating the principle
of ex ante equality. Each citizen’s vote does not represent an equal
potential utility change.

Consider next the case where each citizen’s vote does represent an equal
potential utility gain. Then a simple majority rule will maximize the net
utility gain for the community on this decision. Suppose, however, the
town grows and new public buildings are erected. Each time the town
votes on the architecture of the next building, and each time the
pro-Gothics win. Every decision will be efficient in the sense that the utility
gains of the winners exceed the losses of the minority, but one might well
regard the long-run outcome as inequitable, as all the utility gains accrue
to the pro-Gothics (ignoring the utility losses both groups might
experience from mixing Gothic and Greek revival buildings). Thus, even where
there is full ex ante political equality, there is still the possibility of ex post
inequality of the utility income derived from the political sector. A person
consistently in the minority may receive ex post a far smaller share of the
gains generated by public collective action, even where there is ex ante

⁶ Braybrooke (1968) is a partial exception. In his proposed tests for whether actual
governmental systems conform closely enough in practice to the theoretical norms of
democratic theory to be approximately labeled democracies, Braybrooke suggests the
criteria that no voter be in the minority more than 90 percent of the time and that not
more than 10 percent be in the minority more than two-thirds of the time (p. 171).




equality over all voters.⁷ Indeed, we would expect considerable ex post
inequality of political outcomes to be the general case, rather than the
exception.

Can political systems be devised to achieve both ex ante and ex post
equality of political income? In the following section we consider
mechanisms which seek to achieve this objective.


II. Solutions

The problem created by differing intensities of preference among voters
can be solved by introducing point voting. Since point voting has been
discussed in detail elsewhere, we shall only briefly outline it here
(Musgrave 1959, pp. 130-31; Mueller, Tollison, and Willett 1971, in press).
To begin with, voters and issues must be grouped into polities such that
in any given polity each voter has an “equal stake” (in terms of his
potential aggregate utility change) in the set of issues decided by the polity.⁸
That is, governmental units are broken up into, say, neighborhood, city,
regional, and national jurisdictions, so that the integral of voters’ utility
distributions over the issue set to be decided by a given polity is
approximately equal. Each member of a polity is given an equal number of
vote points to be allocated over the set of issues to be decided by the polity.
If the voters honestly allocate the vote points in proportion to their relative
intensities of preference on all issues (i.e., they do not engage in strategic
behavior), the point allocations will reflect each voter’s marginal rates of
substitution between issues.⁹ If the equal-stake criterion is met, the total
expected utility of each voter in a polity will be equal, and the outcomes
obtained from aggregating the point-vote allocations will maximize the
utility gains of the community from voting. The equal-stake criterion
produces ex ante political equality for each voter over the set of issues,
while point voting allows each voter to achieve ex ante equality on every
issue by revealing his relative intensities of preferences over the issue set.
Together they form a system for achieving ex ante political equality for
each voter on every issue.
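
The contrast between simple majority rule and point voting can be made concrete with the town hall referendum described earlier. The sketch below is illustrative only: the 100-voter electorate, the point values of 1 and 5 placed on this issue, and the function names are assumptions chosen to mimic a slight majority preference and a strong minority preference. None of these numbers comes from the text, and honest (nonstrategic) allocation is assumed.

```python
from collections import Counter

def majority_winner(ballots):
    """Each ballot counts once for its preferred option, whatever the intensity."""
    tally = Counter(option for option, _points in ballots)
    return tally.most_common(1)[0][0]

def point_vote_winner(ballots):
    """Each ballot adds the vote points the voter chose to place on this issue."""
    tally = Counter()
    for option, points in ballots:
        tally[option] += points
    return tally.most_common(1)[0][0]

# Hypothetical electorate: 51 voters mildly favor Gothic (1 point each),
# 49 voters strongly favor Greek (5 points each).
ballots = [("Gothic", 1)] * 51 + [("Greek", 5)] * 49

majority_choice = majority_winner(ballots)   # head count: 51 vs. 49
point_choice = point_vote_winner(ballots)    # point totals: 51 vs. 245
```

Under these assumed intensities the head count favors the Gothic style while the point totals favor the Greek style, which is the divergence that point voting is meant to remove.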

There are two contexts in which one can conceive of equalizing the
distribution of political income ex post. First, one could adjust each
individual’s vote stock at the beginning of a voting period, depending
upon his success over all previous periods. Suppose that in the first round
an individual won on issues totaling 90 percent of his vote points, while


⁷ The point here is similar to Arrow’s (1971) discussion of input and output
equalization for public expenditures. Also, evidence from simulation studies of the gains from
trade from vote trading indicates that the quantitative importance of the problem of
political income distribution may be significant (Mueller, Philpotts, and Vanek 1972).

⁸ For analytical discussion of the equal-stake criterion, see Mueller (1971).

⁹ Mueller (1973) discusses the conditions under which point voting proceeds honestly.




the average voter won only 70 percent. If the initial stock were 100 votes,
the individual would be taxed the 20 points by which his winnings exceeded
the average and would be given 80 votes at the start of period 2. If he again
won on 90 percent, he would win on 72 vote points, and his tax for the
next period would be two votes. He would thus have 78 votes for period 3,
and as long as he continued to win on 90 percent of his vote-point
allocations, he would be winning on issues totaling about 70 vote points, the
average for the community. If, on some round, either his tastes or the
community’s changed so that he only won on half of his allocations, his total
successful allocations would fall to 39. In the next round his cumulative
overall tax would be 20 + 2 - 31 = -9, and he would receive 109
votes. If he continued to win on only 50 percent of his vote-point
allocations, his vote holdings would continue to rise until he began receiving
140 vote points, assuming the average voter was still winning on 70 vote
points.¹⁰
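
The arithmetic of this lagged, cumulative vote tax can be traced with a short sketch. The function name and parameter names are hypothetical, and the sketch assumes, as the example does, a base stock of 100 votes and a community average of 70 winning vote points that stays fixed from period to period.

```python
def lagged_vote_tax(win_rates, base_stock=100.0, average_wins=70.0):
    """Return the vote stock at the start of each period and after the last one.

    win_rates[t] is the share of the voter's points placed on winning issues
    in period t. The tax is cumulative: every period's deviation from the
    community average is added to a running total, which is deducted from
    (or, when negative, added to) the base stock.
    """
    cumulative_tax = 0.0
    stocks = []
    for rate in win_rates:
        stock = base_stock - cumulative_tax
        stocks.append(stock)
        cumulative_tax += rate * stock - average_wins
    stocks.append(base_stock - cumulative_tax)
    return stocks

# The sequence in the text: win 90 percent twice, then only 50 percent.
text_example = lagged_vote_tax([0.9, 0.9, 0.5])

# A voter who keeps winning on only half of his allocations.
long_run_loser = lagged_vote_tax([0.5] * 60)[-1]
```

With these inputs the stocks follow the path 100, 80, 78, 109 given in the text (up to floating-point rounding), and the stock of the persistent 50 percent winner approaches 140, the level at which half of his holdings equals the community average of 70.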

This procedure is a compromise between the ex ante equalitarian
condition embodied in the equal-stake criterion, and a full ex post equality
of political income as described below. The stability of voter preferences,
and therefore of voter winning percentages, is relied upon to adjust their
vote holdings so that each voter’s expected successful vote allocations,
based on past outcomes, are the same for the coming round. (If the voters’
winning percentages are not stable over time, there is no long-run tyranny
of the majority problem, and hence no need to adjust individual vote
stocks.) In the above example, the voter would continue to get 78 votes as
long as his winning percentage stayed at 90, because he would then
continue to win on issues totaling 70 vote points, the community’s average.
No adjustments would ever be made, however, for the 22 “extra” vote
points he won in the first two rounds. He is always “ahead of the game”
by that amount. Alternative tax-subsidy schemes could be designed to
achieve equal cumulative ex post political income, but these would have
the serious disadvantage of requiring a constant potential total utility
gain for each voter from period to period.¹¹


¹⁰ One might think at first glance that the tax would not have to be the integral of all
past deviations from the median, but just the divergence since last period. But such a
system does not converge to the median, since the voter is only taxed the incremental
divergence of winning or losing votes from the median in the immediately preceding
period. In the above example the individual’s tax at the end of two periods would only
be two votes, allowing him a total of 98 votes as opposed to 78 votes where the integral
of differences over all past votes is taken.

¹¹ This problem could possibly be handled by introducing equal increments of votes:
say each voter in the polity gets 10 additional votes in the period in which a new
public-good issue (of importance roughly equal to 10 percent of the normal total) is introduced.
Then the vote-tax system can continue, and “new money” in the form of the extra votes
will keep “purchasing power” proportionate to total income. Such a solution would
follow, of course, if voting periods were time dependent. An alternative way to meet this
problem is to make voting periods not time dependent, but dependent on the total value
of issues, e.g., vote every x issues worth of political income. However, the problem of
holding the total utility gain constant for each voter from period to period would involve




No such restrictive assumption is required for the lagged vote-tax
scheme, since its goal is merely to equate each voter’s expected winning
number of vote points in the coming period. By basing these expectations
on each voter’s past history of successes, and taking into account the
stability of these winning percentages, individual vote stocks could be
adjusted to bring ex ante and ex post political income into equality over
time.¹² The voter who has done badly in the past can only write these
losses off, and console himself with the knowledge that he may expect a
more equal share of public-good pie in the future, whatever the size of that
pie might be.¹³

Alternatively, one might decide to equalize the realized utility gains
over the set of issues a polity decides at any point in time, by a system of
taxes and subsidies on voting rights and a sequential, nonbinding series
of votes on issues. Such a system could achieve full ex post equality of
political income. Initial votes would be tentative, subject to the
vote-income distribution that resulted from voting. As an example, suppose
that after the initial vote the set of outcomes would be such that each
voter would have on average decisions corresponding to 70 percent of
their vote-point allocations satisfied. If a particular voter won on a set of
issues corresponding to 90 percent of his vote-point allocations after the
first round, then his total vote holdings would be reduced for the second
round of voting. Thus, vote stocks would be adjusted by taxing
abnormally high winners (say a tax of 20 votes on the 90 percent winners) and
subsidizing losers, after which the vote points would be cast again. Over a
series of recontracts with adjusted stocks of vote points, the gains from
winning and losing would be equalized. To avoid excessive recontracting


gains and losses analogous to the windfall gains and losses that accrued in the first few
periods to above-average winners and losers. That is, changes in the size of the pie would
create additional gains and losses until the lagged vote-tax system equated expected
voters’ winnings over a longer run. To handle either of these problems neatly would
probably make the vote-tax system unduly complicated and also raise additional
conceptual problems.

¹² Of course, where there are strong differences in preferences on an issue (i.e., an
intense majority facing an intense minority), the system of political taxation may never
succeed in equalizing income. For that matter, the equal-stake criterion may not be
satisfied. Such issues should rationally be decided under conditions of equal ignorance
about the future in a constitutional period, and if agreement cannot be reached there,
then separate states should be set up. See, in particular, Mueller (1971, 1973) and
Mueller et al. (1971, 1972a, 1972b, in press).

¹³ Problems exist in applying the lagged vote-tax system to a representative body. If
the legislators represent geographic units, a newly elected member could simply inherit
the stock of vote points of his predecessor. This would introduce a certain amount of
instability if newly elected representatives had significantly different tastes than their
predecessors. Redistricting would also cause problems, as would a proportional
representation scheme as discussed in Mueller et al. (1971, in press). These problems vanish if
immediate recontracting is employed.





under this scheme, one might be satisfied with bringing everyone within
five or 10 points of the average utility gain.¹⁴

Thus, as is illustrated by these examples, operational systems of vote
taxation that attain greater equality of political income can be devised
and applied in representative democracies. The importance of such a
mechanism in a well-working model of representative democracy lies in
its ability to reduce or eliminate the potential for a
tyranny-of-the-majority problem to persist over a long period of time.


III. Summary

This paper has discussed possible mechanisms for achieving ex ante and
ex post equality of political income. While any of the structural reforms
could be implemented separately, it makes the most sense to combine
them. Indeed, the “equal-stake” criterion is to some extent a necessary
condition for the implementation of a redistributive vote tax. A voter’s
winning percentage on an issue set cannot be regarded as an index of his
political income unless each vote has an equal value ex ante to all voters.¹⁵

The idea of the state arising from a “social contract” formed by all
citizens to achieve collective ends can imply the concepts of both ex ante
and ex post political income equality. Of the two, only ex ante equality
has received attention in the literature. Yet, to those who embrace the
equalitarian ethic, the ex post equality of political income should be of
even greater importance than ex ante equality of voting rights. This paper
argues that the attainment of ex post equality of the political income
distribution is conceptually plausible and quantitatively important.

Further, a policy of vote taxation does not appear to be subject to as
serious disincentive effects as taxation to obtain complete equality of
ex post private economic income. This follows by analogy to elasticity
conditions: voting takes very little time as opposed to the length of
the work week, for example, which is what attempts to equalize realized
private income must tax.

There do not appear to be any functional equivalents under present
democratic systems to achieve ex post equality of political income.¹⁶

¹⁴ The recontracting procedure can be simplified by the use of a computer. The voter
need reveal only once the percentage of his vote stock he assigns to each issue (i.e., his
relative intensities over the issue set). The computer can then adjust stocks to bring each
voter’s realized gains into equality.

¹⁵ It should also be noted that the application of the vote-tax system relaxes the
problem of assuming that identical tastes obtain for the distribution of public goods in models
of simultaneous determination of optimal distribution and optimal allocation with both
private and public goods. See, e.g., Winch (1969, 1970) and Roberts (1970).

¹⁶ Approximations in the private sphere can be found in the way competition is set up
in some sports. Perhaps the handicapping system in golf is the best example corresponding
to the vote tax. However, such institutions as the player draft in professional sports show
much the same purpose as the vote tax in the sense that successful teams are penalized




Indeed, taking the United States as an example of a two-party,
geographic-based democratic system, one could argue that there are perverse
mechanisms, such as monopolistic systems of committee power and the like,
which tend to lead to greater inequality of realized utility for many
voters. Many voters from lower income-education classes do not vote
under the present system, which, although it does not mean that these
voters do not receive political incomes, does mean that the equalization of
political income would be a difficult task without also implementing a
thorough reform of public choice mechanisms to insure efficient revelation
of all voter preferences in polities defined on the equal-stake criterion.


References 

Arrow, K. J. “Equality in Public Expenditure.” Q.J.E. 85, no. 3 (August 1971):
409-15.

Braybrooke, D. Three Tests for Democracy. New York: Random House, 1968.

Breit, W., and Culbertson, W. P., Jr. “Distributional Equality and Aggregate
Utility: Comment.” A.E.R. 60, no. 2 (June 1970): 435-41.

-----. “Distributional Equality and Aggregate Utility: Reply.” A.E.R. 62,
no. 3 (June 1972): 501-2.

Fair, R. C. “The Optimal Distribution of Income.” Q.J.E. 85, no. 4 (November
1971): 551-79.

Kolm, S. C. “Justice et équité.” Mimeographed. Paris: CEPREMAP, 1971.

Lerner, A. P. The Economics of Control. New York: Macmillan, 1935.

-----. “Distributional Equality and Aggregate Utility: Reply.” A.E.R. 60,
no. 2 (June 1970): 442-43.

McCain, R. A. “Distributional Equality and Aggregate Utility: Further
Comment.” A.E.R. 62, no. 2 (June 1972): 497-500.

McManus, M., Walton, G. M., and Coffman, R. B. “Distributional Equality and
Aggregate Utility: Further Comment.” A.E.R. 62, no. 2 (June 1972): 489-96.

Milbrath, J. Political Participation. Chicago: Rand-McNally, 1965.

Morrison, C. C., and Higgins, R. S. “On Distributional Equality.” Southern
Econ. J. 38, no. 4 (October 1971): 260-62.

Mueller, D. C. “Fiscal Federalism and Constitutional Democracy.” Public Policy
19 (Fall 1971): 567-94.

-----. “Constitutional Democracy and Social Welfare.” Q.J.E. 87, no. 1
(February 1973): 60-80.

Mueller, D. C., Philpotts, G., and Vanek, J. “The Social Gains from Exchanging
Votes.” Public Choice 13, no. 2 (Fall 1972): 55-79.

Mueller, D. C., Tollison, R. D., and Willett, T. D. “Solving the Intensity
Problem in Representative Democracy.” Mimeographed. Cornell Univ., 1971.


after the season with lower draft picks in an effort presumably to make next season’s
competition more equal ex ante. The ability of teams to trade personnel and their lack
of ability to forecast perfectly future players’ performance hinders the attainment of full
ex ante equality (in the sense that each team becomes equally likely to win the title) over
time in such sports (thus the Green Bay Packers, Boston Celtics, and so forth). However,
there do not appear to be any equivalent reasons to think that the vote tax would
operationally generate long winning or losing streaks. Indeed, it is to avoid such winning
streaks in the political process that one would advocate a vote tax.




-----. “Representative Democracy via Random Selection.” Public Choice 12,
no. 1 (Spring 1972): 59-68. (a)

-----. “The Utilitarian Contract: A Generalization of Rawls’ Theory of
Justice.” Mimeographed. Cornell Univ., 1972. (b)

-----. “A Normative Theory of Representative Democracy.” In Quantitative
Methods of Representation. New York: New York Acad. Sci., in press.

Musgrave, R. The Theory of Public Finance. New York: McGraw-Hill, 1959.

Pauly, M. V., and Willett, T. D. “Two Concepts of Equity and Their
Implications for Public Policy.” Soc. Sci. Q. 53 (June 1972): 8-19.

Roberts, D. J. “Pareto, Public Goods, and Politics.” Canadian J. Econ., no. 3
(November 1970), pp. 615-16.

Thurow, L. “The Income Distribution as a Pure Public Good.” Q.J.E. 85, no. 2
(May 1971): 327-36.

Tobin, J. “On Limiting the Domain of Inequality.” J. Law and Econ. 13 (October
1970): 263-79.

Tollison, R. D., and Willett, T. D. “Some Simple Economics of Voting and
Non-Voting.” Public Choice (in press).

Vlastos, G. “Justice and Equality.” In Social Justice, edited by R. B. Brandt. New
York: Prentice-Hall, 1962.

Winch, D. M. “Pareto, Public Goods and Politics.” Canadian J. Econ., no. 2
(November 1969), pp. 492-508.

-----. “Pareto, Public Goods, and Politics: Reply.” Canadian J. Econ., no. 3
(November 1970), pp. 616-17.



Some Principles of Inflationary Finance 

Donald A. Nichols 
University of Wisconsin


It is well known that a government can use inflation instead of taxation
as a means of financing its expenditures.¹ The amount of inflationary
finance possible has been shown to depend on the demand for money.
Previous work, however, has ignored the fact that a government can shift
the location and elasticity of that demand curve through the use of various
controls and regulations, and thereby change the level of inflationary tax
receipts that is associated with a given level of inflation. I will argue here
that exchange controls, interest rate restrictions, and debt management
policies all affect the demand for money and the possibilities for
inflationary finance. Since a well-established model of inflationary finance exists
in the literature cited, I will confine myself to an analysis of the links
between the policies in question and the demand for money, and will
point out that both an increase in the demand for money and a reduction
in its elasticity with respect to the nominal interest rate will increase
inflationary tax possibilities in the traditional model.²

I 

The demand for money obviously depends on whether good substitutes
exist. With good substitutes easily available, it is likely that the demand
for money will be less than otherwise and more elastic with respect to the
rate of interest.³ Thus we are led to our first principle, which I believe is
the rationale for many of the capital market impediments we observe in
many countries today.

The inflation that results from a given government deficit will be lower than
otherwise if the availability of close substitutes for government money is curtailed.
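
One way to see this principle at work is a small numerical sketch. It rests on assumptions that are not in the text: a semi-logarithmic (Cagan-type) money demand m = scale * exp(-alpha * inflation), zero real growth, and a deficit financed entirely by money creation, so that steady-state revenue is inflation times real balances. The function names and all numerical values are hypothetical.

```python
import math

def inflation_tax_revenue(inflation, scale, alpha):
    """Steady-state revenue (share of output) = inflation rate x real balances."""
    return inflation * scale * math.exp(-alpha * inflation)

def inflation_for_deficit(deficit, scale, alpha, tol=1e-12):
    """Lowest steady-state inflation rate whose revenue covers the deficit.

    Under the assumed demand curve, revenue rises with inflation up to the
    rate 1 / alpha, so bisection on [0, 1 / alpha] finds the lower root.
    """
    peak = 1.0 / alpha
    if deficit > inflation_tax_revenue(peak, scale, alpha):
        raise ValueError("deficit exceeds maximum inflationary revenue")
    lo, hi = 0.0, peak
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if inflation_tax_revenue(mid, scale, alpha) < deficit:
            lo = mid
        else:
            hi = mid
    return hi

# Hypothetical economy: deficit of 2 percent of output.
baseline = inflation_for_deficit(0.02, scale=0.2, alpha=2.0)
less_elastic = inflation_for_deficit(0.02, scale=0.2, alpha=1.0)   # substitutes curtailed
larger_demand = inflation_for_deficit(0.02, scale=0.3, alpha=2.0)  # demand curve shifted out
```

Under these assumptions, both a less elastic demand curve (smaller alpha) and a larger demand for money (larger scale) lower the inflation rate needed to finance the same deficit, which is the direction the principle asserts.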

The author gratefully acknowledges the financial support of the National Science
Foundation and the Graduate School of the University of Wisconsin. The research was
completed while the author was a visitor at the London School of Economics. He wishes
to thank Harry Johnson and Marcus Miller for their useful comments on an earlier draft
of this work.

¹ See, for example, Bailey (1956), Mundell (1965), Marty (1967), Sidrauski (1967),
and Friedman (1971).

² The model is spelled out in detail in a previous version of the paper available from
the author.

³ See Tobin and Brainard (1963) for a discussion of monetary policy where good
substitutes for money exist.




One good substitute for domestic money is foreign exchange. If we
assume the existence of flexible exchange rates, long-run equilibrium
requires that the exchange rate change at a rate equal to the difference
between the rates of inflation of the domestic and the reserve currencies.
One way for a money holder to avoid the inflationary tax in this situation
is to hold all money balances in foreign currency and to use domestic
currency only when it is absolutely necessary. Thus with foreign currency
a good substitute for domestic currency, an immediate corollary of the
above principle is:

The rate of inflation will be lower if the holding of foreign currency is restricted
or prohibited.

The exchange controls needed to police this prohibition are not for
the purpose of restricting imports; free trade can still be allowed. Rather,
the purpose is to prevent the decline in the demand for domestic money,
and therefore of inflationary tax revenue, that would occur in the
presence of a free currency market. It is the holding of foreign assets that
must be curtailed, not the consumption of foreign goods.

Private assets can also compete with public ones in individual
portfolios. If the private assets bear a market-determined interest rate, and if
that monetary interest rate increases with the long-run rate of inflation,
then we would expect large portfolio shifts from currency into the
competing private assets as the rate of inflation increases. In this case again, a
higher rate of inflation will result from a given government deficit than
would occur if these private assets were restricted. One way to limit the
appeal of these competing assets is by restricting the rates of interest they
may legally bear. Once the legal ceiling is reached, further increases in
the rate of inflation will not increase the relative attractiveness of private
over public assets.⁴ Of course, an alternative to reducing the
attractiveness of private liabilities is to require that their issuers hold public currency.
Thus a second corollary emerges.

Restricting the interest rates legally payable on private assets will decrease the rate
of inflation that results from a given government deficit.

Should these restrictions be successful in curtailing the availability of
good substitutes, we would expect the demand for money to become more
inelastic with respect to the rate of inflation than it would otherwise be.
One instance that seems to support this prediction is that of Brazil in the
last two decades. Kafka (1967) reports that during this period, the
Brazilian money supply remained a reasonably constant fraction of annual
gross domestic product (about 20 percent) despite very large changes in
the rate of inflation. Capital market restrictions of the kind I discuss above
were in use at that time and may explain the inability of the Brazilian


⁴ See Tobin (1970) for a discussion of monetary policy with interest ceilings on private




money holder to escape the extremely large taxes which were levied on
his cash balances.

Thus governments need not take the demand for money as fixed when
determining their inflationary policies. They can affect it through legal
restrictions. Thus the estimates provided by Friedman (1971) of the rates
of inflation that maximize inflationary finance must be interpreted in this
light. They should be thought of as rough lower bounds of the true
estimates, since they are applicable only when legal restrictions of the kind
just mentioned are unavailable. The true revenue-maximizing rates
cannot be calculated in the absence of knowledge of the degree of
substitutability of other assets for money.


II

Consider next a government which issues two liabilities, one
interest-bearing and one non-interest-bearing. If the non-interest-bearing liability
is currency, it is interesting to ask what the characteristics of the
interest-bearing asset should be if all possibilities for inflationary finance are to
be fully exploited. The answer to that question is seen most easily in the
framework outlined by Tobin in his essay on debt management.⁵ We can
state it in the form of a principle.

A government can lower the rate of inflation caused by its deficit if its
interest-bearing debt is made to be a poor substitute for its currency and a good substitute for
other assets in private portfolios.

What characteristics of debt make it a better substitute for private
assets than for money? The answer depends on what the private assets are
like, of course, but since we have already shown that an inflationary
policy requires that private financial assets be restricted, the remaining
assets are likely to be real: either consumption or investment goods,
equities which are claims to goods, or land. We wish to give the
interest-bearing debt some characteristics which allow it to satisfy the same
portfolio objectives as goods, and to avoid those characteristics which allow
it to satisfy the same objectives as money.

The debt might be made illiquid, perhaps nonnegotiable, and of long
maturity to make it a bad substitute for money. It might be given an
intertemporal pattern of payouts which dominates that of goods for certain
purposes; a retirement annuity, for example, might be preferred to real
capital goods by many asset holders. Many different kinds of debt could
be issued, of course, to satisfy the different objectives of savers.
Interest-bearing debt should be made very useful to wealth holders yet a very poor
⁵ See Tobin (1963). This paper owes a great deal to the work of Tobin, since he, more
than anyone else, has developed the policy implications of having assets be imperfect
substitutes for each other in individual portfolios.




way of storing short-term purchasing power if the purpose of debt is
to lower the rate of inflation that results from a given fiscal deficit.

One result established in the previous literature is that deficits financed
by money cause inflation only to the extent that the rate of growth of the
money supply exceeds the rate of growth of the real economy. At any
long-run equilibrium, all forms of government debt can be expanded at the rate
of growth of output without any changes in relative asset prices. This
implies that even an interest-bearing debt will yield steady-state revenue
if the rate of growth of the economy exceeds the rate of interest the debt
bears. The interest rate on each form of debt could be chosen to maximize
the revenue derived from growth plus inflation for the total debt. Note
that a purchasing-power guarantee could make the debt preferable to
some private assets, yet if the real interest rate is kept below the growth
rate, the debt could still yield steady-state revenue.
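
The steady-state condition in this paragraph can be written out as a minimal sketch. It assumes that each instrument is held at a constant ratio to output, so that its nominal stock expands at the growth rate plus the inflation rate while interest is paid at the nominal rate; it also uses the continuous-time approximation that ignores the cross-product of the rates. The function name and the numbers are hypothetical.

```python
def steady_state_debt_revenue(debt_ratio, growth, inflation, nominal_interest):
    """Net steady-state revenue from one debt instrument, as a share of output.

    New issues each period are (growth + inflation) * debt_ratio and interest
    payments are nominal_interest * debt_ratio, so revenue is positive exactly
    when the real interest rate (nominal_interest - inflation) is below growth.
    """
    return (growth + inflation - nominal_interest) * debt_ratio

# Hypothetical economy: 3 percent real growth, 10 percent inflation.
currency = steady_state_debt_revenue(0.20, growth=0.03, inflation=0.10,
                                     nominal_interest=0.0)
low_real_rate_bond = steady_state_debt_revenue(0.30, growth=0.03, inflation=0.10,
                                               nominal_interest=0.12)
high_real_rate_bond = steady_state_debt_revenue(0.30, growth=0.03, inflation=0.10,
                                                nominal_interest=0.15)
```

In this illustration currency and the bond bearing a 2 percent real rate both yield revenue, while the bond bearing a 5 percent real rate, above the 3 percent growth rate, is a net drain.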

III

Foreign aid is sometimes mentioned as a useful device for slowing an
inflation. It should be obvious, however, that only permanent foreign aid
that grows at the rate of growth of the economy can stop a permanent
inflation. Ultimately, a country must tax itself if it wishes to finance its
government expenditure in a noninflationary fashion and if it is unable to
secure perpetual foreign aid of the appropriate size.

Foreign aid cannot stop an inflation permanently unless the aid is given forever. 

A puppet regime might be supported by perpetual foreign aid. Puppet
regimes are often too unpopular to raise tax revenue domestically; they
can stay in power only if they make a sufficient level of government
expenditure, and this must be financed from abroad.

Loans that must be repaid with the lender’s currency will have a long-run
beneficial effect on the inflation only if the interest rate charged net
of the lender’s rate of inflation is less than the rate of growth of the
domestic economy. Otherwise, steady-state repayments will ultimately
exceed receipts of new loans. Repayments obviously have the reverse
effect on the inflation to that of aid. Loans specified in the borrower’s
currency can yield steady-state revenue if the rate of growth exceeds the
stated interest rate minus the rate of domestic inflation.
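
The claim that repayments ultimately exceed new receipts unless growth
exceeds the real interest rate can be illustrated with a small simulation.
This is only a sketch under assumptions that are not in the text: each loan
has a one-year maturity, and new borrowing grows at the rate of growth of the
economy. The function name net_inflows and its parameters are hypothetical.

```python
def net_inflows(first_loan, growth, real_interest, years):
    """Yearly receipts of new loans minus repayments of last year's loan.

    Illustrative assumptions: one-year loans, and new borrowing that grows at
    the economy's growth rate. real_interest is the stated rate net of the
    inflation rate of the currency in which the loan is denominated.
    """
    flows = []
    previous = first_loan
    for _ in range(years):
        new_loan = previous * (1 + growth)
        repayment = previous * (1 + real_interest)
        flows.append(new_loan - repayment)
        previous = new_loan
    return flows


helpful = net_inflows(100.0, growth=0.05, real_interest=0.03, years=10)
harmful = net_inflows(100.0, growth=0.03, real_interest=0.05, years=10)
```

With growth above the real rate every year brings a net inflow; with the
rates reversed every year brings a net outflow, which works against the
borrower like aid in reverse.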

IV 

I have ignored so far the question of private capital accumulation. I have
been concerned only with a government that wishes to find some means of
financing its own given level of projects. Briefly, I will now sketch in some
of the considerations that should influence the government that is
interested in furthering certain private forms of investment as well. Those




underdeveloped countries, for example, for whom much of this analysis
is most relevant wish that the public investment projects they undertake
replace private consumption rather than private investment. In this case,
it is not sufficient to worry about only minimizing the rate of inflation
that is caused by the government expenditure, since other goals are
important as well.

I will concern myself here only with private investment. Mundell (1965)
and Marty (1967) have analyzed elsewhere the case in which the government
chooses its financing as a function of the investment opportunities it
faces. Clearly, in that case it must also consider the extra future revenue
to be yielded by a project when determining the profitability of further
investment and finance. Then it is the rate of return on public projects
which becomes the maximum interest rate payable on government debt,
not the rate of growth.*

In the analysis above, we were concerned with the degrees of
substitutability in private portfolios between goods in general and different
forms of government debt. If private investment in producer durables is
also a government goal, then the characteristics of the various forms of
debt it issues should be chosen so as to make them substitute for
consumption goods, not investment goods. In this case, debt should be tailored
so as to fit well in private portfolios that otherwise would hold no plant and
equipment or other assets that traditionally finance plant and equipment.
These portfolios are likely to be the smaller ones.

To illustrate, consider the different effects on investment of two
alternative forms of debt the government might issue: (a) a small-denomination
retirement annuity bearing a very low (perhaps slightly negative) real
rate of interest, or (b) a large-denomination bond bearing a monetary
interest rate above the legally allowed rate on private debt. Both assets
a and b might be better substitutes for goods than money, and therefore
either of them could be used to finance the government’s projects. But
asset a is likely to replace the purchase of gold, transistor radios, or motor
bikes, while asset b might well replace bank deposits, private loans, or
other debts of productive organizations. Clearly, asset a is the one the
government should issue if it is concerned with the level of private plant
and equipment investment. Debt management policy should make the
government debt substitute as little as possible for funds that would be
spent on other socially desirable projects.

It should be noted that the government could also lend money to any
worthwhile organization it feared had been harmed by its financial
policies. Such loans have the reverse effect of government borrowing, of

* Since the proceeds of debt finance in my analysis are to be used to reduce taxes, it is
the rate of growth rather than the rate of return which limits the interest rate payable
on government debt if it is to yield revenue in the steady state. See Diamond (1965) and
Cass and Yaari (1966) for proofs of this in different contexts.




course, but as part of a whole program, they may be counterinflationary.
That is, if the willingness to make such loans enables the government to
issue a form of debt it would otherwise avoid, the inflation may be
countered by the lending policy. With interest rate restrictions, private
borrowers profit from an inflation too, of course, so the willingness to lend to
certain productive organizations is one way of providing them with the
incentive of a subsidy. Alternatively, should certain forms of debt be used
traditionally to finance undesirable projects, it is these that the
government should restrict and replace by its own debt.

One of the common victims of inflation is the savings of the poor. Indeed
one could interpret the analysis above as a set of devices whose purpose is
to marshal the savings of the poor for national development purposes. But
it may be true that the poor’s elasticity of saving with respect to the real
rate of return is so great that saving goes to zero once a large inflation is
permanently established, or, more accurately, that all saving is then done
in the form of durable consumer goods. If this is the case, then the
inflation has caused to be wasted a large potential source of saving. What is
needed in this case is a new form of government debt which will restore
the profitability of saving to small savers yet which will not force the
government to abandon the profits of inflationary finance.

Examples that might be tried abound and are limited only by the
imagination of the government. A ticket of admission to a retirement
home, for example, is a real debt that might be sold in small parts at a rate
that gives quite a profit to the government, yet which appeals to private
asset holders. Claims to food during retirement, a vacation, or some
durable good to be delivered in 5 years’ time all are assets that are
guaranteed against inflation, yet which can be sold at high enough prices to make
them profitable for the government. Debt, in short, can be made to
substitute for many things besides money and productive investment.

V 

In conclusion, it is useful to place in perspective the kinds of controls I
have discussed, but not necessarily recommended. Economists have long
been in favor of untrammeled markets and have usually been critics of the
controls I describe. Yet we should judge these controls with respect to the
practical alternatives available, not according to some theoretical ideal.
All practical forms of taxes have welfare and collection costs. All have
loopholes which must be closed at some administrative expense. Viewing
inflation as a tax on the holders of government money, we can then view
the ready availability of substitutes as a loophole for avoiding the
inflationary tax. Closing the loophole takes some administrative effort and
requires that we prevent certain transactions from taking place. But since




this is true of all taxes, it becomes an empirical question whether the 
restrictions are more costly for inflation than they are for other forms of 
taxes. 

Previous estimates of these costs indicate that the question is at least
an open one. A true calculation of the costs could best be done in a larger
model with a social welfare function. Only more limited partial analyses
have been carried out. The cost of inflation as shown by Bailey (1956)
derives from the fact that with inflation, asset holders hold less money
than they otherwise would and therefore receive less of the benefits that
are derived from money holding. Sidrauski (1967) has shown that this
can lead to more or less steady-state consumption of other goods. Marty
(1967) argues correctly that there is no a priori way to tell whether these
costs exceed the efficiency and collection costs of other practically useful
forms of taxation. He compares estimates which show the efficiency costs
to be about the same for inflation as for other forms of taxes. The policies
I have described have obvious efficiency costs, but equally obviously
increase the revenue available to the government. Unless their costs can
be shown to be unusually large, they merit consideration as practical
possibilities for financing government expenditure.

Finally, this analysis warns us that where capital market restrictions
already exist, premature removal of them as part of a counterinflationary
policy may actually accelerate the inflation if other steps are not taken to
offset their impact on the demand for money. The elimination of capital
market restrictions may be one of the goals of a counterinflationary
policy, but it is not one of the tools to accomplish that end.

VI 

I have tried to outline some of the principles of inflationary finance. The
purpose was to show that many different rates of inflation may result from
a given government deficit. The roles of exchange controls, bank
regulation, and capital market restrictions were discussed as part of a broad
policy of inflationary finance. Debt management policies were described
as well as the interaction between the policy of inflation and other goals.
New forms of debt were suggested as being likely to provide finance to the
government without hindering other objectives.

I have considered only balanced growth situations. In many instances,
however, the adjustment from one steady state to another provides an
opportunity for one-shot financing that may be of substantial size. A social
security scheme, for example, that pays an interest rate equal to the real
rate of growth collects no net revenue in a steady-state equilibrium. But
the introduction of such a scheme can provide a massive inflow of revenue
in the short run, and such a device should be considered as a useful way of




overcoming the problems of adjusting from one steady-state rate of
inflation to another (in this case, a lower one). Other permanent policies will
have to be invoked simultaneously, of course, if the new lower rate of
inflation is to be permanently maintained.


References 

Bailey, Martin J. “The Welfare Cost of Inflationary Finance.” J.P.E. 64 (April
1956): 93-110.

Cass, David, and Yaari, Menahem E. “A Re-Examination of the Pure
Consumption Loans Model.” J.P.E. 74, no. 4 (August 1966): 353-67.

Diamond, Peter A. “National Debt in a Neoclassical Growth Model.” A.E.R. 55,
no. 5 (December 1965): 1126-50.

Friedman, Milton. “Government Revenue from Inflation.” J.P.E. 79 (July
1971): 846-56.

Kafka, Alexander. “The Brazilian Stabilization Program, 1964-66.” J.P.E. 75,
no. 4, suppl. (August 1967): 596-631.

Marty, Alvin L. “Growth and the Welfare Cost of Inflationary Finance.” J.P.E.
75 (February 1967): 71-76.

Mundell, Robert A. “Growth, Stability and Inflationary Finance.” J.P.E. 73
(April 1965): 97-109.

Sidrauski, Miguel. “Inflation and Economic Growth.” J.P.E. 75 (December
1967): 796-810.

Tobin, James. “An Essay on Principles of Debt Management.” In Fiscal and Debt
Management Policies. Commission on Money and Credit Supporting Papers.
Englewood Cliffs, N.J.: Prentice-Hall, 1963.

Tobin, James. “Deposit Interest Ceilings as a Monetary Control.” J. Money,
Credit and Banking 2, no. 1 (February 1970): 4-14.

Tobin, James, and Brainard, William C. “Financial Intermediaries and the
Effectiveness of Monetary Controls.” A.E.R. 53, no. 3 (May 1963): 383-400.



A Bias in Current Measures of 
Economic Growth 


Milton Friedman
University of Chicago


It is common practice to measure the growth in economic welfare in a
country over any considerable period by the rate of growth in real per
capita income. However, this measure can be seriously biased for a period
during which the country experienced substantial immigration or
emigration.

Consider the United States from, say, 1870 to 1914. During that period,
real per capita income as measured by Simon Kuznets grew at the annual
rate of about 2 percent. However, there was heavy immigration, reaching
a peak rate of over 1¼ million persons a year in 1907. By 1914, roughly
one-third of the total population of the United States was foreign born
or of foreign or mixed parentage, and one-fifth of the population over 14
years old was foreign born. Their counterparts in 1870 (for, of course, we
want to measure the growth in economic welfare for a “representative”
population, not for a collection of identical and aging individuals) lived
or worked in other countries, not in the United States. We have every
reason to believe that their 1870 counterparts had substantially lower
incomes than the then population of the United States; presumably that
is why the United States experienced heavy immigration, not emigration.

The foreign-born in 1914 probably also had lower incomes than the rest
of the U.S. population, but again the fact of continued large-scale
immigration, let alone a wide variety of historical evidence, suggests that their
incomes in the United States were substantially above the incomes that
they could have received in the countries from which they came. In any
event, the incomes of the 1914 foreign-born are included in the aggregate
income underlying the 1914 per capita income estimate, while the
incomes of their counterparts in 1870 are completely excluded. The result
is to bias the estimated rate of growth downward. Excluding the
foreign-born in 1914 as well as their counterparts in 1870 would raise the 1914
per capita income figure and leave unchanged the 1870 figure. Including
the foreign-born in 1914 and also their counterparts in 1870 would leave
unchanged the 1914 figure while lowering the 1870 figure. Either way, the
rate of growth would be raised.
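
A short numerical sketch may make the direction of the bias concrete. All
income figures and the one-fifth population weight below are hypothetical
index numbers chosen for illustration, not estimates from Kuznets or any
other source, and the use of the same weight in both years is a simplifying
assumption.

```python
def annual_growth(initial, terminal, years):
    """Compound annual rate of growth between two per capita income figures."""
    return (terminal / initial) ** (1.0 / years) - 1.0


def average(groups):
    """Weighted mean income for a list of (population_share, income) pairs."""
    total_share = sum(share for share, _ in groups)
    return sum(share * income for share, income in groups) / total_share


YEARS = 44  # 1870 to 1914

# Hypothetical index numbers, for illustration only.
resident_1870 = 100.0     # income of the 1870 U.S. population
counterpart_1870 = 50.0   # income abroad of the counterparts of the 1914 foreign-born
native_1914 = 240.0       # income in 1914 of the rest of the U.S. population
foreign_born_1914 = 180.0
foreign_share = 0.2       # assumed weight of the foreign-born group in both years

all_1914 = average([(1 - foreign_share, native_1914),
                    (foreign_share, foreign_born_1914)])

# Crude measure: counterparts excluded in 1870, foreign-born included in 1914.
crude = annual_growth(resident_1870, all_1914, YEARS)

# Correction 1: exclude the group in both years.
excluding = annual_growth(resident_1870, native_1914, YEARS)

# Correction 2: include the group in both years.
all_1870 = average([(1 - foreign_share, resident_1870),
                    (foreign_share, counterpart_1870)])
including = annual_growth(all_1870, all_1914, YEARS)
```

In this made-up case both corrected rates exceed the crude rate, as the
argument above implies they must whenever the foreign-born earn less than the
rest of the population in 1914 but more than their counterparts did in 1870.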




For simplicity, I have stated the problem simply in terms of
foreign-born, but clearly a change in the population of first-generation,
second-generation, etc., residents would also affect the results.

One of the great economic achievements of the United States in the
period from 1870 to 1914 was the absorption of millions of residents who
came to the United States with little but their bare hands, were able to
make a better life for themselves than in their countries of origin, and to
lay the foundation for a still better life for their children. Yet not only is
this achievement not recorded positively in the common measure of
economic growth; it actually enters as a negative factor, reducing the
measured rate of growth.

As a purely arithmetical matter, it would, for example, be possible for
the average income of all residents of a country to remain constant, even
though the income of every person separately was rising, provided that
the immigrants entered at an income lower than the average, though
higher than their base income in the country of origin, and that either the
income at which they entered was falling relative to the average or the
percentage of foreign-born was rising sufficiently rapidly.
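
The arithmetical possibility just described can be checked with a two-period
example. The head counts and incomes are hypothetical, chosen only so that
the numbers come out evenly.

```python
# Hypothetical two-period example: every person's income rises, yet the
# average income of residents is unchanged.
natives = 100
native_income_before = 100.0
native_income_after = 110.0       # each native gains 10 percent

immigrants = 25                   # all arrive between the two periods
immigrant_income_abroad = 40.0    # base income in the country of origin
immigrant_income_after = 60.0     # below the resident average, above the base

average_before = native_income_before
average_after = (
    natives * native_income_after + immigrants * immigrant_income_after
) / (natives + immigrants)

everyone_gained = (
    native_income_after > native_income_before
    and immigrant_income_after > immigrant_income_abroad
)
```

Here the resident average is 100 in both periods even though natives gain
10 percent and immigrants gain 50 percent over their income abroad.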

It is almost surely impossible to correct the bias by including the
counterparts of the 1914 foreign-born in the initial income estimates. In
principle, however, the bias could be corrected with U.S. data alone by
using initial and terminal average incomes for a population standardized
not only with respect to distribution by sex and age, but also by length of
residence in the country of the individual himself or his forebears. I do
not know whether data exist that would permit such standardized averages
to be computed even roughly.

What seems clear is that such standardized averages would show a more
rapid rate of rise in real per capita income in the United States over the
period 1870 to 1914 than the current crude averages. And it is possible
that the difference would be substantial.

A similar problem arises today in the income estimates for countries
using or providing much foreign labor. How should the more than
one-half million Yugoslav workers now employed in Germany be allowed for
in the income estimates for Germany? for Yugoslavia? How are they? I do
not know the answers, but any serious student of rates of growth in
economic welfare in Germany and Yugoslavia in the postwar period must
face these questions.



Miscellany 


Human Migration: A Survey of 
Preclassical Literature 


The study of human migratory behavior is not recent in the studies of
social analysts. In fact, a thorough review of the literature reveals
information concerning this phenomenon even in the Old Testament.
The plight of the ancient Hebrews is a case in point. Prior to their
residence in Egypt, the Jewish religion found its mystical roots in the area
of current Israel. However, due to increasing economic hardship resulting
from years of drought (7 to be exact; witness the case study of Joseph of
the “coat of many colors” [Gen., chaps. 37-46]), Jacob and his son and
relations migrated to Egypt in search of an easier and more prosperous
life (Gen. 46:1). This migration was not made to a totally unknown
environment, because a member of Jacob’s family, Joseph (mentioned
above), was already an established member of the Egyptian community,
holding a position comparable to secretary of agriculture. Furthermore,
an omniscient information source also recommended the migration
(Gen. 46:3). In fact, the migration did improve welfare, although most
notably the long-run welfare of the Egyptians increased due to
enslavement of the Jews.

And the Lord said, I have surely seen the affliction of my
people which are in Egypt, and have heard their cry by reason
of their taskmasters; for I know their sorrows. . . . Now therefore,
behold, the cry of the children of Israel is come unto me: and
I have also seen the oppression wherewith the Egyptians
oppress them. [Exod. 3:7, 9]

Thus, in this case the long-term consequences of migration were economic 
and social subjugation. 

Several generations later, under the leadership of a man named Moses
(who supposedly talked to God), the Jews migrated from Egypt once
again to obtain a substantial increase in their material and spiritual






welfare. The destination was reputed to be highly “promising”: “a land
flowing with milk and honey” (Exod. 3:8). In this case, the probability
of obtaining the expected income stream in the destination was certainly
in that the Source of their information was once again unquestionably
reliable. (But, come to think of it, He suggested that they go to Egypt in
the first place.) Well, in any event, even if the information was not of
perfect reliability, the subjective probability estimate for the relative
income stream differential between the origin, Egypt, and the “Promised
Land” was no doubt sufficient to induce migration. A problem exists
with regard to relating the migration decision to the distance between the
origin and destination in that the route they traveled was highly
circumscribed and the time distance was not known prior to migration. No doubt
had most of the Jews known that the time distance was 40 years (Deut.
1:3), this would have compensated for the increased economic welfare
in the “Promised Land” and perhaps migration would not have occurred.

In addition to the standard neoclassical welfare maximization theory
of migration, the Old Testament provides two other explanations for
migration. The first is that of ostracism. One sociological explanation for
migration is that the migrant has been ostracized by the dominant
group in the population of a given geographic area. The reason for the
ostracism may vary but most commonly involves the violation of the
dominant group norms. The Garden of Eden is an illustrative example
(Gen., chaps. 1-3). It is clear that in this case God represents the dominant
“group.” It is also clear that Adam and Eve violated one of the primary
norms of behavior in the Garden of Eden in that they ate of the fruit of
the Tree of Knowledge (Gen. 3:6). Thus, grounds for ostracism existed.

However, the mere observation that they violated the norms of “the
first society” is insufficient; we must ascertain the reasons for the violation.
A more detailed analysis of the text reveals that shortly prior to the norm
violation, the migrants, Adam and Eve, interacted with a member of a
deviant minority subculture in the garden, the snake (Gen. 3:1-5).
This member of a deviant subculture disregarded the norms of God and
sought to get Adam and Eve to do the same. However, mere association
with the snake would not have been sufficient if Adam and Eve had not
desired to rise in the social structure by increasing their level of knowledge.
This implies that their reference group was not the same as their
face-to-face group in the garden. Rather, it might be suggested that they wished
to become members of the face-to-face group of which God was a member
(Gen. 3:5). It is clear that God had great knowledge, and perhaps
Adam and Eve thought that by obtaining one of the symbols of being a
god, great knowledge, they too might be accepted into that group. Thus,
two primary factors can be discerned which contributed to the violation
of the norm: first, the desire of Adam and Eve to rise in the social




structure and, second, the interaction with a member of a deviant culture 
who reduced their resistance to norm violation. 

An alternative explanation of the reason why Adam and Eve desired
to obtain knowledge has been suggested.* According to this view, they
felt that by increasing their human capital (Becker 1967) they would
raise their productivity and thereby positively affect their future income
streams. However, since the processes of production and distribution were
divorced in the Garden of Eden, the Protestant ethic not yet having
emerged, this explanation seems to be spurious.

The last explanation of migration provided in the Bible is natural
catastrophe. The case of the Great Flood is typical (Gen., chaps. 6-9).
Faced with death if a person does not move, he will accept migration,
even when no clear-cut alternative location exists. However, in order to
become part of the migration stream (or perhaps in this case migration
ocean) the individual must accumulate sufficient resources to last the
journey as well as have an acceptable means of transportation. In Noah’s
case, both of these first-order conditions were fulfilled, and therefore he
and his household unit were able to migrate in response to the approaching
catastrophe (Gen. 6:14, 15, 21). However, since the rest of humanity
lacked the requisite first-order conditions, they were unable to migrate.
Since this type of migration has no specific destination, it represents a
classic and extreme form of migration in response to “push forces.”
On the other hand, the migration of the Jews from Egypt, as discussed
earlier, could be viewed in terms of a broader application of the
“push-pull” dichotomy, with the “pull” of higher welfare in the “Promised
Land” being reinforced by the “push” of intolerable socioeconomic
conditions in Egypt.

In addition to its implications concerning the nature of the migration
decision process, this case study also provides considerable testimony
concerning the processes of adaptation after migration. As was mentioned
above, in Egypt the Jews did not assimilate due to the low economic and
social class that they occupied in that country. Rather, the path of
adaptation was one of economic pluralism and exploitation, with
interaction between social groups restricted to the market place (Furnivall
1948). After migration to the “Promised Land,” the adaptation process
was reversed for the Jews. In this case, rather than attempt to assimilate
into the indigenous culture of Palestine, they proceeded to appropriate
their land and cities (Deut., chaps. 1-4, and the Book of Joshua), in a
way not unlike that used by the Europeans with respect to the American
Indians in more recent times (Brown 1970). Thus, the indigenous
population of Palestine was subjugated economically but, in this case, not
socially. Little mention is made, but the implication is that after several


* Many thanks to Dr. Robert McIntyre for his help in this regard.




the indigenous [illegible] intermarriage or disappeared [illegible]. In any
event, the result was a coherent economic and social system defined by the
culture of the recently migrated Jews.

Although there are considerable problems in drawing modern parallels
to the above-mentioned ancient processes, some parallels do indeed exist.
Actually, a large portion of the modern literature on migration is merely
a refinement of the processes described above. The nature of economic
adaptation, however, has changed considerably since the time of Adam
and Moses. Due to the increased extent of economic and social
stratification in most modern countries, the employment and role paths of the
migrants have changed considerably from those of ancient times. Another
important factor that has changed between the two time contexts is that
the time perspective of the migrants and adapting postmigrants has
become considerably shorter. While in ancient times men were willing
to accept current suffering and hardship so that future generations could
benefit, modern man’s time perspective is considerably shorter. Thus,
adaptation has to have an immediacy that did not exist in the past.

If substantial improvements in welfare do not result from migration
within a short period after migration, then further migration or back
migration may result. This points to yet another difference between
ancient and modern times that affects the migration-adaptation processes.
Modern man is much less stable, geographically, in that migration over
a considerable distance is often chosen more than once in one man’s
lifetime.

In conclusion, while the “ancient” migration theories presented in
this paper require some modification for them to be applicable to modern
society, they contain considerable “Truth.” Through the far more
numerous opportunities for testing, these theories should be capable of
both partial verification and useful extension and modification.

Robert Weinstein 

Ohio State University


References 

Becker, G. Human Capital and the Personal Distribution of Income. Ann Arbor: Univ.
Michigan Press, 1967.

Brown, D. Bury My Heart at Wounded Knee. New York: Bantam, 1970.

Furnivall, J. S. Colonial Policy and Practice. Cambridge: Cambridge Univ. Press,
1948.



Book Reviews 


Theories of Poverty and Underemployment. By David M. Gordon.

Lexington, Mass.: Lexington Books, 1972. Pp. ix+179.

This book surveys and evaluates a large body of literature bearing on the personal
distribution of earnings, with special emphasis on poverty. Gordon argues that
much of conventional theory on the subject is misplaced, that various distributional
policies advocated under its perspective have not achieved their intended effects,
and that empirical regularities uncovered by a different view of low-wage labor
markets are inconsistent with received theory. The book consists of two parts. In
the first, Gordon presents a brief, incomplete survey of what he calls orthodox
theory, an exposition and review of the dual-labor-market hypothesis, and some
explanation of recent radical theories of labor markets. The second part contrasts,
compares, and criticizes the theories from both policy and empirical points of
view, using existing empirical work as a guide.
The book is organized around Thomas Kuhn’s (1962) concept of a paradigm.
Gordon evidently can recognize a paradigm when he sees one, for the book is the
outcome of work initially supported by the Office of Economic Opportunity to
stimulate “paradigm competition” regarding labor market behavior. This
methodological paraphernalia, along with the tripartite distinction among radical,
dual-market, and orthodox theories, makes it tempting for a reviewer to draw
some obvious analogies with political and religious ideology. That temptation will
be resisted, however, since the book is hardly polemical. In any event, the
competition is marred by the fact that the battlegrounds are not sharply drawn. An
objective judge, should one exist in such matters, needs some criteria on which to
choose a victor. For that, the theories must be focused on a sufficiently interesting
and similar set of problems for which their predictions differ. Though Gordon
attempts the task, he really cannot accomplish it in any systematic manner, for
the radical and dual-labor-market theories admittedly are far less complete than
“orthodox” theory. Indeed, they may be so incomplete as to make it
problematical whether they deserve the label of paradigms, or even of theories for that
matter. Nevertheless, the book does succeed in stimulating interest in alternative
approaches to labor economics. It contains an excellent review and bibliography
of dual- and radical labor market literature and will serve as a relatively painless
introduction for those curious to discover what all the shouting is about.

What are the differences between the alternative theories? All are based in
maximizing behavior, with varying emphasis on constraints. Orthodoxy, as
Gordon sees it, relates to the principles of marginal productivity and human capital.
Dualists and radicals accept the lifetime-earnings emphasis of human capitalists
as the empirical magnitudes of primary importance. In addition, the former
bring in some specific institutional features of labor markets, while the latter stress
a concept of class interests as a major determinant of the institutional features.
Both elevate the importance of specific job experience in generating lifetime
productivity and earnings compared with the role of formal education as a
productivity-augmenting device.




The fundamental insight of dual-labor-market literature is that the nature of
work activities and job structure are endogenously determined, something that is
masked over and often forgotten in writing enterprise production functions in
terms of labor and capital aggregates. Profit-seeking employers structure work
activities within their technological constraints to take advantage of existing
supplies of personal characteristics of workers. To this is added a kind of recursive
feedback mechanism in which workers’ tastes and job-related personal
characteristics are affected, in an unspecified manner, by the nature of their work activities.
Finally, technological and product-demand conditions require a class of
inherently unstable short-term jobs to serve buffer functions. The market becomes
segmented between job types offering chances for advancement and large lifetime
earnings and those offering high turnover and little in the way of future success.
Tastes and worker characteristics evolve in ways that are consistent with this
division, and the situation is maintained indefinitely. Hence, the term “dual
labor market.” On this view, altering the lower tail of the earnings distribution
involves literally breaking into the group holding unstable jobs.

As a doctrinal matter, it is not clear why this theory should be labeled an
alternative to orthodoxy, rather than an interesting and potentially important
application of conventional concepts. Contributors to the dual-market literature
apparently do not obtain their inspirations from conventional modes of thinking.
Yet their differences in language may be misleading and perhaps not indicative
of real differences in substance. What we seem to have here is a neat application
of the theory of equalizing differences, with the twist that both employers and
workers can alter job and personal characteristics exchanged in the market. I find
it slightly scandalous that Gordon scarcely mentions the concept of equalizing
differences in this connection. Virtually none of the literature on the subject is
reviewed or cited, of which Rottenberg’s (1956) paper remains the best single
exposition of its scope and limitations. A strong case can be made for predicting
that the major impact of dual-market and some radical literature on future labor
market research will be to revive and extend the Smithian principles.

The radical position, revolving around the elusive concept of class, is far less
easy to summarize. A central element is that employers not only structure jobs in
their private self interest, but also with an eye toward thwarting class
consciousness. Worker choice is severely constrained, and focus naturally shifts from
individual choices themselves to the constraining institutions. The main difficulty
with the class concept, aside from its unappetizing requirement of large-numbers
collusion, is that no operational definition of class is available, a deficiency Gordon
acknowledges at several points in the book. Now collusion in the presence of
government assistance, a favored theme of economists at the other end of the
political spectrum, has more a priori appeal. Radicals stress the interrelationships
between public goods production and legislation that furthers dominant class
interests and engenders taste formation consistent with these interests among other
segments of the population. But all economic policies have income effects, some
more blatant than others. In fact, most of Gordon’s more convincing illustrations
of the uses of the class concept involve ex post evaluations of gains and losses to
identifiable groups arising out of specific policies and legislation. Ex post analysis
is easy. Out of all configurations of feasible institutions, can the radicals narrow
the range of predictable winning coalitions that will obtain in practice?

Any theory of income distribution must face up to the fact that huge variation
in earnings exists among individuals perceived to be identical within the limits of
available data. The residual variance of earnings is at least as large as and probably
larger than the systematic part. Independent of what interpretation is placed on
the systematic components, controversies among competing theories of income
distribution will remain unresolved until individual earnings can be predicted




much more accurately than is possible at present. All the theories discussed here
are highly deterministic in one sense or another. Perhaps the data are telling us
something.

Sherwin Rosen

University of Rochester

and

Harvard University


References 

Kuhn, Thomas S. The Structure of Scientific Revolutions. Chicago: Univ. Chicago
Press, 1962.

Rottenberg, Simon. “On Choice in Labor Markets.” Indus. Labor Relations Rev.
(January 1956).


Soviet Wages: Changes in Structure and Administration since 1956. By Leonard J.
Kirsch.

Cambridge, Mass.: M.I.T. Press, 1972. $12.50.

This volume is undoubtedly the most substantial published study of Soviet wages
since Bergson’s The Structure of Soviet Wages (1944). Although its focus is on the
wage reform of 1958-60, it also traces the evolution of Soviet wage administration
and wage structure (chiefly intraindustry differentials) through 1968.

In the area of wage administration perhaps the most significant aspect of the
reform has been the change in systems of wage payment. Prior to the reform
approximately two-thirds of Soviet industrial workers were paid according to
straight or progressive piece rates. By 1965 the latter had virtually disappeared
while the former applied to less than one-quarter of industrial workers. The single
most important form of wage payment has become time rates plus premiums,
while the piece rates that remain have increasingly become “collective” in nature
(with the individual’s earnings depending on the degree of fulfillment of his work
team’s collective output norm).

Kirsch’s discussion of the rationale for the continuing Soviet reliance on some
form of “payment by results”—whether premiums, piece rates, or some
combination of the two—seems particularly useful. It is clear that wage payment systems
that stimulate a more intense work pace than would normally prevail under
conditions of continuous full employment have much to recommend them. But
perhaps more important, “the wide application of various incentive systems
flexibility to an otherwise relatively rigid system of wage administration” (p 4^).
In an administrative framework in which basic wage rates are determined by
central authorities, the extensive use of incentive payment schemes administered
by enterprise authorities provides a means of adapting earnings differentials to the
requirements of local labor market conditions. Enterprise authorities, in effect,
can “correct” the centrally fixed basic rates to meet scarcities of particular skills
via the implementation of incentive payment schemes.

The most interesting portions of Kirsch’s volume, at least for me, are those
concerned with establishing recent trends in Soviet wage inequality. The evidence
amassed by Kirsch makes it clear that since the late 1950s, if not earlier, there has
been “a significant reduction of wage inequality among industrial workers”
(p. 181). This trend reflects a reduction in the differentiation of earnings according
to skill as well as a decline in interindustry wage differentials. Although Kirsch




focuses on wage differentiation among industrial workers, a similar leveling
tendency has also characterized the average earnings gap between workers, on
the one hand, and “engineering-technical personnel,” on the other. Thus, the
decline in earnings inequality which Kirsch observes among workers may also
apply to the broader category of “workers and employees.”

These trends in the distribution of Soviet earnings are in striking contrast to
the increased inequality which Henle has found for the distribution of earnings
among wage and salary recipients in the United States in the period 1958-70
(Monthly Labor Review [December 1972], pp. 16-27). It is curious that this trend
should be treated by Henle as “perhaps almost inevitable in an advanced
economy,” while Kirsch interprets the opposite trend in Soviet Russia as a kind of
natural accompaniment of increasing Soviet economic maturity (higher
educational attainments, declining relative scarcity of skilled labor, a slowdown in the
flow of peasant labor into industry). Is there a tendency to treat as “normal,” as
somehow inherent in a given stage of economic “maturity,” whatever trends in
wage structure we have discovered? Whatever explanations may be offered for the
divergent current trends in Soviet and American wage structures, it seems
probable that the kinds of labor market stratification which reinforce wage inequality
in the United States are not an obstacle to the implementation of wage-leveling
policies in the Soviet Union.

Kirsch’s findings for the period since 1956, when joined to the results of earlier
studies in this area, now make it possible to trace the broad outlines of Soviet
money wage differentiation from the early 1930s to the late 1960s. But if, as
Arthur Lewis once remarked, “socialism is about equality” (at least in part), it
must be recognized that trends in Soviet money wage differentials are not always
a reliable guide to broader changes in social and economic inequality under
Soviet socialism. The campaign against “equalitarianism” and the widening of
money wage differentials during the early planning years were accompanied by
the existence of rationing, widespread shortages of basic consumers’ goods, and a
substantial rise in communal consumption, all of which obviously limited the
extent to which greater money wage differentials could be converted into
increased real income differentials. In more recent years the opportunities for those
with relatively high money incomes to gain access to newly available goods and
services (ranging from an increasing variety of consumer durables to occasional
foreign travel) suggest that declining money wage differentials are compatible
with at least some forms of increased social and economic differentiation.

These comments are not intended to detract from the value of Kirsch’s study of
the structure of Soviet money wages, but to indicate that the nature and forms of
inequality in Soviet society remain fertile fields for continued study. The
distribution of political power is the only dimension of Soviet inequality in which
the important questions have long been settled.

Murray Yanowitch

Hofstra University


Impact of Uncertainty on Location. By Michael J. Webber.

Cambridge, Mass.: M.I.T. Press, 1972. Pp. xviii+310.

It is well known that the introduction of uncertainty into economic models has
enabled us to explain several phenomena imperfectly analyzed by models in a
situation of certainty. In this book, Michael J. Webber sets out to discover
how uncertainty affects location decisions.




In Webber’s view, the main objective of location theory is to explain why
economic activity is spatially concentrated. Starting from this position, the first
part of the book (chaps. 2-4) is devoted to a critical analysis of existing theory
in certainty situations. In the second part (chaps. 5-9), the author studies the role
of uncertainty in the processes of the location of the firm.

The discrepancy observed in many cases between the ex post optimal location
and the actual location has led some economists to reject the maximization of
profit as an objective of the firm when making its location choice. According to
Webber, it is uncertainty which enables us to explain this discrepancy. Indeed,
the decision may be optimal given the firm’s knowledge at the moment of decision
but may no longer be so when the firm has obtained a better knowledge of its
environment. Furthermore, the firm must sustain monetary costs in collecting
information on possible location sites. But the cost and the uncertainty may
increase with distance. This leads to a reduction in the number of locations
considered by the firm. The same is true of the different markets where the firm
may sell its output or buy its inputs.

The author distinguishes between three types of uncertainty: uncertainty about
the behavior of rivals, about the state of the environment, and about production
techniques. A section of this book is devoted to each.

The most original part of the work is undoubtedly the part devoted to
uncertainty about rivals (chaps. 6 and 7). Webber assumes that each firm making a
location decision will not know its rivals’ intentions, that is, the number, date,
and place of future locations, and cannot change location. Given the difficulty of
analyzing all the possible processes, Webber proposes a very ingenious hypothesis:
the firm makes its decision while imagining that the other firms are going to
locate at the same time. He does not claim that all location decisions are taken
simultaneously, but his hypothesis seems to enable us to simulate the decisions of
firms which are expecting other firms to locate into the market at a later date.
In this way the decision making may be analyzed within game models. It is
assumed that the firms cannot communicate with each other and that they are
seeking to maximize their profits safely in order to stay in business. So the firms
choose the least risky locations: they locate in existing towns, mainly in the central
town, whereas in a situation of certainty out-of-town locations would be possible.
Thus one has a theory capable of explaining urban growth.

This stimulating conclusion is weakened once the specificity of the hypothesis
is underlined. It is supposed that the market is linear, and this leads to a highly
simplistic representation of economic space. In the process of location choice,
production costs are unknown. A demand spread over the market is particularly
applicable to consumer goods, but for enterprises producing intermediate or
capital goods, the spatial structure of demand tends to be located at a finite number
of points (Webber’s analysis of wholesalers’ location is not very satisfactory). The
absence of information and coalition between firms is mainly true for small
enterprises. Indeed, if the head offices of large corporations are very often located
in large towns in order to be near information sources and centers of political
decision making, the location of production units is no longer made in the light
of the same criterion.

Furthermore, the decision criterion used by Webber is not defined with enough
precision. It corresponds in the majority of cases to a criterion of total uncertainty
about rivals. However, in the appendix to chapter 7 (p. 183) each firm is assumed
to know its rivals’ locations at the time it decides on its own optimal location.
The game solution, if it exists, is given by the solution of the system of equations
so obtained. The game bears a striking resemblance to Cournot’s game, which is
not a game in a situation of total uncertainty. Why this difference between the
two?




Finally, Webber implicitly assumes the existence of an equilibrium in many
cases (see, for example, p. 138) without analyzing the mathematical conditions
necessary to ensure its existence. If we accept a more realistic representation of
economic space, if the market area has any kind of shape, for example, or if demand
is located at a finite number of points, or if the markets are spatially separated, the
condition of convexity is not satisfied. In these cases, mixed strategies must be
used, but Webber says nothing about them.

Chapters 8 and 9 discuss the impact of the other two forms of uncertainty, about
the state of the environment and about production techniques. In Webber’s view,
they reinforce the tendencies toward spatial concentration. However, his argument
is not rigorous enough. Furthermore, in chapter 8, Webber appears to be unaware
of the contributions to the economic theory of the firm concerning the uncertainty
of demand.

Finally, chapter 10 is devoted to a synthesis of his results and reflections about
economic policy.

A better use of mathematical tools would undoubtedly have enabled Webber
to reach a higher level of precision in his concepts and arguments. If this had been
the case, both the limits and the qualities of the model would have appeared
more clearly. The fact remains, however, that by his contribution Webber
reveals several interesting areas for future research. I agree with him in his
conclusion that “these results are sufficiently encouraging to indicate that an
integrated mathematical statement of the general theory of the location under
uncertainty should be made.”

Jacques Thisse

Fonds National de la Recherche Scientifique (Belgium)



International Capital Flows and 
Portfolio Equilibrium 


Pentti J. K. Kouri

International Monetary Fund

Michael G. Porter

International Monetary Fund and Reserve Bank of Australia


This paper develops a model of international capital flows from a
general equilibrium model of the financial markets of an open economy.
Capital flows are viewed as the mechanism by which a domestic excess
demand for money is removed, and consequently the key explanatory
variables in the model are changes in domestic income, the current-
account balance, changes in domestic monetary instruments, and
changes in foreign interest rates, since these will all affect either the
demand for or supply of money. The quarterly estimates of the capital
flow equation for Australia, Italy, Germany, and the Netherlands
appear to be consistent with the hypotheses derived from the model.


This paper derives a model of international capital flows from a portfolio
equilibrium model of an open economy under fixed exchange rates. Our
approach is a synthesis of the stock equilibrium approach to capital flows
developed by Branson (1968)* and the monetary approach to balance-
of-payments theory as developed for example in Mundell (1968) and


The views expressed in this paper are those of the authors and not necessarily those of
the International Monetary Fund or the Reserve Bank of Australia. Earlier drafts of this
paper were presented at the winter meetings of the Econometric Society, Toronto, in
December 1972, the meeting of the Project Link in Vienna in August 1972, and at a
seminar at the Board of Governors, Federal Reserve System. We have benefited from
comments and suggestions from participants of these meetings, from our colleagues at
the International Monetary Fund, and from a referee. Mr. Fernando Santos has provided
valuable computational assistance.

* This approach has been followed in numerous studies, for example, Lee (1969),
Bryant and Hendershott (1970), Miller and Whitman (1970), Branson and Hill (1971),
and Hodjera (1971).




Fig. 1.—The demand for foreign bonds when capital is perfectly mobile. (Vertical
axis: the interest rate; horizontal axis: the demand for and the supply of bonds.)

interest rate, R*, the domestic demand for bonds is OB. The domestic
net supply of bonds is OA. The net stock of foreign bonds is equal to the
difference between the total demand and the domestic supply (AB).
In this framework, capital flows originate either from shifts in the BB
schedule or from shifts in the net domestic supply (for instance, brought
about by open-market operations) or from changes in the world interest
rate R*. While this extreme case may not be empirically relevant yet
(except when explaining capital flows within regions of a country), it
would in our view be desirable to have a model that would contain this
as a special case.

A third problem with the Branson model is the fact that while it rests
on solid microeconomic foundations it cannot easily be used to analyze
many macroeconomic problems of importance, such as the effect of income
fluctuations on the capital account, the link between the current account
and capital flows, and the effect of changes in investment and saving
behavior on the capital account. In addition, the model treats the domestic
interest rate as the instrument of monetary policy, thus implicitly
assuming complete sterilization of the liquidity effects of capital flows.
We shall return to the question of sterilization later. It suffices to point out
here that it would be useful to be able to say something about the





magnitude of, for instance, open-market operations needed to offset the 
liquidity effects of capital flows resulting from “independent” monetary 
policy. 

A final comment relates to the monetary approach to the balance of
payments. The monetary theory of the balance of payments as developed,
for instance, in Johnson (1971), explains the overall balance of payments
as a function of excess demand for money. This approach is in contrast to
much of the existing literature on capital flows which assumes implicitly
that the demand for and supply of money determine the interest rate
which then has an indirect effect on the capital account. Similarly, while
the monetary model implies a direct positive association between changes
in income and the balance of payments, most of the capital flow models
allow only for an indirect effect through the interest rate.

In the next section we develop a simple model of capital movements
that attempts to deal with the four problems discussed above.

B. A Portfolio Equilibrium Model

Our model is concerned with the mechanism through which equilibrium is
obtained in the financial markets of a small open economy.* It is assumed
that changes in income, prices, and the stock of wealth are exogenously
given, along with the associated current account of the balance of
payments. Thus monetary factors are assumed not to influence real variables
in the current period. Rather, it is assumed that changes in real variables
and changes in the domestic component of the monetary base cause
portfolio substitutions which lead both to capital flows and changes in
the domestic interest rate. By so constraining adjustment in the goods
markets it is possible to derive capital-flow equations which facilitate
relatively straightforward estimation and interpretation. The model
would appear reasonable when applied to quarterly data, given the
abundance of evidence that monetary variables affect the real economy
with a substantial and highly variable lag.

For simplicity, it is assumed that there are only three kinds of financial
assets in the economy: base money, domestic bonds, and foreign bonds.
It is further assumed that expectations regarding both the price level and
the exchange rate are stationary.’ The allocation of a given stock of
financial wealth between these alternatives depends on the stock of
wealth, income, the domestic interest rate, the foreign interest rate, and

* For a full exposition of a general equilibrium approach to monetary theory in a closed
economy, see James Tobin (1969). A stock equilibrium approach to open monetary
systems may be found in McKinnon and Oates (1966), Floyd (1969), and McKinnon
(1969).

’ In the empirical analysis we include dummy variables to capture the effect of
speculation.





other exogenous factors.® The basic model contains equilibrium
conditions for the demand for and supply of money, domestic bonds, and
foreign bonds. The wealth constraint implies that only two equilibrium
conditions are independent, since if excess demand for base money and
domestic bonds is zero it follows that excess demand for foreign bonds is
also zero. These equilibrium conditions then determine the equilibrium
stock of money, the domestic interest rate, and the stock of foreign bonds.
The basic model is specified in table 1.

The main feature of the model is that the income, wealth, and the
expected-yield variables enter all demand functions for reasons set out in
Tobin (1969). The partial derivatives of the respective arguments in the
demand functions need little explanation. However, it should be noted
that a rise in income will, ceteris paribus, reduce the total demand for bonds
since it increases the demand for money. Equations (4), (5), and (6) are
related via the wealth constraint (12). The money supply is the sum of
the domestic and foreign assets of the central bank. The monetary policy
instrument, NDA, is assumed to change only as the result of open-market
operations. In empirical estimation, further refinements are necessary to
allow for such factors as changes in reserve requirements.

The foreign component of the money supply changes either through
capital flows or through the current account. Capital flows are identically
equal to the change in domestic holdings of foreign bonds minus the
change in foreign holdings of domestic bonds. Domestic bond market
equilibrium is obtained when the net domestic stock of bonds issued by
the government is equal to the total demand for domestic bonds, both
from residents and foreigners.’ Finally, a simplification we have adopted
is to express equilibrium conditions for base money rather than the total
money supply, this being equivalent to assuming a money multiplier of
one. The essential features of the model are not substantially changed by
incorporating a more complete banking system; however, we have
refrained from such extensions in order to keep the model manageable.

The domestic economy is assumed to be small, such that the foreign 
supply of foreign bonds is infinitely elastic at the foreign interest rate, R*.

* The stock of wealth is not determined within the model. It changes from one period
to another as a result of saving, which is exogenous to the model. If, however, we allowed
for changes in the valuation of existing assets, the value of the stock of wealth would
become endogenous.

’ We abstract from the effect of budget deficits or surpluses on the supply of bonds. We
also ignore equity claims on real capital as well as holdings of physical commodities.
Furthermore, our concept of wealth is a narrow one in that it only refers to the market
value of financial assets rather than the capitalized value of expected future income. We
also abstract from the problem of term structure of interest rates between different
maturity classes of securities. These abstractions should not affect the argument since our
primary interest is in portfolio substitution between domestic and foreign securities and
money.





However, the specific risk attached to domestic securities implies that the
foreign demand curve for domestic bonds is upward sloping.

Assuming that equilibrium is obtained within the period, the model
can be solved for net capital flows and the change in the domestic interest
rate as a function of the exogenous variables. By invoking the wealth
constraint (12), equation (5) and also equations (9) and (14) may be
omitted, and hence the system can be reduced to the following two
equations:

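$$\Delta L(Y, W, R, R^*, E) = \Delta NDA + CAB + TC, \tag{15}$$

$$TC = \Delta F(Y^*, W^*, R, R^*, E) - \Delta J(Y, W, R, R^*, E). \tag{16}$$

Invoking the wealth constraint, that is,

$$L_W + H_W + J_W = 1, \tag{17}$$

and

$$L_i + H_i + J_i = 0, \qquad i = Y, R, R^*,$$

and solving, we obtain the reduced-form equations for TC and $\Delta R$:

$$\Delta R = -\frac{1}{H_R + F_R}\Bigl[(H_{R^*} + F_{R^*})\,\Delta R^* + H_Y\,\Delta Y + (H_W - 1)\,\Delta W + CAB + \Delta NDA + F_{Y^*}\,\Delta Y^* + F_{W^*}\,\Delta W^*\Bigr], \tag{18}$$

$$\begin{aligned}
TC = -\frac{1}{H_R + F_R}\Bigl\{ & \bigl[(J_R - F_R)L_{R^*} + L_R(F_{R^*} - J_{R^*})\bigr]\,\Delta R^* \\
& + \bigl[(J_R - F_R)L_Y - L_R J_Y\bigr]\,\Delta Y + (F_R - J_R)\,\Delta NDA \\
& + (F_R - J_R)\,CAB + \bigl[(J_R - F_R)L_W - J_W L_R\bigr]\,\Delta W \\
& + L_R F_{Y^*}\,\Delta Y^* + L_R F_{W^*}\,\Delta W^* \Bigr\}.
\end{aligned} \tag{19}$$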
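The algebra leading from (15)-(17) to the reduced forms can be sketched as follows. This is a reconstruction, not a passage from the paper: it assumes that the demand functions are linearized around the initial equilibrium and that exchange-rate expectations, E, are held constant.

```latex
% Step 1: linearize (15) and (16), holding E fixed.
\begin{align*}
L_Y\,\Delta Y + L_W\,\Delta W + L_R\,\Delta R + L_{R^*}\,\Delta R^*
  &= \Delta NDA + CAB + TC, \\
TC &= F_{Y^*}\,\Delta Y^* + F_{W^*}\,\Delta W^* + (F_R - J_R)\,\Delta R
      + (F_{R^*} - J_{R^*})\,\Delta R^* - J_Y\,\Delta Y - J_W\,\Delta W .
\end{align*}
% Step 2: substitute TC into the first line and collect the terms in \Delta R.
\begin{align*}
(L_R + J_R - F_R)\,\Delta R
  &= \Delta NDA + CAB + F_{Y^*}\,\Delta Y^* + F_{W^*}\,\Delta W^*
     + (F_{R^*} - J_{R^*} - L_{R^*})\,\Delta R^* \\
  &\quad - (L_Y + J_Y)\,\Delta Y - (L_W + J_W)\,\Delta W .
\end{align*}
% Step 3: apply the wealth constraint,
%   L_R + J_R = -H_R,  L_{R^*} + J_{R^*} = -H_{R^*},
%   L_Y + J_Y = -H_Y,  L_W + J_W = 1 - H_W,
% which gives the interest-rate equation.
\begin{align*}
-(H_R + F_R)\,\Delta R
  &= (H_{R^*} + F_{R^*})\,\Delta R^* + H_Y\,\Delta Y + (H_W - 1)\,\Delta W
     + CAB + \Delta NDA + F_{Y^*}\,\Delta Y^* + F_{W^*}\,\Delta W^* .
\end{align*}
```

Substituting this expression for $\Delta R$ back into the linearized money-market condition and solving for TC gives the capital-flow equation; for instance, the coefficient on $\Delta NDA$ is $L_R/(L_R + J_R - F_R) - 1 = (F_R - J_R)/(L_R + J_R - F_R)$.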

In the limiting case of perfect capital market integration, we assume that
$H_R$ and $J_{R^*}$ approach plus infinity while $H_{R^*}$ and $J_R$ approach minus
infinity. In other words, a small change in domestic or foreign interest rates
generates infinite substitution of domestic and foreign bonds.* In this
case equations (18) and (19) are greatly simplified. For example, the
expression $(J_R - F_R)/[-(H_R + F_R)] \to 1$ as $J_R \to -\infty$, since

$$\frac{J_R - F_R}{-(H_R + F_R)} = \frac{J_R - F_R}{L_R + J_R - F_R} = \frac{1 - (F_R/J_R)}{(L_R/J_R) + 1 - (F_R/J_R)}.$$

* Strictly speaking we should note that there is a discontinuity in the partial derivatives.
As long as the initial portfolio contains both domestic and foreign bonds, the partial
derivatives $H_R$, $H_{R^*}$, $J_R$, and $J_{R^*}$ are infinite in the limiting case. However, the total effect
of an interest rate change on the holdings of, for instance, foreign bonds must be finite
because of the wealth constraint even in the limiting case.





The expression on the right-hand side of the above equation approaches
one for finite values of $L_R$ and $F_R$ as $J_R \to -\infty$. Taking the limiting
values of the other coefficients gives us the following results under
conditions of perfect capital market integration:

$$\Delta R = -\frac{H_{R^*} + F_{R^*}}{H_R + F_R}\,\Delta R^*, \tag{20}$$

$$TC = -\Delta NDA - CAB + \left[L_{R^*} - L_R\,\frac{H_{R^*} + F_{R^*}}{H_R + F_R}\right]\Delta R^* + L_Y\,\Delta Y + L_W\,\Delta W. \tag{21}$$

These results, in turn, may be further simplified if it is assumed that
domestic and foreign securities offer equivalent risk, such that $J_{R^*}/J_R$
and $H_{R^*}/H_R$ approach minus one.’ In this case equation (20) becomes
$\Delta R = \Delta R^*$ and the coefficient on $\Delta R^*$ in (21) is $L_{R^*} + L_R$.
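The limiting argument used above can be checked numerically. The following is a minimal sketch, not part of the model itself: the finite slopes chosen for L_R and F_R are purely hypothetical, and the function name is ours. It evaluates the ratio (J_R - F_R)/(L_R + J_R - F_R) for increasingly negative values of J_R.

```python
def offset_ratio(j_r, l_r, f_r):
    """Return (J_R - F_R) / (L_R + J_R - F_R), the ratio discussed in the text."""
    return (j_r - f_r) / (l_r + j_r - f_r)


# Hypothetical finite slopes, chosen only for illustration.
L_R = -2.0  # money demand falls when the domestic interest rate rises
F_R = 3.0   # foreign demand for domestic bonds rises with the domestic rate

for j_r in (-10.0, -1e3, -1e6):
    # The ratio moves toward one as J_R becomes large and negative.
    print(j_r, offset_ratio(j_r, L_R, F_R))
```

With finite L_R and F_R the ratio rises toward one as J_R falls, which is the sense in which the offset becomes complete under perfect capital market integration.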


C. Interpretation of the Capital-Flow Equation

The reduced-form capital-flow equation (19) and its limiting case (21)
contain as arguments the monetary policy instrument, ΔNDA; the current
account balance, CAB; the change in the foreign interest rate, ΔR*; the
change in domestic income and wealth, ΔY and ΔW; and the change in
foreign income and wealth, ΔY* and ΔW*. The extent to which monetary
policy is offset by capital flows is estimated by the coefficients on ΔNDA
and CAB, and the two coefficients should be similar. The limiting case of
perfect mobility implies a coefficient close to minus one. It is worth noting
that in this limiting case the Branson model provides estimates of the
interest elasticity of capital flows which are biased toward zero.

Fluctuations in money demand caused by income are a source of
capital flows; in the limiting case the income coefficient is $L_Y$, the inverse
of the marginal income velocity of base money. Foreign interest rate
changes need not directly be a major source of fluctuations on the capital
account as long as the domestic interest rate is allowed to adjust. In the
limiting case the coefficient on the foreign interest rate measures the
interest elasticity of the demand for money. Indirectly, changes in the
foreign interest rate can have a much bigger effect on the capital account
through their effect on income and the current account.

’ See “International Interest Kate RaUoiu as Behavior towards Foreign Exchange 
Risk and Exchange Rate Expectations,” appendix to Porter (1971). In the remainder of
this paper we assume that as $H_R$ and $J_{R^*}$ approach plus infinity and $H_{R^*}$ and $J_R$ approach
minus infinity, $H_{R^*}/H_R$ and $J_R/J_{R^*}$ both approach minus one. This amounts to
assuming that securities offer equivalent risk in this limiting case.




D. Exchange-Rate Expectations 

It is clear that from time to time there are substantial capital flows
arising from expectations of movements in the exchange rate, and this
aspect of behavior has been neglected in our model. A difficulty with
developing an adequate model of exchange-rate expectations within our
framework is that while an adaptive expectations model may explain
expectations within the band, such a model will be of little relevance
when the spot rate is at the ceiling or the floor and there is an expectation
of a shift in the parity. Given this discontinuity in behavior, the most
reasonable approach for our purposes would appear to be to use dummy
variables for those periods when there were definite expectations of parity
changes (e.g., at the time of the 1969 revaluation of the Deutschmark).


II. Empirical Analysis

A. The Estimating Equation

The ideal approach would be to estimate simultaneously the complete
set of structural equations developed in the previous section. With the
use of the a priori constraints all of the structural parameters could be
identified. This approach, however, faces several difficulties. A major
problem is the unavailability of reliable data on the various stocks of
assets and interest rates needed for simultaneous estimation. For instance,
stock data on foreign assets and liabilities do not exist for most countries.
A second problem is the need for considerable institutional detail to
adapt the theoretical framework to the peculiarities of any given country.
Third, there is the problem referred to previously that, as assets become
perfect substitutes, individual structural equations cease to exist as
well-defined functions. Thus when domestic and foreign bonds become perfect
substitutes there is no stable demand function for foreign bonds; the
demand for foreign bonds becomes equal to the excess of aggregate
demand for bonds over the domestic supply of bonds. As we show in the
preceding sections, the reduced-form equations can be readily defined and
interpreted even in this limiting case.

The above considerations suggest the alternative approach of estimating
the reduced-form equation of the previous section directly. This approach
provides the critical information on most of the questions in which we are
primarily interested, such as the effect of income changes and monetary
policy on the capital account and the relationship between capital flows
and current account deficits or surpluses.

Thus we estimate the following capital-flow equation:

$$TC = a_0 + a_1\,\Delta Y + a_2\,\Delta R^* + a_3\,\Delta NDA + a_4\,CAB + a_5\,\Delta E + a_6\,\Delta W + a_7\,\Delta Y^* + a_8\,\Delta W^* + u.$$




The variables have the same meaning as in the previous sections. Most of
these variables are readily available in statistical form, the exceptions
being the wealth and foreign income data, ΔW, ΔY*, and ΔW*. There
are no data available on W or W*. It is also difficult to construct a
quarterly series on world income. After some experimentation we decided
to drop these three variables. Since ΔW, ΔY*, and ΔW* affect
capital flows in opposite directions, and since we include the constant
term, it seems reasonable to assume that the elimination of these
variables will not greatly affect the results. Furthermore, we know
from the model that when the degree of capital mobility is high the
coefficients $a_7$ and $a_8$ tend to be very small, approaching zero in the limiting
case. Regarding the other coefficients, we know that according to our model
$a_1 > 0$, $a_2 < 0$, $-1 \le a_3 \le 0$, and $-1 \le a_4 \le 0$. This provides a basis
for a one-tail test of the significance of these coefficients. Furthermore,
we would expect that $a_3 = a_4$, and we also know that both of them
approach minus one as capital becomes perfectly mobile. The ΔE variable
measures changes in exchange-rate expectations. As we discussed
previously, we replace ΔE by dummy variables whenever appropriate.
As has been noted above, all those components of the capital account
that can be regarded as autonomous should be added to the current
account. Thus whenever the data are available we add official capital
flows to the right-hand side. In the case of Germany we also estimate the
model on the assumption that long-term capital flows are exogenous.

B. Sterilization Policies and Possible Bias

The capital-flow equations estimated below treat the change in the net
domestic assets of the central bank (ΔNDA) as exogenous. However, it is
possible that the authorities try to adjust ΔNDA at times so as to offset the
liquidity effect of payments deficits or surpluses. Indeed, Michaely (1971)
provides evidence that partial sterilization was the normal policy followed
by a number of industrial countries in the period 1950-66. It is also
possible that the monetary authorities assign monetary policy to the
balance-of-payments target. In this case the Central Bank would increase
rather than decrease ΔNDA in response to an inflow of capital. These
practices give rise to a possible bias when we estimate the above equation
by OLS. It can be shown that in the case of sterilization the offset
coefficient is biased toward minus one, while in the other case it is biased
toward zero. This can be shown in the context of the following simple
model:

$$TC = a + b\,\Delta NDA + cX_1 + u, \qquad \Delta NDA = \alpha + \beta\,TC + \gamma X_2 + v,$$

where $X_1$ is an exogenous influence on TC, $X_2$ is some other target of
monetary policy, and $u$ and $v$ are the error terms. A negative coefficient
on TC in the second equation implies sterilization. For example, if
$\beta = -1$, then there is complete sterilization. With the usual assumptions
regarding the error terms we can derive the asymptotic bias of the OLS
estimate of the offset coefficient ($\hat b$). It can be shown that the bias on $\hat b$
is given by the formula¹⁰

$$\operatorname{plim}(\hat b - b) = \frac{\beta(1 - b\beta)\,\sigma_u^2}{\gamma^2\sigma_{X_2}^2 + \sigma_v^2 + \beta^2\sigma_u^2},$$

where $\sigma_u^2$ = the variance of $u$, $\sigma_v^2$ = the variance of $v$, and
$\sigma_{X_2}^2$ = the asymptotic variance of $X_2$. This formula shows first of all
that the bias is negative when $\beta$ is negative (as long as $b\beta$ is less than one,
which it must be for stability). Thus $\hat b$ is biased toward minus one. The
critical question is the magnitude of this bias, and that depends on the
ratio of the variance of the error term in the capital-flow equation, $\sigma_u^2$,
to the variance of $X_2$ and the variance of the error term in the sterilization
equation, $\sigma_v^2$. This means that so long as the capital-flow equation is
well specified and so long as discretionary changes in monetary policy,
as opposed to automatic sterilization, dominate ΔNDA, the bias is likely to
be small. It is important therefore to include dummies to capture large
speculative capital movements, such as occurred in Germany. Otherwise
the sterilization which took place and was reflected in ΔNDA would
seriously bias the offset coefficient toward minus one.¹¹
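The bias argument above can be illustrated with a small simulation. The sketch below is ours and rests on stated assumptions: the bias formula is read as β(1 - bβ)σ_u² / (γ²σ_X2² + σ_v² + β²σ_u²), the intercepts are set to zero, all shocks are drawn as independent standard normals, and the parameter values are hypothetical. It compares the bias predicted by the formula with the bias of an OLS regression of TC on ΔNDA and X_1 in artificial data.

```python
import random


def asymptotic_bias(b, beta, gamma, var_u, var_v, var_x2):
    """plim(b_hat - b) according to the formula as read from the text."""
    denominator = gamma ** 2 * var_x2 + var_v + beta ** 2 * var_u
    return beta * (1.0 - b * beta) * var_u / denominator


def simulated_offset_estimate(b, beta, c=1.0, gamma=1.0, n=50_000, seed=0):
    """OLS estimate of b from artificial data (all shocks have unit variance)."""
    rng = random.Random(seed)
    nda, x1s, tc = [], [], []
    for _ in range(n):
        x1, x2 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
        u, v = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
        # Solve the two structural equations jointly (intercepts set to zero).
        d = (beta * (c * x1 + u) + gamma * x2 + v) / (1.0 - b * beta)
        nda.append(d)
        x1s.append(x1)
        tc.append(b * d + c * x1 + u)

    def cov(p, q):
        mp, mq = sum(p) / n, sum(q) / n
        return sum((pi - mp) * (qi - mq) for pi, qi in zip(p, q)) / n

    s_dd, s_xx, s_dx = cov(nda, nda), cov(x1s, x1s), cov(nda, x1s)
    s_dy, s_xy = cov(nda, tc), cov(x1s, tc)
    # Coefficient on NDA in the regression of TC on a constant, NDA, and X1.
    return (s_dy * s_xx - s_xy * s_dx) / (s_dd * s_xx - s_dx ** 2)


b_true, beta = -0.5, -0.5  # hypothetical offset and sterilization coefficients
predicted = asymptotic_bias(b_true, beta, gamma=1.0, var_u=1.0, var_v=1.0, var_x2=1.0)
estimated = simulated_offset_estimate(b_true, beta) - b_true
print(f"formula bias: {predicted:.3f}, simulated bias: {estimated:.3f}")
```

With a negative β (sterilization) both numbers are negative, so the estimated offset coefficient is pushed toward minus one; passing a positive β to `asymptotic_bias` gives a positive bias, that is, a bias toward zero.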

A related question is that the Central
Bank may not directly control certain components of NDA, such as the
level of commercial bank borrowing from the Central Bank. There is little
that can be done about this short of a comprehensive model of the
monetary sector. It is not unreasonable to assume that the Central Bank can
control NDA if it so decides, but whether it has always done so is quite a
different question. We shall maintain the hypothesis that ΔNDA can be
taken as an approximate measure of monetary policy.


¹⁰ For an analysis of the simultaneous-equations bias, see, for instance, Johnston (1972),
chap. 12.

¹¹ A recent paper by Argy and Kouri (1972) attempts to isolate the offsetting and
sterilization effects by a two-stage procedure. Their results for Germany, Italy, and the
Netherlands indicate partial sterilization and offsetting of similar magnitude to what we
obtain here. The problem with econometric analysis of Central Bank behavior is that in
most cases discretionary changes are likely to dominate automatic and regular response
patterns. As argued above, this tends to reduce the bias when the policy measures are
treated as exogenous.


III. The Results for Germany, Australia, Italy, and the
Netherlands

A. A Brief Summary

The estimates of the capital-flow equation for Germany, Australia, Italy,
and the Netherlands are reported in tables 2 and 3, and the behavior of
the actual and fitted values is illustrated in figures 2-5. Descriptions of the
variables and the data sources are provided in Appendix II. The starting
point of the estimation period has been largely determined by data
availability. The terminal point was dictated by the flotation of exchange
rates in early 1971, except in the case of Australia.

Fig. 2. Germany: total private capital flows, quarterly, 1960:1-1970:4 (source:
table 2, eq. [2]).

Fig. 3. Australia: total private capital flows, quarterly, 1961:3-1972:3 (source:
table 3, eq. [1]).

Fig. 4. Italy: total private capital flows, quarterly, 1964:1-1970:4 (source: table 3,
eq. [2]).

Fig. 5. Netherlands: total capital flows, quarterly, 1960:1-1970:4 (source: table 3,
eq. [3]).

A striking feature of the results is that the change in income is found to
be significant in all four cases, suggesting that quarterly changes in
income create changes in money demand which are at least partially
satisfied by capital flows. The offset coefficient, which measures the
extent to which capital flows offset policy-induced changes in the monetary
base, is statistically significant in all cases. The point estimates are -0.77
for Germany, -0.59 for the Netherlands, -0.47 for Australia, and
-0.43 for Italy. All of these estimates are statistically different from
minus one, which suggests that sterilization is possible in these countries,
at least in the short run, if the Central Bank is willing to accumulate (or
lose) reserves. Similarly, the offset coefficients on the current account are
all statistically significant and very high: -0.99 for Germany, -0.98 for
Italy, -0.98 for the Netherlands, and -0.63 for Australia. In the German
and Australian cases the two offset coefficients are not significantly
different, while for Italy and the Netherlands the offset coefficient on the
current account is substantially higher. We have no good explanation
for these differences.

A disappointing feature of the results is that the change in the
Euro-dollar rate is statistically insignificant in all cases. Although it should
be remembered that the coefficient on the foreign interest rate does not
measure the interest sensitivity of capital flows (which is measured by the
offset coefficient), one would still expect a nonzero coefficient on the
Euro-dollar rate as long as the demand for money is interest sensitive.
There are two possible explanations for this result. One is that the
Euro-dollar rate may not in all cases be the relevant foreign interest rate. The
other, and probably the more important, is that we have
not satisfactorily incorporated the forward market and exchange-rate
expectations into our model. Since the forward premium, in particular
in the case of Germany, has fluctuated considerably and often in a
direction opposite to the Euro-dollar rate, it is possible that its exclusion
tends to suppress the interest rate coefficient. Speculation dummies
turned out to be significant in the case of Germany and Australia, while
no dummies introduced on a priori grounds were statistically significant
in the case of Italy and the Netherlands.

The overall explanatory power of the equations is quite satisfactory.
The Durbin-Watson statistic indicates no serial correlation in the case of
Germany and Australia, while it falls in the inconclusive region in the two
other cases. A short discussion of the individual country results is provided
in the next four sections.


B. Germany 

Two separate policy instruments were incorporated into the equation:
the policy-induced change in average required reserves (ΔRR) and the
change in the Bundesbank's net stock of domestic assets (ΔNDA). The
difference of these two corresponds to the domestic component of the
extended monetary base. The other variables are the change in income,
the change in the 3-month Euro-dollar rate, the current account balance
plus official capital flows, and a number of dummies designed to capture
the effect of speculative capital flows. In addition, we include a dummy
which takes care of the pronounced end-of-year capital inflow and the
subsequent outflow.

As is shown in table 2, the offset coefficients on the two monetary
instruments are both very high, 0.74 for ΔNDA and 0.86 for ΔRR.¹²
As expected, these coefficients are not significantly different from each
other. When we combine them we obtain an offset coefficient of 0.77 on
the composite monetary instrument (ΔNDA - ΔRR) with a standard
error of 0.04. While the coefficient is high, it is significantly different
from one (for brevity we do not always mention the sign; it should be
remembered that all the offset coefficients are negative). This means that
the Bundesbank can exercise some degree of monetary independence
in the short run if it is willing to accept the reserve movements which that
policy entails. This point is illustrated by the following example. Suppose
that the Bundesbank wants to decrease the monetary base by 100 million
marks. This would require a reduction of 435 million marks in NDA,
and it would imply an increase in foreign exchange reserves of 335 million
marks.

¹² These results are similar to those obtained earlier by Porter (1972) by using monthly
data.
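The arithmetic behind the Bundesbank example can be sketched as follows. The sketch assumes, as a simplification of ours, that the change in the monetary base equals the change in NDA plus the induced change in reserves, and that the induced reserve change is minus the offset coefficient (0.77 in absolute value) times the change in NDA, all else held constant. The function name is ours.

```python
def required_nda_change(target_base_change, offset):
    """Change in NDA needed for a target change in the monetary base.

    Assumes base change = NDA change + reserve change, with the reserve
    change equal to -offset * NDA change (other influences held constant).
    """
    return target_base_change / (1.0 - offset)


offset = 0.77                                # absolute value of the German offset coefficient
d_nda = required_nda_change(-100.0, offset)  # millions of marks
d_reserves = -offset * d_nda                 # induced capital inflow, millions of marks
print(round(d_nda), round(d_reserves))
```

Rounded to the nearest million, this gives a reduction of 435 in NDA and a rise of 335 in reserves, leaving the base lower by 100.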

The offset coefficient on the current account is also significant and
close to one. It is not, however, statistically different from the offset
coefficient on (ΔNDA - ΔRR). The high value of the coefficient on CAB
suggests that private capital flows have to a large extent financed
fluctuations on the current account. Thus the behavior of the overall balance
of payments is dominated by the factors that influence capital flows.
Monetary policy is one of these factors. Income fluctuations are another.
The income coefficient indicates that an increase in income at an annual
rate of 1 billion marks will generate capital inflows of some 110 million
marks. The income variable partly explains the clear procyclical pattern
of the overall balance of payments. The other reason for this pattern is the
countercyclical behavior of monetary policy as reflected in the movements of
ΔNDA.

The Euro-dollar rate is not significant in any of the equations. We tend
to think that this is the result of the inadequate treatment of
exchange-rate expectations. In future research it would be important to incorporate
the forward market into the model to tackle exchange-rate uncertainty
more directly than has been possible in this study.

Three dummies introduced on the basis of a priori knowledge of speculative
episodes turned out to be significant. The first dummy captures the
speculative inflow and outflow surrounding the March 1961 revaluation
of the Deutschmark. The second dummy takes care of the speculative
inflow in the fourth quarter of 1968, which was due to an expectation of
a revaluation of the Deutschmark. The third dummy captures the inflow
and outflow surrounding the second revaluation of the Deutschmark in
October 1969.

The magnitude of the last dummy appears rather small compared
with the magnitude of capital movements in the last two quarters of 1969.
One reason for this may be that the speculative inflow extended over a
longer time period than the outflow just after the revaluation. The other
reason is that if the Central Bank does not attempt to sterilize
the liquidity effect of the inflow (by changing ΔNDA), the inflow will depress
domestic interest rates, which will have an opposite effect on the net
magnitude of capital flows. The coefficient on the speculation dummy
measures the combined effect of these two factors.

Finally, equations (6) and (7) report the results that we obtain by
treating long-term capital flows as an exogenous variable. The results are
remarkably similar, indicating that in the short run the adjustment takes
place largely through short-term flows. An examination of the data also
reveals that the variation on the long-term capital account is substantially
less than that on the short-term account.



C. Australia

Australia has experienced sustained private capital inflow over the period
studied, 1961-72, and there have been substantial variations in the
composition and magnitude of these flows. Our model does reasonably
well in explaining these movements. The offset coefficient on ΔNDA,
which has been adjusted for changes in required reserves (statutory
reserve deposits), is 0.47, while that on the current account is 0.68. They
are not statistically different, and both are substantially less than the
coefficients we obtained for Germany. The income variable is also
significant, a rise in income of $1 billion Australian ($A) generating
capital inflows of $57 million. The Euro-dollar rate is again insignificant,
although it has the correct sign. We include a dummy variable to allow
for speculative inflows in 1971 and 1972. The coefficient of $A216 million
suggests that speculative inflows in this period amounted to a total of
some $A1.5 billion (see table 3).


D. Italy

Capital flows, typically outflows, from Italy over this period have varied
considerably. The model appears to explain these variations quite well,
capturing all the major swings during the period studied. All variables
except the Euro-dollar rate are significant. The offset coefficient on ΔNDA
is 0.43 with a standard error of 0.099. The offset coefficient on the current
account, also significant, is much higher, being close to one. This
contradicts what we would expect from the model. The income
variable is also significant. It is interesting to note that Branson and Hill
(1971) found that the gap between potential and actual GNP is highly
significant in explaining Italian capital flows in the framework of their
model. In addition, we experimented with a number of speculative
dummies, but these turned out to be insignificant.


E. The Netherlands

The results for the Netherlands are similar to those obtained for Australia
and Italy. The offset coefficient is 0.58 with a standard error of 0.078.
The offset coefficient on the current account is again close to one and
statistically greater than that on ΔNDA. In terms of the model the sharp
increase in net capital inflow in the last quarter of 1969 and in 1970 can
be explained by the rapid increase in income, large deficits on the current
account, the decline in the domestic component of the monetary base, and
a fall in the Euro-dollar rate in 1970.
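The statement that the offset coefficients differ significantly from minus one can be illustrated with the point estimates and standard errors quoted in the country sections. The sketch below computes a simple t-ratio against the null value of minus one; this is an assumption about the form of the test, which the text does not spell out, and Australia is omitted because no standard error is quoted for it in the text.

```python
def t_against(estimate, std_error, null_value):
    """t-ratio for the hypothesis that the coefficient equals null_value."""
    return (estimate - null_value) / std_error


# (offset coefficient, standard error) as quoted in the country sections
reported = {
    "Germany": (-0.77, 0.04),
    "Italy": (-0.43, 0.099),
    "Netherlands": (-0.58, 0.078),
}

for country, (coef, se) in reported.items():
    # Distance of the estimate from complete offset (-1), in standard errors.
    print(country, round(t_against(coef, se, -1.0), 2))
```

In each case the estimate lies more than five standard errors above minus one, which is consistent with the conclusion that some scope for sterilization remains in the short run.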



IV. Conclusions


A most interesting aspect of our results is that changes in income are
highly significant in explaining capital flows in all cases. This provides
confirmation of the view that the behavior of capital flows reflects
divergent growth rates and cyclical phases between countries, with capital
flows accommodating the resultant fluctuations in money demand.
Accordingly, while it may appear that capital movements largely respond
to changes in interest differentials between countries, a closer analysis
reveals that the more basic sources of capital flows are fluctuations in
money demand and supply, with income fluctuations being important in
explaining shifts in money demand.

Our results demonstrate that for all four countries capital flows are in
large part the result of changes in monetary policy, as reflected in changes
in the net domestic assets of the Central Bank and in required reserves.
The current-account balance enters our capital-flow equations as an
exogenous source of change in base money. We find that the
current-account balance tends to induce offsetting capital flows, thereby stabilizing
the balance of payments.

This means that the behavior of the overall balance of payments is
dominated by factors that influence capital flows, that is, changes in
income (with a positive sign), changes in the world interest rate, changes
in exchange-rate expectations, and changes in monetary policy
instruments.

Our results also indicate that while monetary policy has a strong effect
on the capital account in all four countries, there is still some room for
independent monetary policy, if only in the short run, if the Central Bank
is willing to tolerate the large reserve movements which that policy
entails.


Appendix I

The Bias in the Branson Model

Consider a simplified version of the Branson model: $\Delta B'' = a_0 + a_1\,\Delta R + a_2 Z_1 + u$,
where $\Delta R$ is the change in the domestic interest rate, $Z_1$ is an exogenous
variable explaining capital flows, and $u$ is the error term. Suppose that capital
flows affect interest rates, that is, $\Delta R = b_0 + b_1\,\Delta B'' + b_2 Z_2 + v$, where $Z_2$ is
an exogenous variable explaining the changes in $\Delta R$, and $v$ is the error term. We
make the usual assumptions regarding the error terms. It can be shown that the
asymptotic bias of the OLS estimate of $a_1$ is given by¹³

$$\operatorname{plim}(\hat a_1 - a_1) = \frac{b_1(1 - a_1 b_1)\,\sigma_u^2}{b_2^2\sigma_{Z_2}^2 + \sigma_v^2 + b_1^2\sigma_u^2} < 0,$$

where $\sigma_{Z_2}^2$ = asymptotic variance of $Z_2$, assumed to be finite, $\sigma_u^2$ = variance
of $u$, and $\sigma_v^2$ = variance of $v$. It is assumed that the asymptotic covariances
between $Z_1$, $Z_2$, $u$, and $v$ are zero.

To obtain some definite conclusions regarding the magnitude of the bias it is
useful to note that as the degree of capital mobility, as measured by $a_1$, approaches
infinity, the absolute bias approaches infinity. What this means is that the higher
the degree of capital market integration the less appropriate is the OLS estimate
of the interest elasticity of capital flows, since the estimated coefficient becomes
severely downward biased. Furthermore, the OLS estimate of $a_2$ is biased
downward. Its asymptotic bias is given by

$$\operatorname{plim}(\hat a_2 - a_2) = -\frac{a_2 b_1^2\,\sigma_u^2}{b_2^2\sigma_{Z_2}^2 + \sigma_v^2 + b_1^2\sigma_u^2} < 0.$$

This bias does not depend on $a_1$; hence for large values of $a_1$
the bias on $\hat a_1$ is larger than that on $\hat a_2$.

¹³ See Johnston (1972), chap. 12. A detailed analysis of sources of least-squares bias is
given in Bronfenbrenner (1953).
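The claim that the absolute bias grows without bound as capital mobility rises can be illustrated numerically. The sketch below assumes the bias expression for the interest-rate coefficient is read as b_1(1 - a_1 b_1)σ_u² / (b_2²σ_Z2² + σ_v² + b_1²σ_u²); the function name and all parameter values, including the negative b_1, are hypothetical and chosen only for illustration.

```python
def branson_bias(a1, b1, b2, var_u, var_v, var_z2):
    """Asymptotic bias of the OLS estimate of a1, under the reading given above."""
    denominator = b2 ** 2 * var_z2 + var_v + b1 ** 2 * var_u
    return b1 * (1.0 - a1 * b1) * var_u / denominator


# Hypothetical values: capital inflows lower the domestic interest rate (b1 < 0).
for a1 in (1.0, 10.0, 100.0, 1000.0):
    bias = branson_bias(a1, b1=-0.5, b2=1.0, var_u=1.0, var_v=1.0, var_z2=1.0)
    # The bias is negative and grows in absolute value with a1.
    print(a1, bias)
```

Under these assumptions the bias is linear in a_1, so it becomes arbitrarily large in absolute value as the true interest sensitivity of capital flows increases.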

Appendix II

Symbols and Sources of Data Used in Regressions

TC = total private capital flows (including official capital flows for the
Netherlands), from IMF data fund,

STC = total short-term private capital flows (including official capital flows
for the Netherlands), from IMF data fund,

CAB = current account balance (including official capital flows for Germany,
Australia, and Italy), from IMF data fund,

R* = 3-month Euro-dollar rate, from International Financial Statistics (IFS),

Y, Germany = mass income at an annual rate, from the Bundesbank's monthly
reports (BMR),

Y, Australia = national income, from IFS,

Y, Italy = GDP at market prices, generated from leading indicators reported in
the OECD Main Economic Indicators,¹⁴

Y, the Netherlands = GDP at market prices, generated from leading indicators
reported in the OECD Main Economic Indicators,

NDA, Germany, Italy, and the Netherlands = net domestic assets of the Central
Bank, from the IFS,

NDA, Australia = net domestic assets of the Central Bank less restricted deposits,
from the IFS,

ΔRR, Germany = change in required reserves, computed according to the
formula $\Delta RR = (r - r_{-1})L_{-1}$, where r is the average
required reserve ratio and L is the total stock of bank liabilities
subject to reserve requirements, from BMR.
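The required-reserves formula for Germany can be sketched in a few lines. The reading of the formula as ΔRR = (r - r_{-1})L_{-1} follows the definition above; the series of reserve ratios and liabilities below are invented for illustration and are not Bundesbank data.

```python
def policy_induced_reserve_change(ratios, liabilities):
    """Return (r_t - r_{t-1}) * L_{t-1} for each period after the first."""
    return [
        (ratios[t] - ratios[t - 1]) * liabilities[t - 1]
        for t in range(1, len(ratios))
    ]


# Hypothetical quarterly data: average required reserve ratio and
# reservable bank liabilities (say, in millions of marks).
ratios = [0.10, 0.11, 0.11, 0.09]
liabilities = [1000.0, 1040.0, 1100.0, 1150.0]

print(policy_induced_reserve_change(ratios, liabilities))
```

Valuing the ratio change at the previous period's liabilities isolates the policy-induced part of the change in required reserves from the part that simply reflects growth in deposits.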

¹⁴ Since there are no quarterly data on income for Italy and the Netherlands, a
quarterly GDP series was generated from some leading indicators available on a quarterly
basis. For Italy the value of retail sales and the value of industrial production (industrial
production index times the consumer price index) were used as indicators. The regression
of annual GDP on the annual averages of these two variables (X and Y, respectively) for
the period 1960-70 yielded

GDP = -5208.0 + 346.1X + 0.712Y
      (0.53)    (9.58)   (1.56)

R² = .999   SE = 307.0   D-W = 2.063

For the Netherlands the quarterly series on the value of investment was in addition used
as an indicator. The annual relationship between GDP at market prices and these
indicators for the period 1960-70 is

GDP = 2046 0 + 994^X + 166.9 F -I- 237.3Z 
(0.94) (2.62) (2.33) (3.87)

R² = .998   SE = 743.0   D-W = 2.606

The data on X, Y, and Z were obtained from the OECD Main Economic Indicators.



References


Argy, V., and Kouri, P. J. K. “Sterilization Policies and Volatility in
International Reserves.” Paper presented at the Wingspread Conference on
“National Monetary Policies and the International Monetary System,”
Racine, Wis., July 1972.

Branson, W. H. Financial Capital Flows in the U.S. Balance of Payments. Amsterdam:
North-Holland, 1968.

Branson, W. H., and Hill, R. D. “Capital Movements in the OECD Area.”
OECD Economic Outlook, Occasional Studies (December 1971).

Bronfenbrenner, J. “Sources and Size of Least-Squares Bias in a Two-Equation
Model.” In Studies in Econometric Method, edited by W. C. Hood and T. C.
Koopmans. Cowles Foundation Monographs, no. 14. New York: Wiley, 1953.

Bryant, R. C., and Hendershott, P. H. Financial Capital Flows in the Balance of
Payments of the United States: An Exploratory Empirical Study. Princeton Studies in
International Finance, no. 25. Princeton, N.J.: Princeton Univ., Internat.
Finance Sec., 1970.

Floyd, J. E. “International Capital Movements and Monetary Equilibrium.”
A.E.R. 59, no. 4, pt. 1 (September 1969): 472-92.

Hodjera, Z. “Short-Term Capital Movements of the United Kingdom,
1963-67.” J.P.E. 79 (July/August 1971): 739-75.

Hodjera, Z. “International Short-Term Capital Movements: A Survey of Theory and
Empirical Analysis.” IMF Staff Papers (forthcoming).

International Financial Statistics. Various issues. Washington: Internat. Monetary
Fund.

Johnson, H. G. “The Monetary Approach to Balance of Payments Theory.”
Unpublished monograph, 1971.

Johnson, H. G. Inflation and the Monetarist Controversy. Amsterdam: North-Holland, 1972.

Johnston, J. Econometric Methods. New York: McGraw-Hill, 1972.

Leamer, E. E., and Stern, R. M. “Problems in the Theory and Empirical
Estimation of International Capital Movements.” In International Mobility and
Movement of Capital, edited by Fritz Machlup, Walter S. Salant, and Lorie
Tarshis. New York: Nat. Bur. Econ. Res., 1972.

Lee, C. H. “A Stock Adjustment Analysis of Capital Movements: The United
States-Canada Case.” J.P.E. 77 (July/August 1969): 512-23.

McKinnon, R. I. “Portfolio Balance and International Payments Adjustment.”
In Monetary Problems of the International Economy, edited by R. A. Mundell and
A. Swoboda. Chicago: Univ. Chicago Press, 1969.

McKinnon, R. I., and Oates, W. E. The Implications of International Economic
Integration for Monetary, Fiscal, and Exchange Rate Policy. Princeton Studies in
International Finance, no. 16. Princeton, N.J.: Princeton Univ., Internat.
Finance Sec., 1966.

Michaely, M. The Responsiveness of Demand Policies to Balance of Payments: Postwar
Patterns. New York: Nat. Bur. Econ. Res., 1971.

Miller, N. C., and Whitman, M. “A Mean-Variance Analysis of United States
Long-Term Portfolio Foreign Investment.” Q.J.E. 84 (May 1970): 173-96.

Mundell, R. A. International Economics. New York: Macmillan, 1968.

OECD Main Economic Indicators. Various issues. Washington: Org. Econ.
Cooperation and Development.

Polak, J. J. “Monetary Analysis of Income Formation and Payments Problems.”
IMF Staff Papers 6 (March 1958): 1-50.

Polak, J. J., and Argy, V. “Credit Policy and the Balance of Payments.” IMF
Staff Papers 18 (March 1971): 1-24.

Porter, M. G. “A Theoretical and Empirical Framework for Analyzing the Term
Structure of Exchange Rate Expectations.” IMF Staff Papers 18 (November
1971): 613-45.

Porter, M. G. “Capital Flows as an Offset to Monetary Policy: The German Case.”
IMF Staff Papers 19 (July 1972): 395-424.

Tobin, J. “A General Equilibrium Approach to Monetary Theory.” J. Money,
Credit and Banking 1 (February 1969): 15-29.




Taxation and the Rates of Return from 
Some U.S. Asset Holdings Abroad, 
1960-1969 


Herbert G. Grubel 

Simon Fraser University


The paper shows theoretically that U.S. tax laws are designed to induce
equalization of private rates of return from investment in the United
States and abroad but that, unless the marginal government expenditures
on investment equal tax payments by the investment, there is a
difference between private and social rates of return to U.S. foreign
investment. Calculations covering the period 1960-69 for developed
countries and U.S. holdings of bonds, equities, and direct investment in
manufacturing showed that private rates of return generally were
positive but that social rates of return from investing abroad rather than
domestically were strongly negative, assuming zero marginal government
expenditures and disregarding all externalities.


In the past the basic philosophy underlying U S ttix laws and other 
regulations guiding international long-term capital flows have been to 
encourage the maximization of world income The policies flowing from 
this philosophy by and large enabled capital to move anywhere in the 
world where pnvate owners could earn highest returns ‘ 

An important question facing the U S government and pubhc in the 
1970s IS whether the internationalist philosopy that has guided ptist 
policies should be followed in the future This question is raised by de¬ 
mands for change from two important sources First, U S labor, which 
believes that foreign investment results in a loss of domesuc employment 

I received much assistance in the preparation of foreign rates of taxation from the
International Tax Affairs Staff of the U.S. Treasury Department, headed by N. Gorden.
Valuable comments on an earlier draft of this paper were made by G. Kopitz, P. Gray,
H. Shapiro, M. Feinberg, G. Haberler, and other participants at a seminar at the U.S.
Treasury Department, by an unknown referee of this Journal, and by my colleague
Edward Tower.

¹ For a review of the issues surrounding the internationalist policies see Krause and
Dam (1967) and Musgrave (1969). These books also provide references to the original
contributions to the field.




opportunities and reduces the growth of wages, demands that the U.S.
government “curb the substantial outflow of American capital” (Goldfinger
1971, p. 928). Second, some major foreign governments have decided that
long-term capital inflows from the United States are undesirable because
they lead to “takeover” of their industries² and induce unemployment,
causing deficits on current account, or result in an unwanted increase in
official dollar holdings.

In a scholarly analysis prepared for the Joint Economic Committee of
Congress, Musgrave (1972, pp. 177-78) concluded: “There are sufficient
doubts about the effects of foreign investment on the U.S. economy to
lead to the conclusion that the U.S. tax treatment of foreign investment
income should be reviewed and reevaluated. This applies especially to
deferral, but consideration may also be given to limiting the present
credit for foreign taxes to less than 100 per cent. Such measures would
not be incompatible with opposition to trade restrictions. Indeed, they
might be supportive of free trade policy.”

The present paper has been stimulated by this controversy over the
merits of international investment in general and the effects of taxation
in particular. It attempts to shed light on the issues by presenting some
estimates of the after-tax rates of return yielded by some forms of U.S.
asset holdings abroad during the period 1960-69. In a rational decision
about future U.S. policies, presumably such a rate of return should carry
important weight.

This paper represents a pioneering effort, and its primary aim is to
sort out some theoretical issues and provide a rough first estimate. No
other studies of this sort seem to exist for the United States, though the
broader issues of the efficiency aspects of international taxation agreements
have been studied theoretically (Kindleberger 1968; Musgrave 1969)
and estimates of the subsidies as dollar flows have been prepared
(Musgrave 1972). The overall welfare effects of foreign investment have
received a great deal of attention, and a ready set of references to these
can be found in Reddaway (1968) and Dunning (1970). Jenkins (1972,
chap. 4) estimates Canada’s taxation gains from foreign-owned capital in
Canada.

By concentrating the present study on the effects of taxation on the
returns from only some forms of U.S. asset holdings, we neglect many
considerations which should be weighed in the final decision about the
future of international tax agreements. Thus, we disregard the effects of
capital flows on the terms of trade, income distribution, externalities, the
efficiency of employment policies, and the benefits from the diversification

² Significant political influence along these lines has long been felt in Canada and
France. Leftist factions in developing countries have a tradition of considering U.S.
investments evidence of U.S. imperialism.




of assets.³ Furthermore, the study does not consider the returns from such
U.S. assets as bank loans, investment in oil and mining, and the tax and
other benefits from foreign asset holdings in the United States.

In Part I current regulations concerning foreign investments are
discussed briefly. Parts II and III contain estimates of returns from foreign
bond and equity holdings and direct investment in manufacturing,
respectively. The paper closes with a brief summary and conclusions.


I. The Taxation of Foreign Income

The major feature of the U.S. taxation of income from assets held abroad
is the tax offset provision of international taxation agreements in existence
with all major Western countries of the world. These tax provisions
permit a U.S. taxpayer upon repatriation of his earnings to deduct from
his tax liabilities to the U.S. government all taxes paid to foreign governments.
Assuming for a moment that all earnings are repatriated during
the period in which they accrue, the U.S. tax rate on foreign investment
income, $t_u$, is $t_d - t_f$, where $t_d$ and $t_f$ are the domestic tax rate on U.S.
income and the foreign tax rate, respectively. In cases where the foreign
exceeds the domestic rate and $t_u$ therefore becomes negative, the taxpayer
receives no explicit U.S. tax refund on a return from that country and
$t_u = 0$ whenever $t_f \ge t_d$. However, because the taxpayer has the option of combining
returns from several countries, the overpayments in some countries can
be used effectively to reduce obligations for U.S. tax payment on income
from other countries, and in practice a refund is provided for overpayment
to some foreign governments.

As a result of the tax provision, the net private rate of return from
foreign rather than domestic investment ($R_p$) is

$$R_p = r_f(1 - t_f - t_u) + r_x - r_d(1 - t_d), \tag{1}$$

where $r_f$ and $r_d$ are the foreign and domestic rates of return on
investment, respectively, and $r_x$ is the nontaxable capital value change due
to exchange rate revaluations expressed as an annual rate of return. Under
the assumptions that $t_f \le t_d$, implying $t_f + t_u = t_d$, and that $r_x = 0$,
the net private after-tax rate of return from foreign investment is positive
($R_p > 0$) whenever

$$r_f > r_d, \tag{2}$$

and it becomes zero when $r_f = r_d$. These conditions are well known to be
a necessary requirement for the efficient international allocation of
capital, though it should be noted that, because of the offset provisions

³ The classical theoretical article is MacDougall (1960). The gains from international
diversification have been discussed in Grubel (1968) and Grubel and Fadner (1971).





mentioned in the preceding paragraph, if $t_f > t_d$ and there are no tax
credits from other countries, it is possible that $R_p < 0$ even though
$r_f > r_d$ and $r_x = 0$. Therefore, the U.S. tax laws need not provide the
necessary condition for the efficient international allocation of capital.
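The private-return calculation can be sketched in a few lines of Python. This is a minimal illustration, assuming the residual U.S. tax rate on foreign income equals the domestic rate minus the foreign rate, floored at zero (no refund and no pooling of credits across countries). The function names and the numerical rates are hypothetical and are not taken from the paper.

```python
def us_tax_on_foreign_income(t_d, t_f):
    """Residual U.S. tax rate on foreign income under a full credit, no refund."""
    return max(t_d - t_f, 0.0)


def private_return(r_f, r_d, t_f, t_d, r_x=0.0):
    """Net private return from foreign rather than domestic investment, eq. (1)."""
    t_u = us_tax_on_foreign_income(t_d, t_f)
    return r_f * (1 - t_f - t_u) + r_x - r_d * (1 - t_d)


# Hypothetical rates, chosen only for illustration.
low_foreign_tax = private_return(r_f=0.12, r_d=0.10, t_f=0.30, t_d=0.40)
high_foreign_tax = private_return(r_f=0.12, r_d=0.10, t_f=0.55, t_d=0.40)

print(f"foreign tax below U.S. rate: {low_foreign_tax:+.4f}")
print(f"foreign tax above U.S. rate: {high_foreign_tax:+.4f}")
```

With the foreign tax rate below the domestic rate, the sign of the result depends only on the pre-tax differential. With the foreign rate above the domestic rate and no credits elsewhere, the return can turn negative even though the foreign pre-tax return is higher.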

Let us now define the social rate of return from foreign investment
($R_s$) as

$$R_s = r_f(1 - t_f) + r_x - r_d + g_d, \tag{3}$$

where $g_d$ is the government expenditure necessitated by domestic investment
but not charged to the business firm directly, expressed as an annual
rate of expenditure per \$100 investment. Under the assumption that $r_x$
and $g_d$ are zero, the social rate of return is positive ($R_s > 0$) only if

$$\frac{r_f - r_d}{r_f} > t_f,$$

which means that foreign investment is socially profitable only if the
foreign rate of return exceeds the domestic rate by a percentage larger
than the foreign tax rate. The analysis also implies that foreign investment
is privately profitable but results in a socially negative rate of return
whenever $r_f(1 - t_f) < r_d < r_f$.
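The gap between private and social profitability can be illustrated with a short Python sketch of the social-return definition. It assumes, as the text does, that the exchange rate term and the government expenditure term are zero unless supplied. The helper names and the sample rates are hypothetical.

```python
def social_return(r_f, r_d, t_f, r_x=0.0, g_d=0.0):
    """Social return from foreign rather than domestic investment, eq. (3)."""
    return r_f * (1 - t_f) + r_x - r_d + g_d


def private_gain_social_loss(r_f, r_d, t_f):
    """True when r_f(1 - t_f) < r_d < r_f, with r_x = g_d = 0 and t_f <= t_d."""
    return r_f * (1 - t_f) < r_d < r_f


def break_even_foreign_return(r_d, t_f):
    """Foreign pre-tax return at which the social return is exactly zero."""
    return r_d / (1 - t_f)


# Hypothetical rates, chosen only for illustration.
r_f, r_d, t_f = 0.12, 0.10, 0.30
print(f"social return: {social_return(r_f, r_d, t_f):+.4f}")
print("privately profitable, socially negative:",
      private_gain_social_loss(r_f, r_d, t_f))
print(f"break-even foreign return: {break_even_foreign_return(r_d, t_f):.4f}")
```

In this illustration a foreign return of 12 percent beats a domestic return of 10 percent privately, yet the foreign tax of 30 percent leaves the home country with less than the domestic investment would have yielded.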

The task of the next parts of this paper is to estimate $R_p$ and $R_s$ using
the variables shown on the right-hand side of equations (1) and (3).
However, it will be impossible to provide any estimates of $g_d$, the savings
of U.S. government expenditure realized by the location of the investment
abroad, and $g_d$ will be assumed to be zero. It is difficult to know reliably
the magnitude of the bias introduced by this assumption. However, it
seems clear that services provided free by governments specifically to
private investment projects are a relatively minor proportion of all
government expenditures, because most government services such as
public utilities and judicial services are provided for a fee. At the same
time, such free services as roads and public education to the work force
would most often be the same with and without the existence of the
investments. Final judgment of these matters must await more detailed
theoretical analysis and some empirical case studies.

The preceding analysis was set out by assuming that all earnings are
repatriated in the period of accrual, and in the remainder of this paper we
will retain this assumption. We thus disregard the important fact that
under present tax laws, whenever the foreign is below the U.S. tax rate,
the payment of the remaining U.S. tax can be deferred until the profits
are repatriated. The owners of foreign-reinvested profits thus enjoy the
benefits of a tax-free loan. Musgrave (1972, p. 192) has estimated that the
revenue loss to the U.S. Treasury from this provision came to about $900
million in the year 1970 for the case of foreign-incorporated subsidiaries
of U.S. corporations.

The arguments over the merits of this deferral provision exceed the 





scope of the present paper. The primary economic impact of the implicit
subsidy is an inefficiently high level of foreign investment, the welfare
implications of which take the form of consumer surplus.⁴


II. The Data and Rates of Return on Bonds and Equities 

The empirical study of private and social rates of return to U.S. asset
holdings abroad is based on average rates of return during the period
1960-69. This decade was characterized by the greatest degree of freedom
from government restrictions in foreign exchange markets since the end
of the Second World War and by rapid economic growth and capital
formation throughout the world. Of course, the results of this study would
have been different if any other period had been analyzed. This fact does
not invalidate the calculations made, but limits the inferences we can
draw about private and social profitability of foreign asset holdings over
longer past periods and about the future. Dunning (1970, p. 83) has
pointed to a difficulty associated with considering a period of rapid
growth in foreign investments. Since it is known that investments tend to
yield a lower rate of return initially than over the longer run, average
rates of return during periods of rapid growth are biased downward. We
have no way of quantifying this bias and will disregard it in the subsequent
discussion.


Overview of Total U.S. Foreign Investment Position

Before we turn to a detailed analysis of rates of return, it is useful to present
an overview of the total U.S. foreign investment position in long-term
assets in order to put into the proper perspective the asset categories to
which the following estimates had to be limited. For this purpose table 1
has been compiled. The upper half of the table, showing U.S. assets,
reveals in column 4 that the overwhelming proportion of assets
held abroad is in the form of direct investment (73.7 percent), followed by
bonds (12.2 percent), corporate stocks (7.2 percent), and bank loans and
other assets (6.9 percent). In column 3 we see that net growth during the
decade 1960-69 was greatest in direct investments (122.1 percent) and in
overall investments (116.1 percent).

In the last row of the upper half of table 1 we show a rough regional
breakdown of U.S. assets. Canada is the single most important location of
U.S.-held foreign assets, with 33.9 percent of the total, followed by Western
Europe (27.8 percent) and the rest of the world (38.3 percent). It is
interesting to note that Canadian bonds and stocks represent 59 and 49

⁴ For an excellent recent study of the welfare cost of U.S. trade restrictions using this
approach and providing the relevant references to the literature, see Magee (1972).




[Table 1: not recoverable from this copy. Surviving source note: U.S. Department of Commerce, Survey of Current Business.]

percent, respectively, of the U.S. holdings in these classes. European
corporate stocks also represent a significant 40.5 percent of the total. The
rest of the world has issued a disproportionately large amount of liabilities
to U.S. banks (55.3 percent of the total U.S. bank loans). About one-third
of these bank loans were to Japan.

The bottom half of table 1 shows the level, composition, and growth of
foreign holdings of U.S. assets. The most important conclusion from this
set of data is that overall U.S. assets were 2.41 and 2.34 times the level
of liabilities in 1960 and 1969, respectively. However, this average hides
regional divergence, which produces a 1969 ratio of 5.39 for Canada,
1.23 for Europe, and 2.75 for the rest of the world. Of further interest is the
fact that for all countries U.S. corporate stocks represent the favorite form
of asset, 44.3 percent, followed by direct investment, 28.8 percent. This
relationship is dominated by an overwhelming European preference of
about 3:1 of stocks over direct investment. Canadians hold roughly equal
shares of U.S. direct investment and corporate stocks.

The following empirical analysis is limited to the estimation of rates of
return from U.S. holdings of foreign assets in the form of bonds,
corporation stocks, and direct investments. This partial approach has the
serious shortcoming that the results do not provide information about the
social profitability to the United States of adhering to the principle of
not restricting international capital flows and partaking in a worldwide
system of double taxation agreements. In order to provide an estimate of
this broad social profitability, it would be necessary to consider the
benefits and costs arising from all forms of foreign assets and liabilities,
long and short term, private and public, just as it is necessary to study all
sources of income and costs to estimate the profitability of a private bank
or business.

This problem of partial estimates is particularly important, since the
U.S. government tax revenues lost from U.S. holdings abroad are partly
offset by the U.S. taxation of foreign investment income. For these
reasons, the present study must not be interpreted as providing an answer
to the broad problem of U.S. benefits and costs from adhering to the
internationalist principle in foreign investment policies. The empirical
estimates to be presented represent merely an input, though an important
input, in such a broader calculus.


Portfolio Investments 

Portfolio investments are assets which do not confer control over the
operation of the issuing business enterprise. Qualifying as such are bonds
and equities representing no more than 10 percent of the corporation’s
outstanding stocks.




Calculations on the rates of return to bonds and stocks were limited to
Canada and Western Europe, the major issuers of U.S.-held assets. Data
on U.S. foreign portfolio holdings do not disaggregate “Western Europe”
by individual countries. Average Western European rates of return were
calculated by weighting individual country data by the value of U.S.
manufacturing investment at the end of 1969, for which country data are
available.
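The weighting scheme just described amounts to an investment-weighted mean. The sketch below shows the method only: the country labels, yields, and investment weights are hypothetical placeholders, since the country-level inputs are not reported in the text.

```python
def weighted_average(values, weights):
    """Return the weighted mean of `values`, keyed like `weights`."""
    total_weight = sum(weights[key] for key in values)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return sum(values[key] * weights[key] for key in values) / total_weight


# Hypothetical inputs for illustration only (not data from the paper).
yields_pct = {"Country A": 6.0, "Country B": 7.0, "Country C": 5.0}
us_manufacturing_investment = {"Country A": 50.0, "Country B": 30.0, "Country C": 20.0}

europe_average = weighted_average(yields_pct, us_manufacturing_investment)
print(f"Investment-weighted average yield: {europe_average:.2f} percent")
```

Weighting by manufacturing investment is a proxy: it stands in for the unknown country composition of U.S. portfolio holdings.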


Bonds 

The U.S. government does not possess information on the actual
composition or yield of U.S. private bond holdings abroad. For this reason it
was necessary to assume that the holdings consisted entirely of foreign
government bonds with a maturity over I years and that the opportunity
cost of these holdings was equal to the long-term yield on U.S. government
bonds. These assumptions produce unbiased results as long as (1) the
yield differentials between government and private bonds are equal in
the United States and the rest of the world, (2) U.S. wealth holders
purchase equal proportions of government and private bonds abroad as
they would if they had invested domestically, (3) the maturity
composition of foreign portfolios is the same as the domestic portfolio would
have been and there are no international differences in yield curves, and
(4) income from foreign government bonds is taxable at the normal rate.⁵
Refinements of the estimates will have to await information on the
accuracy of these hypothetical propositions about the real world.

Row 1 of table 2 shows the foreign bond yields ($r_f$) to have been 5.89
and 6.28 percent for Canada and Western Europe, respectively.
Comparison of these yields with the U.S. yield ($r_d$) shown in row 8 as 4.51
percent reveals a positive net private profitability from foreign bond
holdings. The foreign withholding taxes ($t_f$) of 15.0 and 3.2 percent
(row 2) lower after-tax foreign yields to 5.01 and 6.08 percent (row 3)
for Canada and Western Europe, respectively. Average exchange rate
losses during the period ($r_x$) shown in row 4 and applied to the proportion
of non-dollar-denominated bonds (row 5) reduce the after-tax foreign yields
to 4.92 and 6.06 percent, respectively, for Canada and Western Europe
(row 7). After adjustment for the domestic opportunity cost of these funds,
the net social rate of return ($R_s$) is estimated as having been 0.41 and 1.55
percent for Canada and Western Europe, respectively.
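The chain of adjustments behind table 2 can be retraced with a short Python sketch. The inputs are the figures quoted in the text. The function name and the order of operations (withholding tax first, then the exchange rate change applied only to the non-dollar share, then the U.S. yield subtracted) are an interpretation of the description above, not code from the paper.

```python
def bond_returns(foreign_yield, withholding_tax_pct, fx_change, dollar_share_pct,
                 us_yield):
    """Return (after-tax yield, exchange effect, adjusted yield, net social return)."""
    after_tax = foreign_yield * (1 - withholding_tax_pct / 100)
    # Only bonds not denominated in dollars are exposed to exchange rate changes.
    fx_effect = fx_change * (1 - dollar_share_pct / 100)
    adjusted = after_tax + fx_effect
    return after_tax, fx_effect, adjusted, adjusted - us_yield


US_BOND_YIELD = 4.51  # percent per year, 1960-69 average
table2_inputs = {
    # region: (yield, withholding tax %, exchange rate change, % dollar-denominated)
    "Canada": (5.89, 15.0, -0.74, 88.4),
    "Western Europe": (6.28, 3.2, -0.46, 96.0),
}

net_social = {}
for region, row in table2_inputs.items():
    after_tax, fx_effect, adjusted, net = bond_returns(*row, US_BOND_YIELD)
    net_social[region] = net
    print(f"{region}: after tax {after_tax:.2f}, exchange effect {fx_effect:.2f}, "
          f"adjusted {adjusted:.2f}, net social {net:.2f}")

# Extreme case from the footnote on Canadian bonds: no Canadian tax at all.
canada_tax_free = bond_returns(5.89, 0.0, -0.74, 88.4, US_BOND_YIELD)[3]
print(f"Canada, tax-free bonds: {canada_tax_free:.2f}")
```

Rounded to two decimals, the sketch reproduces the reported net social rates of 0.41 and 1.55 percent, and the 1.29 percent figure for the tax-free Canadian case.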

Three brief comments regarding these calculations should be made.

⁵ Canadian bonds issued by the provinces and municipalities especially tend to be free
from Canadian taxation. This fact biases downward our estimate of U.S. rates of return
from bond holdings. Given all of the other simplifying assumptions, in the extreme, if all
Canadian bonds held had been tax free, the net social rate of return would have been 1.29
rather than 0.41 percent.



TABLE 2

U.S. Returns from Foreign Bond Holdings, 1960-69

                                                              Canada   Western Europe
1. Average yield, government bonds, 1960-69, %/year            5.89        6.28
2. Average withholding tax, 1962-67, % of interest            15.0         3.2
3. After-tax rate of return, %/year                            5.01        6.08
4. Average exchange rate change, 1960-69, %/year              -0.74       -0.46
5. Percentage of U.S.-dollar-denominated bonds, 1969          88.4        96.0
6. Change of portfolio due to exchange rate                   -0.09       -0.02
7. After-tax, after-exchange-rate-adjustments return, %/year   4.92        6.06
8. Average U.S. government bond yield, 1960-69, %/year               4.51
9. Net social rate of return, %/year                           0.41        1.55

SOURCES AND NOTES ON COMPUTATIONS

Yield: Organization for Economic Cooperation and Development (1970, vol. 2). Fourth-quarter average
long-term government bond yields were averaged over 1960-69 for each country of Western Europe. Weights
for calculation of Western European average are relative value of U.S. manufacturing investment in 1969
from U.S. Department of Commerce, Survey of Current Business (October 1970).

Tax rate: Statutory dividend withholding tax rates compiled by U.S. Treasury International Tax Affairs
Staff (Kopits 1970). Average for Western Europe weighted by U.S. manufacturing investment.

Exchange rate change: End-of-period exchange rates 1960-69 from International Monetary Fund (various
years). Change in rate expressed as percentage of 1960 rate. European rate is average weighted by
manufacturing investment.

U.S. bond rates: 3-5-year U.S. government issues (U.S. Government 1971, p. 264, table C57).


First, the exchange rate adjustment would be much more unfavorable if
the December 1971 currency realignments had been within the period
covered in this study. However, the net effect on the foreign rates of
return would have been mitigated considerably by the fact that nearly
90 percent of all foreign bonds held by U.S. residents are denominated in
dollars.

Second, the U.S. bond yield shown is unadjusted for the theoretical
fall in the U.S. rate that would have occurred if the funds invested abroad
had been used to purchase U.S. bonds. This adjustment seems unnecessary
in light of the fact that the total value of bonds in circulation in the United
States at the end of 1969 was $49[?] billion, of which the $7.5 billion of
foreign bonds represent only 1.5 percent.

Third, it is customary to assume that market interest rates reflect
expected rates of inflation over the life of financial assets. The risks of
unexpected changes in the price level and the resultant wealth
redistribution are shared equally between lenders and borrowers, assuming
that expectations on the average are correct. However, when unexpected
inflation does occur and is biased in favor of foreign borrowers, the
resultant wealth gain accrues to non-U.S. residents.⁶ One may wonder

⁶ Aliber (1973) has shown that during several years of the post-World War II period
interest rates typically failed to reflect exchange rate changes.







TABLE 3

U.S. Returns from Foreign Corporation Stock Holdings, 1960-69

                                                       Canada   Western Europe
 1. Average dividend yield, 1960-69, %/year              3.4         3.9
 2. Average capital gain, 1960-69, %/year                9.6         2.7
 3. Total yield (1) + (2)                               13.0         6.6
 4. Average withholding tax, % on dividends             15.0         7.6
 5. Average capital gains tax, %                         0.0        11.3
 6. Net after-tax yield, %/year                         12.5         6.0
 7. Average exchange rate change, 1960-69, %/year       -0.74       -0.46
 8. Yield after tax and exchange loss                   11.7         5.5
 9. Average U.S. dividend yield, 1960-69, %/year               3.2
10. Average U.S. capital gain, 1960-69, %/year                 6.7
11. Sum (9) + (10)                                             9.9
12. Net social rate of return (8) - (11)                 1.8        -4.4

SOURCES AND NOTES ON COMPUTATIONS

Dividend yield: Annual yield for Canada, United Kingdom, France, Germany, Italy, the Netherlands,
Norway, and Sweden from Organization for Economic Cooperation and Development (1970); each country’s
annual data averaged for the decade. Western Europe average weighted by book value of U.S. direct
investment in manufacturing in 1969 (from U.S. Department of Commerce, Survey of Current Business, October
1970).

Capital gains: Based on stock market indices of all countries in sample. Source same as dividend yield.
Western Europe average weighted like dividend yield figure.

Withholding tax rate and exchange rate changes: see table 2.

U.S. dividend and capital gains: Standard and Poor’s 500 stocks (Organization for Economic Cooperation
and Development 1970, pp. [page range illegible]). Last month of each year averaged over 10 years for dividend yield.
Percentage gain of price end of 1960 to end of 1969 for capital gains.


to what extent the average nominal bond yields shown above can be
assumed to have reflected fully anticipated rates of inflation during the
period.⁷ However, it is analytically not useful to consider that investors
consistently over a decade are injured by unexpected inflation rates, and
the data presented above will not be corrected for any inflation losses.


Equities 

The estimation of foreign rates of return on equities is nearly identical to
that undertaken for bonds. In the absence of any information on yields
of actual holdings, the yield figures of rows 1-3 are based on the
assumption that U.S. holders owned portfolios equal to the “averages” of the
individual markets, with Western European holdings weighted by the
share of U.S. direct investment. As can be seen from table 3, private rates
of return from Canadian stocks were equal to 3.4 and 9.6 percent from
dividends and capital gains, respectively. The analogous rates for Europe
were 3.9 and 2.7 percent. After application of the withholding and capital

⁷ During the period 1960-69, when the U.S. consumer price index rose 23.8 percent,
analogous percentage rises in some of the major countries in this study were: Canada,
26.7; United Kingdom, 40.0; Germany, 25.7; France, 42.5; Italy, 40.3 (International
Monetary Fund, various years, country statistics).








gains taxes (rows 4 and 5), the after-tax yields were 12.5 and 6.0 percent
for Canada and Europe, respectively. Adjustments for exchange rate
losses yielded the final figures of 11.7 and 5.5 percent for the two regions.
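The equity figures can be retraced in the same way as the bond figures. The sketch below uses the inputs of table 3 and assumes that the withholding tax applies to dividends only, the capital gains tax to capital gains only, and the full exchange rate change to the whole holding. The function name is illustrative.

```python
def equity_yields(dividend, capital_gain, withholding_tax_pct,
                  capital_gains_tax_pct, fx_change):
    """Return (after-tax yield, yield after tax and exchange change), %/year."""
    after_tax = (dividend * (1 - withholding_tax_pct / 100)
                 + capital_gain * (1 - capital_gains_tax_pct / 100))
    return after_tax, after_tax + fx_change


table3_inputs = {
    # region: (dividend yield, capital gain, withholding tax %, gains tax %, fx change)
    "Canada": (3.4, 9.6, 15.0, 0.0, -0.74),
    "Western Europe": (3.9, 2.7, 7.6, 11.3, -0.46),
}

results = {region: equity_yields(*row) for region, row in table3_inputs.items()}
for region, (after_tax, after_fx) in results.items():
    print(f"{region}: after tax {after_tax:.2f}, "
          f"after tax and exchange loss {after_fx:.2f}")
```

The computed values agree with the reported 12.5 and 6.0 percent (after tax) and 11.7 and 5.5 percent (after exchange losses) to within rounding of about 0.05 percentage points.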

One theoretically very difficult problem I have been unable to solve
concerns the choice of an appropriate U.S. opportunity cost for the funds
invested in foreign equities. There appear to be two extreme theoretical
possibilities. On the one hand, we may consider that the purchase of
domestic rather than foreign equities would have affected only the stock
of U.S. financial wealth in the form of equities in the portfolios of the
public; there would have been no induced new equity flotations,
corresponding real capital formation, and profits taxable at the corporate
income tax rate. Under these assumptions and given that foreign equity
holdings are but a minute fraction of the stock of U.S. shares held by
Americans, the proper opportunity cost of foreign equity purchases is
the returns from U.S. stock market investment. Rows 9-11 in table 3
show the dividend yield and capital gains of stocks in Standard and Poor’s
500 share index. They total 9.9 percent and imply that the holdings of
Canadian shares resulted in a positive net social rate of return of 1.8
percent per year during the decade. The corresponding figure for Western
Europe was a negative 4.4 percent.

The other extreme treatment of the opportunity cost of foreign equity
purchases considers them to have come fully at the expense of real capital
formation by existing or potential new firms. In order to produce
after-tax profits for dividend distribution and reinvestment equal to the 9.9
percent annual yield on equity shares, these firms would have had to
earn a pretax rate of return of 16.6 percent, assuming the corporate tax
rate to be 40.4 percent (see table 4, row 4 and note h). While the preceding
arguments contain many hidden assumptions about the operation and
efficiency of capital markets and the role of taxes and depreciation, the
resultant estimate of the rate of return on real capital formation is close
enough to other data available (see table 4, row 3, last column, and note
h for the source) to provide a reasonably reliable benchmark for our
purposes of calculation. Use of the preceding assumptions leads to the
conclusion that the average annual rates of loss from U.S. holdings of
foreign equities during the decade were 4.9 and 11.1 percent for Canada
and Europe, respectively.
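The two extreme opportunity-cost assumptions lead to a range of net social returns that can be checked directly. The sketch below uses only figures quoted in the text. Rounding the pretax benchmark to one decimal before subtracting follows the paper's reported 16.6 percent and is an assumption about the author's procedure.

```python
US_STOCK_RETURN = 9.9       # % per year, dividends plus capital gains (table 3)
CORPORATE_TAX_RATE = 40.4   # % assumed U.S. corporate income tax rate

# Pretax return needed to leave 9.9 percent after corporate tax.
pretax_benchmark = round(US_STOCK_RETURN / (1 - CORPORATE_TAX_RATE / 100), 1)

foreign_yield_after_tax_and_fx = {"Canada": 11.7, "Western Europe": 5.5}

ranges = {}
for region, foreign_yield in foreign_yield_after_tax_and_fx.items():
    vs_stock_market = round(foreign_yield - US_STOCK_RETURN, 1)
    vs_real_capital = round(foreign_yield - pretax_benchmark, 1)
    ranges[region] = (vs_stock_market, vs_real_capital)
    print(f"{region}: {vs_stock_market:+.1f} (stock market benchmark), "
          f"{vs_real_capital:+.1f} (real capital formation benchmark)")
```

The upper end of each range treats foreign purchases as displacing only existing U.S. shares. The lower end treats them as displacing taxable real capital formation, so the forgone corporate tax revenue counts as a social cost.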

Without much further empirical and theoretical analysis, which would
be outside the main objectives of this paper, we cannot decide which of
the above two theoretical models describes the real world most accurately.
However, the ranges calculated under the two extreme assumptions
indicate that foreign equity investments during the period probably
yielded a negative rate of return, though a very small positive yield from
Canadian equity holdings cannot be ruled out.




[Table 4 and its continuation: not recoverable from this copy.]

III. Direct Investment in Manufacturing 

United States statistics on direct foreign investment are much more
complete and reliable than the statistics on stocks and bonds. For this
reason, in row 1 of table 4 we show U.S. holdings by individual countries,
the sum of which represents 78.8 percent of all U.S. direct manufacturing
investment abroad in 1969.

United States direct investment holdings at the end of 1969 were 41.6
percent in manufacturing, 8.0 percent in mining and smelting, 28.2
percent in petroleum, and 22.2 percent in other industries. The present
study is limited to manufacturing investment for two reasons. First,
because the profitability of investment in mining and petroleum is so
dependent on the availability of natural deposits, it is impossible to
establish reliably what the domestic opportunity cost of the foreign
investment would have been. Second, special treaties with foreign
governments about royalties, special accounting, and depreciation rules
governing these nonmanufacturing industries abroad and in the United
States make it impossible to estimate rates of return which would be
theoretically appropriate for the present purposes of analysis.

The U.S. government regularly publishes “earnings” and the “book
value” of U.S. manufacturing enterprises abroad. Earnings are defined
as “the sum of the U.S. share in net earnings of foreign corporations and
branch profits after foreign but before U.S. taxes.” Book value is defined
as “equity of, plus liabilities to, parent.”⁸ The published U.S. statistics
on rates of return from foreign investment simply represent earnings per
year as a percentage of book value at the beginning of that year. A
10-year average of these rates of return (1960-69) is shown in row 5 of table 4.
In the following we will discuss the adjustments made on these figures to
derive the estimates of net social rates of return shown in rows 12 and 13
of table 4.

Earnings and Foreign Income Taxes

In order to compare properly U.S. and foreign business income tax rates,
it is insufficient to use statutory rates because in each country definitions
of income, expenses, depreciation allowances, etc., are different.
Fortunately, there exists a U.S. government study of effective foreign tax rates
based on the U.S. definition of corporate income and depreciation rates
(U.S. Treasury Department 1969). These tax rates were estimated from
data submitted to the Internal Revenue Service for the fiscal year 1961-62
by calculating for each corporation its income according to U.S. laws and
expressing the actual income taxes paid to foreign governments as a
fraction of this income. As can be seen from row 4 of table 4, these tax

⁸ For an updated and clear definition of terms employed in U.S. government statistics
on the U.S. foreign investment position see U.S. Department of Commerce, Survey of
Current Business (October 1971, pp. 37-38).




rates vary widely between countries, but the weighted average of 40.7
percent is only 0.3 percentage points above the domestic U.S. rate.

As a result of the absence of foreign tax rate data for years other than
1961-62, it was assumed that these rates were in force throughout the
decade under study, and the average before-tax rates of return for the
period were calculated by inflating the average after-tax rates of row 5.
The results are shown in row 2 of table 4. The weighted average of
before-tax returns on book value is 19.5 percent per year.
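"Inflating" an after-tax rate to a before-tax rate means dividing by one minus the effective tax rate. The sketch below illustrates this with the two weighted averages quoted in the text. The implied after-tax figure is derived here for illustration and is not a number reported in the paper; the paper applies the adjustment country by country, so the averages need not match this single-step calculation exactly.

```python
def gross_up(after_tax_return, effective_tax_rate_pct):
    """Before-tax return implied by an after-tax return and an effective tax rate."""
    return after_tax_return / (1 - effective_tax_rate_pct / 100)


def net_of_tax(before_tax_return, effective_tax_rate_pct):
    """After-tax return implied by a before-tax return and an effective tax rate."""
    return before_tax_return * (1 - effective_tax_rate_pct / 100)


BEFORE_TAX_ON_BOOK_VALUE = 19.5   # % per year, weighted average from the text
FOREIGN_EFFECTIVE_TAX = 40.7      # % weighted average effective foreign tax rate

implied_after_tax = net_of_tax(BEFORE_TAX_ON_BOOK_VALUE, FOREIGN_EFFECTIVE_TAX)
round_trip = gross_up(implied_after_tax, FOREIGN_EFFECTIVE_TAX)
print(f"implied after-tax return on book value: {implied_after_tax:.1f} percent")
print(f"grossed up again: {round_trip:.1f} percent")
```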

Book Value versus Equity 

However, before domestic and foreign rates can be compared, it is
necessary to adjust the foreign rates so that they reflect return to equity
rather than book value. The liabilities to parent, which represent the
difference between foreign equity and book value, are loans or other
extensions of credit, which do not share in the risks accruing to equity
investment. The earnings on these nonequity U.S. assets should be
considered as a separate category of U.S. foreign assets but will be
disregarded here. In this study we will adjust the book value by taking
advantage of a recent publication (U.S. Department of Commerce 1971)
of the financial structure of U.S.-owned corporate manufacturing
affiliates abroad. It shows liabilities to parents to have been 27.3 and 23.1
percent of book value for affiliates in Canada and the rest of the world,
respectively, during the period 1966-69. Under the assumption that this
financial structure is representative of the full population of U.S.-owned
foreign corporations and branches⁹ and for the full decade 1960-69, we
estimated average annual rates of return to equity by multiplying the
rates of return on book value shown in rows 2 and 5 by 1.376 and 1.30
for Canada and the remaining countries, respectively. The results are
shown in rows 3 and 6 of table 4. In the remaining parts of table 4 all
further adjustments on rates of return will be made on the equity base,
which is the theoretically proper one for the comparisons with U.S.
domestic rates of return.
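
The two multipliers can be checked with a line of arithmetic. If liabilities to parents are a share s of book value, equity is (1 - s) of book value, so a return measured on book value scales up by 1/(1 - s) when restated on the equity base. The sketch below is illustrative only: the function name `equity_multiplier` is hypothetical, and the only inputs are the two shares quoted in the text.

```python
def equity_multiplier(liabilities_share):
    """Factor that restates a return on book value as a return on equity.

    Assumes equity = book value * (1 - liabilities_share), as in the text.
    """
    return 1.0 / (1.0 - liabilities_share)


# Shares of liabilities to parents in book value, 1966-69 (from the text).
canada = equity_multiplier(0.273)
rest_of_world = equity_multiplier(0.231)

print(f"Canada: {canada:.3f}")
print(f"Rest of world: {rest_of_world:.2f}")
```

Under this reading, the factors 1.376 and 1.30 are simply the reciprocals of the equity shares 0.727 and 0.769.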

Dividend Withholding Taxes 

United States income from foreign manufacturing investment is subject to
double taxation in the same way as income from domestic corporations.
First, the government taxes the corporation's income directly, which
calls for the adjustments already made. Then the dividends distributed
by the corporation are subject to a dividend withholding tax. In row 8 of
table 4 we show the rate at which foreign governments withhold taxes on dividends.

However, the second form of taxation takes place only on that portion 

⁹ According to U.S. Department of Commerce, Survey of Current Business (October
1971, p. 38), earnings of foreign corporations and foreign branches were $6.3 and $2.5
billion, respectively, in 1970.




of earnings that is distributed rather than reinvested. In row 7 of table 4
the average 1960-69 ratio of distributed to before-income-tax earnings is
shown. For all countries (next to last column of table 4) the average of
10.9 percent withholding tax (row 8) applied to the 52 percent average
distributed earnings (row 7) is seen to lower the after-corporate-income-tax
rate of return on equity of 15.1 percent (row 6) to 13.7 percent (row
9). Therefore, the average withholding tax rate is effectively equal to
1.4 percent of equity per year.

Exchange rate changes over the period under study increased or decreased
the capital value of the foreign investment holding in terms of
U.S. dollars at the annual rate shown in row 10 of table 4. Row 11 contains
the rates of return after taxes and exchange rate adjustments, which
are the theoretically relevant base of comparison with the domestic
opportunity cost of the resources, the pretax rates of return on U.S.
domestic manufacturing investment of 19.3 percent during the decade
(see note h of table 4 for the source of the 19.3 figure). Row 12 of table 4
presents the net social rates of return of investing abroad rather than
domestically as defined in equation (3).

As can be seen, the all-country average shows a negative yield of −5.9
percent per year. Positive yields of 0.1, 1.9, 1.1, 3.5, and 5.5 percent were
achieved only from the investments in Belgium and Luxembourg,
Germany, Denmark, Switzerland, and South Africa, respectively.

Royalties and Fees 

The theoretical literature on the determinants and welfare effects of
direct foreign investment¹⁰ stresses the comparative advantages accruing
to multinational corporations from the use of research, management, and
marketing skills, which are developed at high initial cost for the domestic
U.S. market but which can be applied at relatively low marginal private
cost in the operation of foreign subsidiaries. In their accounting conventions,
these corporations charge their subsidiaries the average cost of
developing the production, management, and marketing knowledge, and
the U.S. government reports the value of income from these sources as
"royalties, license fees, and rentals" and "management fees and charges,"
royalties and fees for short.

In the reporting of the balance of payments it is theoretically unobjectionable
to include royalties and fees in "income from foreign investment,"
along with dividends and distributed profits,¹¹ on the grounds that
without the foreign direct investment this financial flow would not have
taken place. However, in the present analysis we are concerned with the
productivity of resources, not financial flows. From this point of view the

¹⁰ See Kindleberger (1968), Johnson (1969), and Caves (1971).

¹¹ The U.S. balance-of-payments statistics report income from foreign investment as
the sum of dividends and distributed profits plus royalties and fees to subsidiaries.




full value of royalties and fees represents a net increase in the productivity
of U.S. knowledge capital only if the marginal cost of providing these
services is genuinely zero. More generally, the social income from
royalties and fees charged foreign subsidiaries is equal to the difference
between the average cost at which they are billed and the true marginal
cost.

Unfortunately, there exists no evidence on the relationship between
marginal and average cost of these services, and in our calculations we had
to assume that the marginal cost was zero.¹² As a result, the estimate of
the return from foreign investment is almost certainly biased upward,
since it is highly unlikely that the marginal cost is zero. However, as one
of the important findings of this paper is the negativity of the net social
rates of return from foreign investment, the bias introduced by our
assumption is in the proper direction.

During the period 1960-69, the value of royalties and fees averaged 2.1
and 3.9 percent annually of the book value of foreign investment in
Canada and Europe, respectively. These figures were adjusted for the
equity base, and we assumed that Australian and South African rates
were the same as those for Europe. The resultant increase in rates of
return, which is not subject to any forms of foreign taxation, was added to
the net social rates of return shown in row 12 of table 4. The results are
presented in row 13 of table 4 and indicate that the overall, all-country
average was −2.7 percent compared with −6.4 percent without the
adjustment for royalties and fees. Positive adjusted rates of return are
shown for Belgium and Luxembourg (5.2), Germany (7.0), the Netherlands
(1.3), Denmark (6.2), Switzerland (8.6), Australia (2.2), and South
Africa (10.6). The overall average (−0.9) is dominated by the proportionately
large investments in Canada (−6.3) and the United Kingdom
(−0.3). France (−7.0) and Spain (−5.9) retained their substantial
negative rates.
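
The royalty adjustment can be retraced as a rough check. The sketch below restates the royalty rates on the equity base using the multipliers given earlier (1.376 for Canada, 1.30 elsewhere) and adds the European figure to the row 12 yields, read here as 1.9, 1.1, 3.5, and 5.5 percent for Germany, Denmark, Switzerland, and South Africa. The variable names are hypothetical, and the calculation is an assumption about how the adjustment was carried out, not a reproduction of table 4.

```python
# Royalties and fees as a percentage of book value, 1960-69 (from the text).
ROYALTIES_ON_BOOK = {"Canada": 2.1, "Europe": 3.9}
# Book-value-to-equity multipliers used earlier in the text.
EQUITY_MULTIPLIER = {"Canada": 1.376, "Europe": 1.30}

# Restate the royalty rates on the equity base.
royalties_on_equity = {
    region: ROYALTIES_ON_BOOK[region] * EQUITY_MULTIPLIER[region]
    for region in ROYALTIES_ON_BOOK
}

# Row 12 yields for countries assigned the European royalty rate.
ROW_12 = {"Germany": 1.9, "Denmark": 1.1, "Switzerland": 3.5, "South Africa": 5.5}

# Add the equity-based royalty rate to obtain the adjusted (row 13) yields.
row_13 = {
    country: round(rate + royalties_on_equity["Europe"], 1)
    for country, rate in ROW_12.items()
}
print(row_13)
```

On these assumptions the European adjustment is about 5.1 percentage points, which matches the gap between the unadjusted and adjusted yields reported for these four countries.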

Confiscation

In the past, U.S. investments in many countries of the world, but especially
in Cuba, Eastern Europe, China, and some Latin American countries,
have been lost to private investors and to the U.S. economy as a result of
confiscation or inadequate compensation by foreign governments. In

¹² Theoretically, the opportunity to earn further income on the knowledge capital
originally produced only for domestic use should increase the yield from this knowledge
capital relative to other forms of capital formation. Therefore, the opportunity to charge
foreign subsidiaries for a share of these costs should induce increased expenditures. Once
such an expansion has taken place, it is theoretically appropriate to charge foreign
subsidiaries the full share of the average cost and inappropriate to consider income from
royalties and fees as representing a net social gain acquired at zero or even very low
marginal cost. In the long run all fixed costs are variable, and the analysis should reflect
this fact. For this reason the adjustments to the rates of return on row 12 must be
considered as introducing a theoretically very strong upward bias to the results.




principle, these losses should be averaged and incorporated into the
present calculation of net social rates of return from foreign investment.
Unfortunately, no ready data are available to carry out such calculations,
and the necessary adjustments of our estimates are not undertaken.
Consequently, the results presented above are biased upward. In spite of
this upward bias, however, the rates still average out to negative.

IV. Summary and Conclusions 

The analysis presented in this paper must be considered exploratory.
Future studies may well lead to important revisions of the main conclusions
that during the decade 1960-69 U.S. holdings of foreign bonds were
socially profitable while some of the equity and direct manufacturing
holdings resulted in a social loss. Extensions of the analysis in future
studies may profitably take place along the following lines.

First, an attempt should be made to estimate returns to total U.S.
asset holdings abroad, including short-term assets, bank loans, direct
investment in forms other than manufacturing, government holdings, etc.
The effect of investment in mining and oil on U.S. terms of trade should
be analyzed carefully.

Second, the U.S. holdings should be considered as part of a global
system which results in foreigners also holding U.S. assets taxable by the
U.S. government. For example, at the end of 1969 the nonliquid, private,
long-term U.S. investment position showed $96 billion assets abroad and
$41 billion liabilities to foreigners. Foreign investment in the United
States probably results in net U.S. gains which can be considered as
offsetting the losses discussed above.

Third, the externalities and government expenditures associated with
asset holdings of various forms should be explored further. As an entity,
U.S. taxpayers receive government services that tend to be equal in value
to all of the taxes paid. The relevant question about the welfare effects of
foreign direct investment therefore is the extent to which tax payments
by corporate or other business entities can be considered as being in
return for a government service that can be eliminated in the United
States and has to be undertaken abroad because of the U.S. capital
export.

Until these lines of analysis have been explored further, the results of
the present paper on the social nonprofitability of many foreign investments
should be used with the proper caution in any policy decisions about
restrictions on and taxation of U.S. asset holdings abroad. However, the
evidence is sufficiently reliable to conclude that the export of capital
from the United States for direct investment in manufacturing and the
internationalist policies guiding the taxation of income from this capital
do not clearly raise U.S. welfare and probably lower it.




References 




Aliber, R. "Tests of Fisher's Expectations Hypothesis." Paper presented at the
Royal Economic Society Conference of Specialists in International Monetary
Economics, Ditchley Park, England, January 1973.
Caves, R. "International Corporations: The Industrial Economics of Foreign
Investment." Economica 38 (February 1971): 1-27.
Dunning, J. Studies in International Investment. London: George Allen & Unwin,
1970.
Goldfinger, N. "A Labor View of Foreign Investment and Trade Issues." In
Williams Commission Report. Washington: Government Printing Office, 1971.
Grubel, H. G. "Internationally Diversified Portfolios: Welfare Gains and Capital
Flows." A.E.R. 58 (December 1968): 1299-1314.
Grubel, H. G., and Fadner, K. "The Interdependence of International Equity
Markets." J. Finance 26 (March 1971): 89-94.
International Monetary Fund. International Financial Statistics. Washington:
Internat. Monetary Fund, various years.
Jenkins, G. "Analysis of Rates of Return from Capital in Canada." Ph.D.
dissertation, Univ. Chicago, 1972.
Johnson, H. G. "The Efficiency and Welfare Implications of the International
Corporation." In The International Corporation, edited by C. P. Kindleberger.
Cambridge, Mass.: MIT Press, 1969.
Kindleberger, C. P. American Business Abroad. New Haven, Conn.: Yale Univ.
Press, 1968.
Kopits, G. F. "The Industrial Economics of Foreign Investment: A Comment."
Mimeographed. Washington: Treasury Department, 1970.
Krause, L., and Dam, K. Federal Tax Treatment of Foreign Income. Washington:
Brookings Inst., 1967.
MacDougall, D. "The Benefits and Costs of Private Investment from Abroad."
Econ. Record 36, no. 73 (March 1960): 13-36.
Magee, S. "The Welfare Effects of Restrictions on U.S. Trade." Working Paper
no. 18. Chicago: Graduate School of Business, University of Chicago, December
1972.
Musgrave, P. United States Taxation of Foreign Investment Income. Cambridge,
Mass.: Harvard Law School, International Tax Program, 1969.
---. "Tax Preferences to Foreign Investment." In The Economics of Federal
Subsidy Programs, pt. 2. Washington: Joint Econ. Committee of Congress, June
11, 1972.
Organization for Economic Cooperation and Development. Financial Statistics,
vol. 2. Paris: Org. Econ. Cooperation and Development, December 1970.
Reddaway, W. B., et al. Effects of U.K. Direct Investment Overseas. Cambridge:
Cambridge Univ. Press, interim report 1967, final report 1968.
U.S., Department of Commerce. Survey of Current Business, various years.
Washington: Dept. Commerce.
---. Foreign Affiliate Financial Survey, 1966-69. Washington: Dept. Commerce,
July 1971.
U.S., Government. The Economic Report of the President: Transmitted to Congress
February 1971. Washington: Government Printing Office, 1971.
U.S., Treasury Department. Internal Revenue Service, Statistics of Income, 1962,
Supplemental Report: Foreign Income and Taxes Reported on Corporation Income Tax
Returns. Washington: Government Printing Office, 1969.




Cattle as Capital Goods and Ranchers as 
Portfolio Managers: An Application to the 
Argentine Cattle Sector 


Lovell S. Jarvis

University of California, Berkeley, and the Ford Foundation, Santiago, Chile 


Microeconomic models, treating cattle as capital goods and producers
as portfolio managers, provide a theoretical framework for an econometric
model of the Argentine cattle sector. Argentine agricultural
stagnation has been partially attributed to a lack of producer price
response, but this paper shows that the cattle sector exhibits strong price
response and that producers correctly differentiate their behavior toward
animals of different age and sex. The long-run price response of slaughter
is positive, but the short-run response is negative because animals must
be withheld to permit increases in future output. This negative short-run
response implies that devaluation is unlikely to increase beef exports for
at least 2 years.


In this paper a number of microeconomic models are developed to
provide a theoretical framework on which an econometric model of the
Argentine cattle sector could be based. The central theme is simple. Cattle
are considered to be capital goods which are held by producers as long
as their capital value in production exceeds their slaughter value. In
essence, producers become portfolio managers seeking the optimal
combination of different categories of animals to complement their
non-cattle assets, given existing conditions and future expectations. The
models developed show clearly the differential impact of parameter
changes upon the capital values of animals of different age and sex and
indicate that the equations explaining slaughter and average slaughter

This paper is a revision of chapters 2 and 8 of my Ph.D. thesis, "Supply Response in
the Cattle Sector: The Argentine Case 1937/38-1966/67," Department of Economics,
Massachusetts Institute of Technology, 1969 (which is to be published by the University
of California Press). Many individuals provided helpful criticisms and encouragement at
various stages of the study. I would like to thank especially Franklin Fisher, Charles
Freedman, Richard Schmalensee, J. Phillip Cooper, Carlos Díaz-Alejandro, George
Akerlof, and Steven Goldman. Financial assistance was received from the National Science
Foundation, the Center for Latin American Studies at the University of California,
Berkeley, and the Ford Foundation. The theoretical section of this paper was completed
in 1968 while I was a guest of the Instituto Torcuato di Tella, Buenos Aires, Argentina.



weight in an econometric model should be disaggregated by animal
categories to obtain a meaningful explanation of producers' response.
Such an econometric model was developed and estimated for the
Argentine cattle sector for the years 1937-67. The econometric model
does a very good job, judging by conventional statistical tests, of
explaining the operation and past behavior of the Argentine cattle sector,
and the results comply closely with the previously developed
micro-theory.

There is considerable historical interest in the functioning of the
Argentine cattle sector, long a prototype of large-scale traditional
ranching in Latin America and the source of a significant proportion of
world beef exports (nearly 20 percent during the 1960s). This interest has
become more sharply focused in recent years because of the stagnation of
Argentine agricultural production and the consequent desire in Argentina
to formulate an improved governmental policy toward agriculture.¹
Among the factors alleged to be a cause of stagnation is a lack of response
by agricultural producers to economic incentives, particularly to price
changes. Several efforts were made during the 1960s to test empirically
the price responsiveness of Argentine agricultural producers. These
studies sought to at least partially answer the questions regarding producer
behavior, and to provide policy makers with a framework for predicting
the effect on production (and exports) of changes in agricultural prices.
Somewhat surprisingly, given the experience in most other countries
where similar studies have been carried out, little price response has been
discovered in Argentina.² Almost no response was found for corn, a major
crop which has experienced considerable output variation, and the
response evident during the 1945-65 period for wheat, linseed, and oats is
of much smaller magnitude and lower statistical significance than that
encountered for these crops in most countries. This apparent lack of
price response gave support to the "structuralist" interpretation of
stagnation (and suggested that price policy, per se, may have very limited
usefulness in Argentine agriculture).


¹ The traditional exports of Argentina are produced in the Pampa, one of the world's
richest agricultural areas. These products (food grains, oil seeds, wool, and cattle) still
accounted for [illegible] percent of total exports in 196[?], despite the fact that gross agricultural
output in this area showed almost no increase between 1933-39 and 1960-64. Thus,
stagnation here implied stagnation in total exports, with serious consequences for
Argentine development in general. Moreover, beef production retains the most important
position among agricultural activities. Not only has beef accounted for more than
25 percent of total exports in recent years, but it also is one of the principal wage goods
in Argentina, weighing heavily in the cost-of-living index and being a particularly
important political symbol to the working classes.

² Díaz-Alejandro (1965, 1970), Colomé (1966), Freire (1966), Williams (1966), and
Reca (1967) all estimated acreage or output response equations for different products,
and all found little price response. Díaz-Alejandro, for example, concluded that other
factors such as soil erosion, weather, the availability of inputs, social overhead facilities,
and credit affected producers more than did prices and had, on balance, shifted the supply
schedule to the left over time.




There was some previous evidence of price responsiveness for cattle
production qualitatively consistent with the results reported here, but
earlier studies usually did not encounter significant price coefficients.
The models used generally had a very simple specification, however, and
the results obtained could have been attributable to the omission of
important variables. These earlier studies were also plagued by a lack of
reliable data, which made suspect their econometric results. And the
time profile of the response of beef supply to price changes also confused
many observers.

The cattle sector presents an interesting feature insofar as the slaughter
of animals responds negatively to a price increase in the short run. This
behavior contrasts with the supply response of most other agricultural
products, such as field crops, whose desired output (in the Nerlovian
sense) and actual output are both expected to rise immediately in response
to a price increase. Although actual output for these crops is expected to
adjust only gradually to the long-run desired output, there is never
reason to expect output to fall when its price increases.⁵ But, because
cattle production can be increased only by increasing the size of the
breeding herd and/or withholding animals for further fattening, producers
must bid animals away from consumers to increase the capital stock
which is the source of higher future beef production. And the slow rate of
biological reproduction causes the negative supply response to persist
for some time. Because of the continuing confusion within Argentina
regarding the cause and implications of the short-run reduction in
slaughter, I sought to show that this behavior was rational, that a properly
specified model would show a significant price response during the
1937-67 period, and that one could even show a theoretically correct
differentiation of producers' behavior toward animals of different age and
sex.

I 

To begin, micro-models are used to determine the optimum slaughter
age and feed input for a steer, given growth functions for the animal and
the following parameters faced by producers: the price of beef, the interest

³ See Díaz-Alejandro (1965), Colomé (1966), Otrera (1966), and Reca (1967).

⁴ The required data series for herd stocks were constructed previously, making use of
official censuses (as benchmarks) and official slaughter data. These constructed series
were subjected to considerable independent testing for possible bias, a detailed discussion
of which is contained in Jarvis (1969) and presented in revised form in Jarvis (1971),
as an example of how economic models may be used to construct unavailable data from
existing data. The reader should be aware that remaining biases, or those introduced in
the construction process, could affect the results presented in this paper.

⁵ Marketed output can decline if the increase in producer self-consumption exceeds
absolutely the rise in output. See Behrman (1967).

⁶ Díaz-Alejandro (1965) provided the first clear outline of the theory developed here,
pointing out that a relative rise in beef price induces producers to invest further in beef
production, which can be accomplished only by reducing current slaughter.




rate, and the cost of other inputs. To simplify the exposition, I begin with
a model where the only input is the steer itself.⁷ Let $\theta$ = age of the steer,
$w(\theta)$ = weight of the steer at age $\theta$, $dw/d\theta > 0$, $d^2w/d\theta^2 < 0$, $r$ =
interest rate, and $V(\theta)$ = the present discounted value of an animal
allowed to live to age $\theta$. Thus if $V(\theta) = w(\theta)e^{-r\theta}$, where the price of beef
is arbitrarily fixed at a constant value of unity, the steer will be slaughtered
at age $\hat{\theta}$, which is chosen to maximize $V(\theta)$. The first-order condition for
a maximum yields the requirement $(dw/d\theta)/w = r$; $\hat{\theta}$ occurs when the
rate of growth of the animal is equal to the interest rate. The second-order
condition, $d^2V/d\theta^2 < 0$, follows easily and requires that the rate of gain
be declining. In this model there is no opportunity cost to cattle production
other than the interest foregone on invested capital. An increase in
$r$ will lower the optimal slaughter age, and vice versa:

$$\frac{\partial\hat{\theta}}{\partial r} = -\frac{\partial^2 V/(\partial\theta\,\partial r)}{\partial^2 V/\partial\theta^2} < 0, \quad \text{for} \quad \frac{\partial^2 V}{\partial\theta\,\partial r} < 0. \tag{1}$$


Although this model implies an important role for the interest rate, the
latter actually plays a relatively unimportant role, as is discussed
subsequently.
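
The no-feed model is easy to explore numerically. The sketch below is illustrative only: the growth curve $w(\theta) = W_{\max}(1 - e^{-k\theta})$ and the parameter values are assumptions chosen to satisfy $dw/d\theta > 0$ and $d^2w/d\theta^2 < 0$, not estimates for Argentine steers. For this curve the condition $(dw/d\theta)/w = r$ has the closed form $\hat{\theta} = \ln(1 + k/r)/k$, which the code compares with a brute-force search over ages.

```python
import math

K = 0.8        # assumed growth parameter (per year); illustrative only
W_MAX = 500.0  # assumed mature weight; illustrative only


def weight(theta):
    """Assumed concave growth curve: rising in age at a declining rate."""
    return W_MAX * (1.0 - math.exp(-K * theta))


def present_value(theta, r):
    """V(theta) = w(theta) * exp(-r * theta), with the beef price set to 1."""
    return weight(theta) * math.exp(-r * theta)


def optimal_age_closed_form(r):
    """Solve w'/w = K / (exp(K * theta) - 1) = r for theta."""
    return math.log(1.0 + K / r) / K


def optimal_age_grid(r, step=0.001, max_age=20.0):
    """Brute-force check: the age on a fine grid that maximizes V."""
    n = int(max_age / step)
    ages = [j * step for j in range(1, n + 1)]
    return max(ages, key=lambda age: present_value(age, r))


for r in (0.05, 0.10, 0.20):
    closed = optimal_age_closed_form(r)
    grid = optimal_age_grid(r)
    print(f"r = {r:.2f}: closed form {closed:.3f}, grid search {grid:.3f}")
```

Under these assumptions the optimal slaughter age falls as the interest rate rises, in line with the sign stated for equation (1).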

The model can be made more realistic by recognizing that a steer
requires certain costly inputs throughout his life, which must be considered
when choosing the optimal slaughter age. Although the slaughter
decision still depends upon the animal's rate of growth, the interest rate,
and prices, it is conditional on the animal's being "fed" the optimal
ration. The criterion becomes maximization of the present discounted
"profit" of the fattening process, which in perfect markets will be the
value of the calf at birth. This is given in equation (2):

$$\pi(\theta) = p(i, \theta)\,w(i, \theta)\,e^{-r\theta} - ci\int_0^{\theta} e^{-rt}\,dt. \tag{2}$$

The new variables are $\pi$ = the present discounted profit of the fattening
process, $i$ = a fixed bundle of daily inputs to the steer, independent of $\theta$,
$c$ = the cost of the fixed bundle, $i$, and $p$ = the price per pound which
may be obtained from a beef consumer for the steer at age $\theta$. Both the
weight and the price of the animal are assumed to be functions of $i$ and
$\theta$, implying that the quality of the beef is reflected in the unit price
received. The inputs required consist primarily of feed, but conceptually

⁷ This might be called the "schmoo" model, except that the steer does not die
voluntarily. A steer is a castrated male valued only for its beef production.




may include all inputs such as labor, shelter, fences, machinery, and
veterinary care.⁸ For the moment no attention is paid to the process of
determining market prices, for these are given to the individual producer.
The first-order conditions for a maximization of $\pi$ require that the
producer select both the optimal slaughter age and the optimal input
stream:

$$\frac{\partial\pi}{\partial\theta} = 0, \tag{3a}$$

$$\frac{\partial\pi}{\partial i} = 0, \tag{3b}$$

which yield

$$p\frac{\partial w}{\partial\theta} + w\frac{\partial p}{\partial\theta} = rpw + ci, \tag{3a$'$}$$

$$p\frac{\partial w}{\partial i} + w\frac{\partial p}{\partial i} = \frac{c}{r}\left(e^{r\theta} - 1\right). \tag{3b$'$}$$

At $\hat{\theta}$ the change in value due to changing weight and quality (unit
price) is equal to the current interest foregone plus the cost of feeding.
Alternatively, dividing through by $pw$, the rate of weight gain plus the
rate of price change due to aging is equal to the interest rate plus the cost
per day of feeding the animal as a percentage of its total value. Similarly,
at $\hat{i}$ the present discounted value of the marginal net weight gain and price
increase corresponding to a higher stream of inputs throughout the steer's
life, less the present discounted cost of feeding the animal these inputs,
must be zero.

It is more important, however, to discover how the optimal $\theta$ and $i$ are
affected by changes in the parameters faced by producers, that is, the


⁸ The assumption that the input bundle is fixed is unrealistic. The input bundle varies
over the animal's life, and the animal's response to current inputs depends on the amount
and timing of past inputs. This becomes complicated mathematically, however, and has
not been included in this analysis even though such effects are sometimes important.
Other factors can be more simply incorporated. For example, marketing costs paid by
the producer at the time of sale will tend to lengthen the slaughter age. If these costs
are fixed, and denoted by $z$, we have

$$\pi(\theta) = pwe^{-r\theta} - ci\int_0^{\theta} e^{-rt}\,dt - ze^{-r\theta}$$

and

$$\frac{\partial\hat{\theta}}{\partial z} = -\frac{\partial^2\pi/(\partial\theta\,\partial z)}{\partial^2\pi/\partial\theta^2} > 0, \quad \text{for} \quad \frac{\partial^2\pi}{\partial\theta\,\partial z} = e^{-r\theta}(r) > 0.$$

The producer reduces the present discounted value of the marketing costs by prolonging
the time of sale. During any period he slaughters fewer cattle, holding each to an older
age. Price expectations can be introduced by letting the price vector vary with time,
$p(i, \theta, t)$. The effect of climatic variation or disease on the animal's ability to convert
feed into beef can be recognized by allowing shifts in the growth function, $w(i, \theta)$.





price of beef, the cost of inputs, and the interest rate. To determine this,
the implicit function theorem may be used. The function for profitability
is

$$\pi = f(i, \theta, r, c, p), \tag{4}$$

where the variables $i$, $\theta$ satisfy the subsidiary conditions

$$\frac{\partial\pi}{\partial\theta} = \Phi(i, \theta, r, c, p) = 0, \tag{5}$$

and

$$\frac{\partial\pi}{\partial i} = \Psi(i, \theta, r, c, p) = 0. \tag{6}$$

After writing $\hat{i}$, $\hat{\theta}$ as functions of $r$, $c$, $p$,

$$\hat{i} = X(r, c, p), \tag{7}$$

$$\hat{\theta} = P(r, c, p), \tag{8}$$

$X$ and $P$ may be substituted for $i$ and $\theta$ in $\Phi$ and $\Psi$. Using the chain rule
for differentiation, we may then solve for the unknowns,

$$\frac{\partial\hat{i}}{\partial p},\ \frac{\partial\hat{i}}{\partial c},\ \frac{\partial\hat{i}}{\partial r},\ \frac{\partial\hat{\theta}}{\partial p},\ \frac{\partial\hat{\theta}}{\partial c},\ \frac{\partial\hat{\theta}}{\partial r}.$$

Explicit numerical solutions for each of these unknowns can be obtained
using data from farm management studies. However, as only qualitative
results were needed for this study, rough estimates of the price data and
the growth functions pertaining to Argentine steers were used to obtain
the following results. These results will be valid as long as the rate of gain
is declining in the vicinity of $\hat{\theta}$ and if the marginal return to increased
inputs diminishes monotonically.

A rise in $p$ increases the marginal value product of each
input, increasing the optimal feed ration and the optimal
slaughter age:

$$\frac{\partial\hat{i}}{\partial p} > 0, \qquad \frac{\partial\hat{\theta}}{\partial p} > 0.$$

A rise in the cost of inputs reduces both the daily input
and the optimal slaughter age. Animals are fed not only
less per day, but for a shorter period of time because they
grow more slowly at any given age:

$$\frac{\partial\hat{i}}{\partial c} < 0, \qquad \frac{\partial\hat{\theta}}{\partial c} < 0.$$

A rise in $r$ reduces the daily feed inputs, because higher
feed investment implies higher interest costs. The increase
in $r$ also reduces the optimal slaughter age, as it increases
the interest foregone at every age:

$$\frac{\partial\hat{i}}{\partial r} < 0, \qquad \frac{\partial\hat{\theta}}{\partial r} < 0.$$



Fig. 1

In particular, these results indicate that a negative slaughter response
for steers is expected in the "short run." Fewer steers are slaughtered
temporarily because a higher price causes them to be withheld. This, of
course, is a ceteris paribus result.

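Consider now the determination of the market prices of different-aged
male animals, from calf to steer. We know that $\pi(\hat{\theta})$ represents the calf's
value at birth; that is, it is the amount which if invested at interest rate
$r$ would have the same money value at time $\hat{\theta}$ as the finished steer, less
the total feed costs compounded from their time of input to $\hat{\theta}$, at rate $r$.
The calf has value because it is a "growing" machine:

$$\pi(\hat{\theta}) = p(\hat{i}, \hat{\theta})\,w(\hat{i}, \hat{\theta})\,e^{-r\hat{\theta}} - c\hat{i}\int_0^{\hat{\theta}} e^{-rt}\,dt,$$

$$\pi(\hat{\theta})\,e^{r\hat{\theta}} = p(\hat{i}, \hat{\theta})\,w(\hat{i}, \hat{\theta}) - \frac{c\hat{i}}{r}\left(e^{r\hat{\theta}} - 1\right). \tag{9}$$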
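
The step from the discounted to the compounded form of the calf's value can be verified numerically. The sketch below is a hedged check, not part of the original analysis: all parameter values are invented for illustration, and the function names are hypothetical. It evaluates the feed-cost integral by the midpoint rule, compounds the resulting calf value to the slaughter age, and compares it with the closed-form expression in which feed costs accumulate to $(ci/r)(e^{r\theta} - 1)$.

```python
import math


def feed_cost_present_value(c, i, r, theta, steps=100_000):
    """Midpoint-rule approximation of c * i * integral_0^theta exp(-r t) dt."""
    h = theta / steps
    total = sum(math.exp(-r * (j + 0.5) * h) for j in range(steps))
    return c * i * h * total


def calf_value(p, w, c, i, r, theta):
    """Discounted sale value of the finished steer less discounted feed costs."""
    return p * w * math.exp(-r * theta) - feed_cost_present_value(c, i, r, theta)


def compounded_calf_value(p, w, c, i, r, theta):
    """Sale value less feed costs compounded forward to the slaughter age."""
    return p * w - (c * i / r) * (math.exp(r * theta) - 1.0)


# Illustrative values only: unit beef price, a 450-unit steer, a feed bundle
# costing 60 per year in total, 10 percent interest, slaughter at 2.5 years.
p, w, c, i, r, theta = 1.0, 450.0, 0.2, 300.0, 0.10, 2.5

lhs = calf_value(p, w, c, i, r, theta) * math.exp(r * theta)
rhs = compounded_calf_value(p, w, c, i, r, theta)
print(f"compounded calf value: {lhs:.4f} (numerical), {rhs:.4f} (closed form)")
```

The two figures agree up to the integration error, which is the sense in which the calf's value, invested at rate $r$, grows into the finished steer's value net of compounded feed costs.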

This result may be illustrated graphically while demonstrating another
point as well (see fig. 1). In deriving the optimal slaughter age and input
stream, it was assumed that producers faced known functions for the rate
of gain and the rate of change of price per unit for each animal. The
product of these functions would, if graphed as a function of age, yield
the locus shown as $p(i, \theta)w(i, \theta)$. Given our previous simplified assumptions,
slaughter occurs only at one age, $\hat{\theta}$, and because we assume perfect
competition, the market value of the animal at $\hat{\theta}$ must equal the cost of
producing the animal. This supply cost, reflecting the cost of feed inputs







as well as the interest foregone on the value of the calf, can be easily
obtained by rearranging equation (9) and is graphed as $VM(\theta)$ in fig. 1.
Slaughter occurs where $VM(\theta)$ is tangent to $pw$.⁹

In fact, however, animals are slaughtered at many different ages. This
occurs both because some consumers are willing to pay a premium per
unit weight for meat from either younger or older animals and because
feed costs differ for different producers. For the moment we ignore the
latter factor and consider only the implications of the former. Under our
original assumption a calf’s capital value dominates its slaughter value
until age $\bar{\theta}$, and no calf will be slaughtered until this age. Any
producer wishing to sell a calf will find a buyer who will continue to feed
the calf until age $\bar{\theta}$. However, any consumer wishing to purchase an
animal at a different age could do so if he were willing to pay a premium
price per pound. Meat from animals slaughtered at other ages must bring a
premium price because consumer prices must vary directly with $w(i, \theta)$
to ensure that the producer is fully compensated for the original value of
the calf and the value of the embodied feed inputs, including interest. To
restate, if people are willing to pay $p(\theta)$ for meat from an animal aged
$\theta$, $p(\theta)$ must be greater than $p(\bar{\theta})$, $\theta \neq \bar{\theta}$. To show
this, we return to the model where no feed inputs are required and consider
the cost per pound of producing animals of different ages. Take the case
$\theta < \bar{\theta}$. Because $\dot{w}/w > r$ at this age, the animal must be worth
more as a growing machine than as a consumption good. If the calf is
purchased for consumption, it must be at the price determined by the
capital value. This implies

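$$p(\theta)\,w(\theta) = p(\bar{\theta})\,w(\bar{\theta})\,e^{-r(\bar{\theta}-\theta)}. \tag{10}$$

But now let $\dot{w}/w = g > r$. Then

$$\frac{p(\theta)}{p(\bar{\theta})} = \frac{w(\bar{\theta})\,e^{-r(\bar{\theta}-\theta)}}{w(\theta)} = \frac{w(\theta)\,e^{(g-r)(\bar{\theta}-\theta)}}{w(\theta)} \tag{11}$$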


and $p(\theta) > p(\bar{\theta})$, $\theta < \bar{\theta}$. A similar proof can be used to show
that $p(\theta) > p(\bar{\theta})$ for $\theta > \bar{\theta}$. Therefore, although the value of
the animal itself, $VM(\theta)$, increases monotonically, the price per pound of
the beef from the animal, $p(\theta)$, will have a U-shaped age profile, as shown
in fig. 2. The inclusion of feed costs into the model, so long as costs are
the same for all producers, will increase the bowness of this profile.
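The U-shaped profile can be checked numerically. The sketch below is only an illustration under stated assumptions: the growth curve `weight`, its parameters `W_MAX` and `K`, and the normalization `p_bar = 1` are hypothetical and do not come from the paper. It takes the no-feed case and assumes that the value of an animal at any age is its slaughter value at the least-cost age, discounted or compounded at rate r.

```python
import math

# Hypothetical growth curve and parameters, chosen only for illustration.
W_MAX = 1000.0   # asymptotic weight
K = 0.8          # speed at which weight approaches W_MAX


def weight(age):
    """Weight at a given age: rises quickly at first, then levels off."""
    return W_MAX * (1.0 - math.exp(-K * age))


def least_cost_age(r):
    """Age at which the proportional growth rate w'/w has fallen to r."""
    # For this curve w'/w = K e^{-K a} / (1 - e^{-K a}); setting it equal
    # to r and solving gives a = ln((K + r) / r) / K.
    return math.log((K + r) / r) / K


def price_per_pound(age, r, p_bar=1.0):
    """Supply price per pound for an animal slaughtered at `age`."""
    theta_bar = least_cost_age(r)
    # Value of the animal: slaughter value at theta_bar, discounted back
    # (age < theta_bar) or compounded forward (age > theta_bar) at rate r.
    value = p_bar * weight(theta_bar) * math.exp(-r * (theta_bar - age))
    return value / weight(age)


if __name__ == "__main__":
    r = 0.10
    theta_bar = least_cost_age(r)
    for age in (theta_bar - 1.0, theta_bar, theta_bar + 1.0):
        print(round(age, 2), round(price_per_pound(age, r), 4))
```

Under these assumptions the price per pound equals `p_bar` at the least-cost age and exceeds it on either side, which is the U shape described in the text.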


⁹ As is clear in the mathematical formulation, the optimum slaughter age is a function
of the relative beef/feed price, $p/c$, not the absolute price of beef.



Fig. 2


The derivation of the least-cost-per-pound age, and the premium paid
for animals of all other ages, is useful for conceptualizing how consumer
preferences can affect the age distribution of slaughter, both cyclically
and secularly. However, although consumers’ preferences influence the
age distribution of slaughtered animals by determining the premium
consumers are willing to pay for beef from different-aged animals, these
preferences have only an indirect effect on either the relative or absolute
differences between the equilibrium market values of different-aged
animals. These relative prices will vary only slightly with changes in the
price of beef and feed and the interest rate, the changes depending on the
specific growth function of the animal. A similar result holds for the
absolute difference between the supply-determined market values of
two steers of different age. The supply-determined relative market values
of different-aged steers, such as at age $\theta_1$ and $\theta_2$, are easily found,
being given by equation (12):


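$$\frac{VM(\theta_1)}{VM(\theta_2)} = \frac{Ae^{r\theta_1} + (ci/r)\left(e^{r\theta_1} - 1\right)}{Ae^{r\theta_2} + (ci/r)\left(e^{r\theta_2} - 1\right)}, \qquad \theta_1 > \theta_2. \tag{12}$$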
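A short sketch of how such relative values could be computed, assuming the market value takes the form VM = A e^{r age} + (ci/r)(e^{r age} - 1), with the calf value A, the feed cost flow ci, and the interest rate r supplied by the user. The function and argument names are illustrative, not the paper's.

```python
import math


def market_value(age, calf_value, feed_cost_flow, r):
    """Supply-determined value of a steer: calf value and feed, with interest."""
    growth = math.exp(r * age)
    return calf_value * growth + (feed_cost_flow / r) * (growth - 1.0)


def relative_value(age_1, age_2, calf_value, feed_cost_flow, r):
    """Ratio of the market values of two steers of different ages."""
    return (market_value(age_1, calf_value, feed_cost_flow, r)
            / market_value(age_2, calf_value, feed_cost_flow, r))


if __name__ == "__main__":
    # Hypothetical magnitudes, for illustration only.
    print(relative_value(2.0, 1.0, calf_value=1.0, feed_cost_flow=2.0, r=0.10))
```

The price of beef enters this ratio only through the calf value, which is one way to see why consumer preferences affect relative market values only indirectly.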


II 

In the previous section simple economic models were employed to analyze
the impact of various parameter changes on the market value, input level,
and slaughter age of steers. A similar analysis is now carried out for cows.
As may easily be understood, the principal characteristic distinguishing
cows from steers, from an economic viewpoint, is the ability of the former
to produce calves.¹⁰ Cows may produce beef either directly, by being
fattened for slaughter, or indirectly, by bearing calves which may
themselves be fattened for slaughter. This option is reflected by including an
additional term in the profit equation analyzed previously. This term is
the present value of the expected calf stream,

$$\sum_{t=1}^{\theta} \frac{C(i,t)}{(1+r)^{t}}$$

where $C(i, t)$ is the expected value of the calf born in year $t$, assuming
the cow has been fed input stream $i$ throughout its life. This expected
value depends on the probability that the cow will have a calf in year $t$,
the respective probability that the calf will be male or female, and the
expected values of male and female calves in that year.
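The calf-stream term can be sketched in a few lines. The decomposition of the expected calf value into a calving probability, a sex probability, and sex-specific values follows the verbal description above; the function names and all numbers are hypothetical.

```python
def expected_calf_value(p_calving, p_male, value_male, value_female):
    """Expected value of the calf born in one year."""
    return p_calving * (p_male * value_male + (1.0 - p_male) * value_female)


def calf_stream_present_value(expected_values, r):
    """Discount yearly expected calf values; expected_values[0] is year 1."""
    return sum(c / (1.0 + r) ** t
               for t, c in enumerate(expected_values, start=1))


if __name__ == "__main__":
    # Illustrative figures only: an 80 percent calving rate for five years.
    yearly = [expected_calf_value(0.8, 0.5, 10.0, 12.0) for _ in range(5)]
    print(calf_stream_present_value(yearly, r=0.10))
```

Because the expected value of a female calf itself contains a calf-stream term, a full treatment would be recursive; the sketch takes calf values as given.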

Equation (13), the profit equation for females, includes three terms
reflecting, respectively, the calf stream, the inputs required to maintain
the animal, and the present value of the beef available at the time of
slaughter; their sum is equal to the value of a female animal at birth.

t= 1 (1 + r) Jo 

This equation can be used to determine the optimal slaughter age and
input stream for cows, as was done previously for steers. It is important
to note, however, that female calves have a distinctly bimodal optimal
slaughter age. This bimodality occurs because more female calves are
born than are needed for replacement purposes for the breeding herd.
As a result, some female animals are slaughtered as fattened heifers at age
$\theta_1$, before they have begun to bear calves, and some are slaughtered only
after their value as breeding animals has declined, at age $\theta_2$.¹¹ Female
calves are essentially homogeneous at birth, and producers are therefore
indifferent at the margin between retaining an animal for the breeding
herd or fattening it for slaughter. If the value of a female as a breeding
animal rises relative to its value as a slaughter animal, some females
formerly destined for slaughter will be withheld, and vice versa. This
switching will continue until an equilibrium is achieved.

¹⁰ Biologically, male animals are as essential as cows to the breeding process. Therefore,
the profit equation for male animals ought to include a breeding term as well. This term
was deleted in the previous section to simplify the analysis. The term does play an
important role in some situations, however, and will be discussed in the next section.

¹¹ Although cows may conceive until age 13, they are rarely retained in the breeding
herd beyond age 9 because their teeth wear down, making it increasingly difficult to feed.
This lowers the probability of conception, which is sensitive to the cow’s level of nutrition,
and makes it increasingly difficult for the cow to suckle a calf. Both factors decrease the
expected value of the calf stream, prompting slaughter.

¹² The models discussed in this paper focus on the partial-equilibrium behavior of
producers facing exogenous changes in prices, although clearly such prices are endogenous
to the economic system as a whole. Without an endogenous solution, many relationships
such as the bimodal slaughter distribution would not hold.




As should be clear, an analysis similar to that carried out for steers can
be used to show that the immediate response of both heifer slaughter and
cow slaughter to an increase in the price of beef is negative. The higher
beef price, or lower feed cost, makes it profitable to feed to heavier
weights and to retain animals for calf production.


III

The models presented in the two previous sections can also be utilized
to show that the magnitude of the slaughter response will differ for different
types of animals.

At any single point in time there is a fixed supply of animals in the
herd for which there exist two types of demand: consumer and producer.
As long as a producer is willing to outbid the consumer to retain the animal
as a productive asset, the animal remains in the herd. When the consumer
wins, the animal is slaughtered. Our interest is then to determine how the
relative strength of the bidders’ demand for different types of animals
varies with exogenous shocks to the system, for example, devaluation and
climatic variation.

Some insight is provided by the partial-equilibrium models developed
previously. We examine first the ceteris paribus change in relative value
between a steer ready for slaughter and a newly castrated calf, which
follows an “exogenous” increase of 10 percent in the price of steer beef.
Assume that the slaughter age of the steer is unchanged in the new
situation and that the value components of the profit equation originally
have the following magnitudes, which are the approximate relationships
prevailing for a steer calf in Argentina:

$$pwe^{-r\theta} = a = 5, \tag{14}$$

$$ci\int_0^{\theta} e^{-rt}\,dt = b = 4. \tag{15}$$


A steer calf then has value $A = a - b = 5 - 4 = 1$, and we find that a
10 percent increase in the price of steer beef produces a theoretical rise of
50 percent in it:


$$\frac{A'}{A} = \frac{1.1(5) - 4}{5 - 4} = 1.5. \tag{16}$$


The further away is the age of expected slaughter, the greater the change
in the capital value of the animal.
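The arithmetic can be reproduced directly. In the sketch below the function names are illustrative; the magnitudes 5 and 4 are those given in the text, and the case with no remaining feed cost is added here only as a stand-in for a steer that is ready for slaughter.

```python
def calf_value(beef_present_value, feed_present_value):
    """Capital value: discounted beef value less discounted feed cost."""
    return beef_present_value - feed_present_value


def proportional_rise(beef_present_value, feed_present_value, price_increase):
    """Proportional change in capital value after a beef price increase."""
    before = calf_value(beef_present_value, feed_present_value)
    after = calf_value(beef_present_value * (1.0 + price_increase),
                       feed_present_value)
    return after / before - 1.0


if __name__ == "__main__":
    # Newly castrated calf: most of the feed cost is still to be incurred.
    print(proportional_rise(5.0, 4.0, 0.10))
    # Animal with no remaining feed cost (assumed case, for comparison).
    print(proportional_rise(5.0, 0.0, 0.10))
```

The larger the share of beef value that is still to be absorbed by future feed costs, the more strongly a given price change is magnified in the capital value.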

The effect is even greater in the case of female calves, or male calves
which have not been castrated. In either case, the option exists to retain
the animal to an older age, but more important, an increase in the value
of a calf increases the calf stream value of the female calf, or the stud value
of the male calf, and the recursive effect of further calf price increases
would be carried on indefinitely if there were no damping force. Thus,
the proportional “instantaneous” increase in the value of a “breeding”
calf should be substantially greater than either the original rise in the price
of steer beef or the rise in the value of a castrated male calf.

The “instantaneous” increase in value which is reflected in these
models is, of course, a partial-equilibrium result. These models take no
account of the fact that other adjustments, perhaps quite rapid ones, will
occur in response to a beef price increase. For example, the change in the
value of the steer calf discussed above reflects the change in the value
of the animal as a capital good. As this value rises, producers will respond
by retaining more such animals to be used for future production, and will
reduce the number currently slaughtered. The resulting reduction of
current slaughter will increase even more the current price of beef but will
also increase the future supply of beef, thereby lowering the expected
future price of beef and perhaps increasing the cost of feed. As the capital
value of an animal depends on expected as opposed to current prices, the
movement of expected prices will dampen, at least at some point, the
tendency for the relative prices of animals to change.¹³

The process will be facilitated by the fact that beef from different
animals is highly substitutable in consumption. Thus, as the relative
prices of certain types of animals begin to increase, the consumption of
these animals is reduced. Market prices are constrained by the high price
elasticity of consumer demand across animal categories, and this
constraint allows producers to bid away more easily those animals with more
sensitive capital prices.

¹³ For simplicity, I have assumed in the model to this point that the price of beef as a
function of age and of food inputs is given. The same results hold if producers’ expected
prices are determined similarly. It is necessary, however, to consider the formation of price
expectations if the dynamics of the adjustment process are to be clear. The capital values
of certain animals may be decreasing even at a moment when their current slaughter
value is increasing. This movement, which prevents any price movement from being
cumulatively destabilizing, requires that price expectations take into account future
supply and demand, rather than naively extrapolating current price movements.

¹⁴ Only when no more cows and heifers can be withdrawn from slaughter will the
“tie” via consumer demand between their prices and those of other animals be broken.
Under normal conditions, given the number of cows and heifers sent to market each year
and the cost of making large sudden changes in the size of the breeding herd, this does not
occur.

This consumption constraint is important in two other respects as well.
First, because it limits the relative price variation, the differential effect
of a price increase indicated in the partial-equilibrium models will tend
to be reflected in the slaughter response of the different animal categories.
The slaughter response also depends on the relative availability of the
animals, but the consumption constraints play an important role. Second,
the limitation on the relative price variation allows the use of a single
price, such as the price of 2-year-old steers, as the price variable for all
types of cattle in a disaggregated econometric model, without great loss
of accuracy. This is helpful because it is difficult to identify the consumer
demand for each type of animal.

Although each animal category should exhibit a negative short-run
response to a price increase, the information contained in the immediately
preceding paragraphs suggests an expected ranking of slaughter elasticities
of the different categories. The differences in these slaughter elasticities
will reflect both the sensitivity of the value terms in the profit equation,
as shown in the partial-equilibrium models, and also the relative
availability of each animal category. The degree of instantaneous impact of a
parameter change on the capital value of the animal depends on the
expected time lapse before slaughter and on the presence of the breeding
term with its recursive effect. The relative supply is also important,
however, for the elasticity of slaughter response refers to the percentage
change in the number slaughtered. The larger is the number of animals
of a given category relative to the number needed to satisfy the increased
herd demands, the lower the expected elasticity. This point, which may
appear tautological at first, may be illustrated by considering again the
role of the breeding term in the profit equation for male animals.

In principle, there is no difference in the equation for male and female
calves at birth. A male calf also has a bimodal optimal slaughter age, for
it too can be fattened to be slaughtered for beef or retained to enter the
herd as a breeding animal. As each male calf theoretically has the potential
to do either, each male profit equation should contain a breeding value
term. Accordingly, the value of a male calf at birth may be as sensitive
as that of a female calf; the value of a male calf becomes less so only after
castration. Following castration male calves’ values will be less sensitive
because their productive value then depends only on their ability to
convert feed into beef, but this difference is an artificial factor introduced
by producers.¹⁵

¹⁵ The decision to fatten or to retain for breeding is generally taken quite early in the
case of males because very few males are needed for breeding, and also because the costs
of castration are lower, the earlier the age. Castration sacrifices the value of the breeding
component and generally inhibits growth, but it does make the animal more docile, thus
reducing management costs and the likelihood of future injury.

The price elasticity of female slaughter is, however, normally greater
than that of males because there are fewer female animals available
relative to the need for the breeding herd. One bull may service many
cows, and hence fewer bulls than cows are required for the breeding herd.
Because male and female calves are born in roughly equal numbers, the
proportion of male calves which must be retained from slaughter each
year for replacement purposes is considerably smaller than the proportion
retained of female calves. Similarly, when the size of the breeding herd is
changed, the proportion of male animals switched from prospective
slaughter to the breeding herd is generally much less than the proportion
of female animals because there are many more male animals destined
for slaughter and many fewer required for any desired change in the
breeding herd. The availability factor immediately produces a difference
in the slaughter elasticities of the two types of animals.
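The availability argument can be illustrated with a small calculation. All numbers below are hypothetical (1,000 calves of each sex, one bull per twenty cows, and a breeding-herd expansion that withholds 20 extra females and 1 extra male); they are chosen only to show the direction of the effect.

```python
def slaughter_change(calves_born, normally_retained, extra_retained):
    """Proportional change in slaughter when extra animals are withheld."""
    normal_slaughter = calves_born - normally_retained
    return -extra_retained / normal_slaughter


if __name__ == "__main__":
    # Hypothetical herd: replacement needs of 200 females and 10 males.
    female = slaughter_change(1000, normally_retained=200, extra_retained=20)
    male = slaughter_change(1000, normally_retained=10, extra_retained=1)
    print("female slaughter change:", female)
    print("male slaughter change:", male)
```

With equal numbers born, the same proportional expansion of the breeding herd removes a much larger share of the females that would otherwise have been slaughtered, so the measured female slaughter response is larger.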

It is therefore difficult to generalize the expected elasticity of slaughter
response. One must consider the difference between the animal’s actual
and expected slaughter ages, its breeding potential, and the normal
distribution of slaughter. A convenient rule of thumb suggests that female
animals should have a higher slaughter elasticity than males, and younger
animals, a higher elasticity than older. For example, while male calves,
even prior to castration, should demonstrate a lower elasticity than
females (because of the lower relative demand for the breeding herd, not
the absolute lack of breeding potential), and while both male and female
calves should have a more elastic slaughter response than either steers or
cows, bulls should demonstrate a more elastic response than steers, despite
the fact that the bull is generally an older animal. Further, it is not strictly
necessary that heifers exhibit a higher slaughter elasticity than yearling
steers, for although some heifers may be switched from slaughter to the
breeding herd, those which are not cannot be withheld for further
fattening very long. It is not profitable. In contrast, yearling steers, even
though lacking breeding potential, are frequently withheld for fattening
for a longer period of time.

IV 

The previous models suggest that the immediate slaughter response is
negative for all categories. This does not necessarily imply that the
estimated beef/feed price coefficients in the slaughter equations of an
econometric model will be negative for all categories. First, an increase
in the price of beef which is not expected to last could lead to increased
rather than decreased slaughter in the very short run. It is necessary to
differentiate between the response to an expected price and the estimated
coefficient on a past or existing price variable. This is a well-known
problem.

Second, slaughter response is quite different from production response.
The attempt by producers to increase production requires a reduction in
slaughter in the short run, and the stronger is this attempt, the sharper
is the drop in slaughter. Production, however, will increase eventually to
allow greater slaughter. As the period of observation grows longer, the
(net) slaughter response becomes less negative, eventually becoming
positive. The estimated sign of the beef/feed price coefficient in the
slaughter equation therefore depends entirely on the length of the period
of observation; that is, a quarterly model is more likely to exhibit a
negative beef-price slaughter elasticity than an annual model. The
rapidity with which the buildup in stocks is reflected in a higher slaughter
flow is likely to vary across categories.

Third, the fact that animals can pass through several categories during
1 year means that “switching” caused by price changes can affect the
estimated price coefficients. According to Argentine definition, an animal
is a calf from birth to 9 months, a yearling from 9 to 18 months, and a
steer from then until slaughter. A calf aged 8 months at the beginning of
the year theoretically could be slaughtered during the year in any of the
three categories. Thus a price increase which causes all animals to be fed
to heavier weights may cause some animals to be withheld just long enough
to be slaughtered in a category different from that in which they would
have been slaughtered before the price increase. This effect could make it
incorrectly appear that the animals in older categories had a positive
price-slaughter response.¹⁷
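The category switching described here is easy to trace. The sketch below uses the age limits stated in the text; treating an animal of exactly 9 or 18 months as already belonging to the older category, and checking ages at monthly intervals, are assumptions made for the illustration.

```python
def category(age_months):
    """Category of a male animal under the Argentine definition."""
    if age_months < 9:
        return "calf"
    if age_months < 18:
        return "yearling"
    return "steer"


def possible_categories(age_at_start, months_in_period=12):
    """Categories in which the animal could be slaughtered during the period."""
    seen = []
    for month in range(months_in_period):
        current = category(age_at_start + month)
        if current not in seen:
            seen.append(current)
    return seen


if __name__ == "__main__":
    print(possible_categories(8))   # calf aged 8 months at the start of the year
```

An animal withheld for only a few extra months can therefore be recorded in an older category, which raises measured slaughter there even though no additional animals are being slaughtered.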

Finally, because a price increase can alter the age distribution of the
slaughtered animals within each category, a price increase can also
change the amount and type of beef produced even if the number of
animals slaughtered is not affected. This possibility suggests the need
to include in the model equations estimating the average slaughter
weight of the different categories if a good prediction of total beef
production is desired.

V 

The micro-models developed previously can also help explain the
distribution of production activities among regions or among countries
once the assumption of equal input costs for all producers is dropped.
For example, assume that the cost of transportation for dressed beef is
negligible but is considerable for live animals, and that consumers are
willing to pay only very small premiums for beef from different-aged
animals. This makes the consumption value of the animal essentially a
function of weight. Thus, in regions where feeding costs are relatively
higher, the capital value of calves will tend to be lower, at least up to
some age $\theta_1$. Because

$$A = pwe^{-r\theta} - ci\int_0^{\theta} e^{-rt}\,dt,$$

for any given $p$ and $r$, a higher $c$ will be associated with a lower $A$. Further,
as $VM(\theta) = Ae^{r\theta} + (ci/r)(e^{r\theta} - 1)$, a higher $c$ results in a lower
$VM(\theta)$ during the early part of the animal’s life, $\theta < \theta_1$, and a higher
$VM(\theta)$ after some point in time, $\theta > \theta_1$. This may be graphed as in
figure 3.
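A numerical sketch of this crossing, under a simplifying assumption that is not in the original: both regions are evaluated against the same slaughter value at the same reference age, so that the two value curves meet there. The feed cost flows (10 and 20), the slaughter value (100), the reference age (3), and the interest rate are hypothetical.

```python
import math


def calf_value_at_birth(slaughter_value, slaughter_age, feed_cost_flow, r):
    """A = p w e^{-r theta} minus the discounted feed cost up to theta."""
    discounted_feed = (feed_cost_flow / r) * (1.0 - math.exp(-r * slaughter_age))
    return slaughter_value * math.exp(-r * slaughter_age) - discounted_feed


def market_value(age, calf_value, feed_cost_flow, r):
    """VM(age) = A e^{r age} + (ci/r)(e^{r age} - 1)."""
    growth = math.exp(r * age)
    return calf_value * growth + (feed_cost_flow / r) * (growth - 1.0)


if __name__ == "__main__":
    r, theta, slaughter_value = 0.10, 3.0, 100.0
    for label, feed in (("low-cost region", 10.0), ("high-cost region", 20.0)):
        a = calf_value_at_birth(slaughter_value, theta, feed, r)
        values = [round(market_value(age, a, feed, r), 2) for age in (1.0, 3.0, 4.0)]
        print(label, round(a, 2), values)
```

Under these assumptions the high-cost region has the lower calf value and the lower market value before the reference age, and the higher market value after it.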

¹⁶ Recent work by Nores (1972) confirms this.

¹⁷ Another reason for encountering a positive price coefficient has been suggested by
Yver (1971) in subsequent work and is discussed later.


Fig. 3

Considering regions like Europe and Argentina, we can see why veal
in Europe is absolutely cheaper per pound than in Argentina, although
mature beef is much cheaper in Argentina. If transportation costs were
zero, calves born m Europe would have value Jt, at birth and be shipped 
to Argentina to be fattened to age As transportation costs are not zero, 
the value of calves is ^^d they arc fattened until slaughter in Europe 
But they will not be fattened past $\theta_1$, because after this age imported beef
of the same quality is cheaper. The cost per animal of varying ages in
Europe therefore should have the shape of the envelope in figure 4. As
can be seen, the existence of many producers located in regions with
different feed costs implies that the observed relative cost per pound of
beef aged $\theta_i$ will vary less than if there were only one producer. Further,
the absolute cost of beef from younger animals is cheapest in the
higher-feed-cost regions. This relationship is shown in figure 5, where it is
assumed that there is strict consumer indifference among beef from
animals aged $\theta_1$, $\theta_2$, $\theta_3$, and $\theta_4$. Each country is shown to be
relatively more efficient in the production of beef from a certain range of
animal ages.

Fig. 4

Fig. 5

¹⁸ Thus, part of the observed European “preference” for veal results from veal’s lower
relative price therein.

The regional location of production activities within a country is
determined similarly. Producers with different feed costs will choose
different parts of the production process. For example, breeding operations
will usually take place in areas where the opportunity cost of feed is cheap,
that is, where the cost of maintaining the cow year round is less than the
value of the calf at birth. Because all calves will have the same value at
birth in a unified market, it will not be profitable to maintain breeding
herds in high-cost feed areas unless producers there are more efficient,
that is, unless their herds have higher calving rates and lower mortality
rates than herds elsewhere.¹⁹

Breeding may also take place as a subsidiary or complementary activity;
in some areas cows are maintained primarily for milk production. In this
case the profit equation should include a component reflecting the present
discounted value of the future milk stream,

$$p_m \int_0^{\theta} m(i,t)\,e^{-rt}\,dt,$$

where $m(i, t)$ is the quantity of milk produced by a cow aged $t$, fed inputs
$i$, and $p_m$ is the price at which this milk can be sold. In these areas the milk
component must compensate for the lower net value of the calf stream,
which is implied by the (usually) much higher feed costs.

The fattening process may enjoy similar regional specialization,
depending on the opportunity cost of feed in different regions and the
length of the period for which it is available. Upon weaning, calves are
usually sent to fattening regions where feed suitable for rapid weight gain
is available. After this the animals may be sent to market or sent to
lower-cost grazing lands for final finishing. Whether any individual animal is
sold to slaughter or to further fattening depends on the current market
price for slaughtered beef, whether any producer has feed available at a
cost low enough to permit further profitable feeding of this animal, and
transaction and transport costs. If current prices for feeder animals and
expected future prices for finished animals are in the correct ratio,
producers who have low-cost feed will decide to purchase animals to
utilize it and then sell them at the end of the period of feed availability,
either to other producers for further fattening or directly to slaughter. In
many areas there will be feed during a particular part of the year which
can be fed to cattle quite cheaply; for example, winter wheat can be
grazed for several months without damage to the crop, and the wheat
stubble may be grazed following the harvest. Hence wheat and cattle
may be complementary during these periods, even though they may be
competitive activities in other situations.

¹⁹ In Argentina the major breeding area is the Salada River basin, where drainage and
land quality are relatively poor. Sufficient pasture for the cows and calves is usually
available during the crucial periods of the year, but grain and/or forage crops for
year-round fattening normally cannot be grown here. Breeding herds are maintained
elsewhere, however, and there is evidence that these producers achieve higher calving rates.
See Jarvis (1969, chap. 8).

²⁰ There is no reason that the capital values of male and female calves at birth should
be identical. For dairy animals male and female calf values will differ substantially, and
the behavior of producers toward calves of different sex should also differ sharply. Male
calves are produced solely as a by-product of the attempt to obtain female calves for future
milk production, and male calves will be slaughtered shortly after birth.




VI 




Steers are now slaughtered at younger ages in Argentina than was true
some years ago. There are several reasons for this. First, the effective
interest rate (rate of discount) faced by farmers has increased, for farmers
have become more sensitive to alternative investment opportunities. This
should reduce $\bar{\theta}$, the least-cost-per-pound age, although the effect would
likely be small. Second, the relative cost of feed inputs has also risen,
that is, the cost of pasture and forage has risen relative to the price of beef,
especially as the grain yields for land have risen. Because younger animals
convert feed into beef more efficiently than older animals, a rise in the
relative price of feed will tend to reduce the slaughter age. Thus both cost
factors considered by this model have tended to reduce the slaughter age
of animals in Argentina.

The slaughter age also depends on the extent of the premium which
consumers will pay for beef of various ages. Here too the trend in recent
years seems to have favored the slaughter of younger animals. Many
consumers now actually prefer the leaner beef from a younger animal
and are quite unwilling to pay the premium which was once received for
older, fatter beef. Lucio Reca has suggested that this is due not only to a
change in tastes for the beef itself but also to the growth of the vegetable
oil industry, which has provided an attractive and preferred substitute for
animal oils. As one of the primary sources of the latter used to be fatty
beef, the demand for fat as a complementary product of beef has
decreased.

New breeds of cattle and better pasture management have also
permitted higher growth-conversion rates at younger ages, and an earlier
“leveling-off,” leading to slaughter at an earlier age.


VII 

The previously developed micro-theory was used to specify the equations
of an econometric model of the cattle sector. The model seeks to explain
the changes in the herd size, the number and weight of the animals
slaughtered, domestic consumption, exports, and the price of beef in
reaction to exogenous shocks such as climatic variation, devaluation, a
change in the level of domestic income or population, and so forth.
Because of the scarcity of space only a cursory description of the model
and the empirical results is presented; a more detailed discussion is
contained in Jarvis (1969, chaps. 5-9).

²¹ Personal conversation. Reca was then director of the Department of Agricultural
Economics, Ministry of Agriculture, in Argentina.







The model contains 21 equations, including six identities. There is for
each of the six animal categories an equation to estimate the number
slaughtered and an equation to estimate the average weight of those
slaughtered. The number of calves born is also estimated, but the number
of animals in each of the other five categories in year $t$ is given identically
by making use of the constraint relating the herd stocks in 2 different years
with births, slaughter, and natural deaths. Two equations estimate
domestic and foreign consumption (export) demand, each in terms of
tons of beef, and a final identity equates beef produced from slaughter
with total consumption.

The model satisfies the posterior rank-and-order conditions for
identification; all the equations are overidentified. The model was estimated
using ordinary least squares and instrumental variables. The results were
highly similar, and only the latter are presented here. Moreover, as this
paper is concerned primarily with the behavior of slaughter as a test of
producer behavior, only the slaughter equations are given. An aggregate
slaughter equation is provided for comparison. See table 1.

The slaughter equation has essentially the same form for each category:

$$S_t = aH_t + f(x^1, \ldots, x^n) + E_t, \tag{17}$$


where the function $f$ contains both current and lagged variables. This
approach assumes that there is a permanent or normal component and a
transitory component to observed slaughter. In equilibrium a constant
proportion of the herd, or category, is slaughtered each year. This number,
however, may be increased or decreased depending on the desires of
producers, which in turn depend on the level of certain parameters. These
parameters, such as the current price level or current climatic conditions,
affect producer expectations and thereby the size of the desired future
herd.

²² It was assumed that the margin between producer and retail prices is constant so
that only one price is endogenously determined in the system.

²³ A discussion of the interpretation of this form versus that of a lagged stock adjustment
model is contained in Jarvis (1969, chap. 6). Several nonlinear formulations of this model
were estimated, but each was less satisfactory than the form shown here.

Because the coefficient on the stock variable should approximate the
average number of animals slaughtered each year, the other independent
variables are included as deviations from their mean. Thus, in a “normal”
year, the “normal” proportion is slaughtered. However, the proportion of
the stock slaughtered in certain categories has varied secularly, and it was
necessary to allow for this occurrence. The secular increase in the calving
rate has provided a rising number of heifers relative to the replacement
needs of the breeding herd, and a growing proportion of the heifer stock
has been slaughtered over time. Similarly, a growing preference for leaner
beef has increased the proportion of yearlings slaughtered. And finally, it
appears that a greater, although still small, proportion of bulls is being
raised specifically for beef production, thus increasing the rate of slaughter
of the bull herd.²⁴ These effects were included by making the “normal”
proportion slaughtered a function of time:

$$aH_t = (a_0 + a_1 t)H_t = a_0 H_t + a_1 (tH_t). \tag{18}$$

The coefficient $a_1$ was significant for several of the categories, and after
this adjustment $a$ was in fact equal to the mean number slaughtered for
each category.
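The time-varying “normal” proportion amounts to adding the product of time and the herd stock as a second regressor. The sketch below illustrates this on made-up data with plain least squares and no other variables or error term; it is not the estimation procedure reported in the paper, which also used instrumental variables, and the function name and figures are hypothetical.

```python
def fit_normal_proportion(herd, slaughter):
    """Least-squares fit of S_t = a0 * H_t + a1 * (t * H_t), no intercept."""
    x1 = list(herd)                                  # H_t
    x2 = [t * h for t, h in enumerate(herd)]         # t * H_t
    s11 = sum(u * u for u in x1)
    s12 = sum(u * v for u, v in zip(x1, x2))
    s22 = sum(v * v for v in x2)
    b1 = sum(u * s for u, s in zip(x1, slaughter))
    b2 = sum(v * s for v, s in zip(x2, slaughter))
    det = s11 * s22 - s12 * s12
    # Solve the two normal equations by Cramer's rule.
    a0 = (b1 * s22 - s12 * b2) / det
    a1 = (s11 * b2 - s12 * b1) / det
    return a0, a1


if __name__ == "__main__":
    # Made-up herd stocks; slaughter generated with a0 = 0.20, a1 = 0.01.
    herd = [100.0, 104.0, 103.0, 110.0, 115.0, 112.0]
    slaughter = [(0.20 + 0.01 * t) * h for t, h in enumerate(herd)]
    print(fit_normal_proportion(herd, slaughter))
```

A positive estimate of the coefficient on the interaction term corresponds to a proportion slaughtered that rises over time, as described for heifers, yearlings, and bulls.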

Consider also the specification of the price coefficients. It is assumed
that producers respond to an expected price when making their slaughter
decisions and that this expected price is a function of current and past
prices and the current rate of change of price. The higher the price in any
year, or the greater the current rate of change of price, the greater is the
expected price and the greater the desired future herd. Several alternative
specifications for the price variables were tried, as it was not certain how
producers might form their expectations, or whether the same variables
would have the same effect in different equations. The relevant expected
price for each category is the price expected to prevail at the time the
animal, or its product, will be sold. The producer need be concerned in
the case of steers only with the price which will hold in the immediate
future, whereas the relevant price for a breeding heifer is the average price
which will prevail over the period in which her calves will be born. Thus,
the forms of the lagged distribution need not be the same for all categories.
Nonetheless, except in two cases they are highly similar.

The three alternative specifications estimated are based on the
respective assumptions that the relevant expected price is a function of (1)
current and past prices, with no weight being given to the current rate of
change; (2) the current price, past prices, and the current rate of change;
(3) past prices and the current rate of change. The results were as follows.
For every category the rate of change of price was significant in
formulation 2, which produced uniformly better results than formulation 1.
Formulation 3 did, however, provide better results than formulation 2 in
three cases: steers, yearlings, and heifers. That is, the rate of change of
price was always a significant determinant of slaughter, and for three
categories, those male animals which are destined to be slaughtered in a
reasonably short period of time, better results were obtained when only
the current rate of change of price was included and the current price
level excluded. Following the estimation of these three general
formulations, several additional equations were estimated for each category to

** Producers may also be culling their bulls more systematically and slaughtering
breeding bulls at lower average ages.




Table 1 (Continued)

determine the precise specification of the price and climatic lag
distributions which provided the best results. A discussion of these results
and their sensitivity (low) to changes in specification is included in Jarvis
(1969); only the preferred results for each equation are presented in
table 1.
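
The three formulations differ only in which price terms enter each slaughter equation, which the following sketch makes concrete. It is a minimal sketch under stated assumptions: the function name `build_price_regressors` is hypothetical, the "current rate of change" is taken to be the first difference of price, and the lag weights are left unrestricted. In an unrestricted regression with at least one lag, formulation 2 would be perfectly collinear, since the first difference is a linear combination of the current and lagged price; the paper's footnotes mention a polynomial distributed lag, which restricts the lag weights, and that restriction is not reproduced here.

```python
def build_price_regressors(prices, n_lags, formulation):
    """Return one row of price regressors per usable year t.

    formulation 1: P_t, P_{t-1}, ..., P_{t-n_lags}
    formulation 2: the same terms, plus the first difference P_t - P_{t-1}
    formulation 3: P_{t-1}, ..., P_{t-n_lags}, plus the first difference
    """
    if formulation not in (1, 2, 3):
        raise ValueError("formulation must be 1, 2, or 3")
    # Start where both the longest lag and the first difference exist, so
    # that all three formulations cover the same years.
    start = max(n_lags, 1)
    rows = []
    for t in range(start, len(prices)):
        lagged = [prices[t - k] for k in range(1, n_lags + 1)]
        change = prices[t] - prices[t - 1]  # assumed measure of rate of change
        if formulation == 1:
            rows.append([prices[t]] + lagged)
        elif formulation == 2:
            rows.append([prices[t]] + lagged + [change])
        else:
            rows.append(lagged + [change])
    return rows


prices = [1.0, 2.0, 4.0, 7.0]  # hypothetical relative price series
f1 = build_price_regressors(prices, n_lags=1, formulation=1)
f2 = build_price_regressors(prices, n_lags=1, formulation=2)
f3 = build_price_regressors(prices, n_lags=1, formulation=3)
```

Each row could then be joined with the stock, climatic, and other variables for the same year before estimation.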

The econometric results are quite satisfactory. Each equation explains
well the variation in the slaughter of the respective category, and, more
important, the coefficients of the independent variables are highly
significant and consistent both in sign and magnitude with the previously
developed theory. A negative coefficient is expected on each of the price
variables in the slaughter equations. As can be seen, this is true for every
category except steers and bulls; in these equations the coefficient on the
price in year t is negative, but the other price variables have positive
coefficients. Several possible explanations for this are suggested
subsequently. The equations in table 1 indicate that the annual rate of
slaughter of animals of every category but one, steers, is reduced
temporarily by an increase in price. For five categories the net impact of the
current rate of change of price and price level on slaughter is negative,
and greatest, relative to the mean number of animals slaughtered, for
calves, heifers, yearlings, cows, and bulls in descending order. These
results are consistent with the theory developed previously. The impact on
steer slaughter is probably positive but extremely small.

One might expect that the effect of the current rate of change of price
would be strongest for those animals destined for slaughter in the near
future; that is, the extrapolation of the current rate of change might hold
for short periods only, with longer price expectations being based on an
average of past prices. Empirically there is some support for this. The
relative impact of the current rate of change of price is, in descending
order, strongest for heifers, yearlings, calves, cows, bulls, and steers,
although the latter two are positive. The animals most strongly affected
are then heifers and yearlings, animals which are destined for slaughter
within a shorter period of time than are calves or cows. The normal
proportion slaughtered of the heifer and yearling steer stocks is small,
and it appears that producers are easily able to retain even greater
numbers. Yearlings are retained for further fattening and are held to steer age
in many cases, while heifers are retained for both additional fattening and
breeding.

As noted above, the sign pattern of the price coefficients in the steer
and bull equations appears at first to be contrary to expectation. The
coefficient on the rate of change of price is positive in both equations,
as are the coefficients on the lagged price distribution after year t - 1.
The latter pattern appears unreasonable because it implies that the
expected price is an inverse function of past prices. However, the sign pattern
could be explained by either of two effects. First, a price increase inducing




producers to withhold yearlings from slaughter in greater numbers may
cause a temporary change in the age distribution of the steer herd. This
change can affect the percentage of the steer category slaughtered in
future years as the adjustment works itself out. In the model used, the
coefficient on the stock variable is not allowed to vary cyclically, and the
effect of changing the proportion of the stock slaughtered over the cycle
is forced onto the lagged price variables. Second, an enduring increase
in the beef/grain relative price ought to increase the number of steers
slaughtered relative to the number of yearlings, and correspondingly
lower and raise the respective proportion of the two categories slaughtered
annually. If so, the positive coefficients on the lagged price variables in
the steer slaughter equation could reflect this effect. Such an
interpretation is supported by the long and significant distributed lag on the
price coefficients, of opposite (and expected) sign, in the yearling and
steer slaughter equations.

The positive sign on the rate of change of price in the steer equation
has several possible explanations. First, if the price effect causes yearlings
to be withheld, as it does, but only for a moderate time period, the
withheld yearlings may become steers within the year and thus be slaughtered
in a different category. This could make it appear incorrectly that the
steers existing at the time of the price increase were slaughtered rather
than withheld. The question, of course, is whether enough such withheld
yearlings could be slaughtered as steers within the year to explain the
observed coefficients.**

Another explanation has been offered by Raul Yver (1971), who
suggests instead that the theory developed in this paper can be made
entirely consistent with an actual increase in the slaughter of steers;
that is, the econometric result need not stem from measurement
difficulties. He argues that if producers face a short-run feed constraint, they


The number of yearlings being withheld and slaughtered as steers in the same year
need not be large, for the absolute magnitude of the coefficient on the rate of change of
price in the steer equation is only one-half that of the corresponding coefficient in the
yearling equation. Moreover, the coefficients on the rate of change of price in the equations
explaining the average slaughter weights of steers and yearlings (not presented here) are
positive, but insignificant. The slaughter-weight results suggest that there is no significant
change in the average age distribution of either category, as would be the case if the early
slaughter of steers were prompted by a price increase. Yver (1971) cites data on the
average slaughter weights of the different categories, including their cyclical variation, to
suggest that withheld yearlings could not reach steer weight within a period of 1 year.
The data given, however, provide evidence only for the mean, not the whole distribution
of slaughter weights, and hence do not prove the point. Moreover, the yearlings withheld
are likely to be fed more intensively than the average yearling, thus gaining more rapidly.
The yearlings will be retained by producers newly entering the fattening activity, using
land which was formerly dedicated to grain production and which is of high quality.
Their better feed suggests that these animals will grow more rapidly, and should be
slaughtered younger than were steers previously, especially as they will also face a higher
opportunity cost for feed.




will be unable to increase the herd in the short run as much as they would
like to do in the long run. Their desire to retain animals of all ages will
cause a rise in the opportunity cost of feed, which in turn will prompt the
slaughter of some. The animals most likely to be affected will be those
which are nearing their time of slaughter, such as steers, for the capital
values of animals with longer productive lives will be less sensitive to a
short-run change in the cost of feed.

Yver’s explanation is ingenious and plausible, and I am sorry not to
have thought of this possibility earlier. I recognized that feed costs would
rise as the demand increased, but had not considered that this could be
sufficient to actually increase the slaughter of certain categories in the short
run. As discussed shortly, this may be the situation for bulls as well as
steers. However, the feed constraint could not last very long, as additional
land could be made available for pasture and forage if larger herds were
desired. Therefore the higher rate of slaughter as reflected in the lagged
price coefficients could not be due to a feed constraint but must be caused
by something else. More important, Gustavo Nores (1972) has estimated
a quarterly model of the Argentine cattle sector in which the coefficient on
current quarter’s price in the steer equation is negative and highly
significant. Whether yearlings cross categories, or whether an increase in the
opportunity cost of feed occurs with some lag, inducing producers to sell
more steers in the intermediate run, the immediate slaughter response of
steers appears to be negative.

The price coefficients in the bull equation are remarkably similar to
those in the steer equation but somewhat more puzzling, as bulls, being
breeding animals, were expected to behave similarly to cows. A suggested
explanation is that a large number of slaughtered bulls are not cull bulls
but bulls raised specifically for slaughter. Castration reduces the rate of
growth of an animal, and some producers are willing to forego the
advantages of easier herd management to obtain greater efficiency in feed
conversion. This willingness has increased over time, also explaining why
the bull stock coefficient in the slaughter equation has risen secularly.
Thus, the animals in the bull category are more heterogeneous than are
the animals composing other categories, and the price coefficients in the
bull equation reflect the net effect occurring from the withholding of bulls
for the breeding herd and the increased slaughter of those bulls fattened
for that purpose. The apparent increased slaughter in the latter supports
Yver’s argument because the average slaughter weight of bulls declines

When I estimated the steer slaughter equation without including the current rate of
change of price, the coefficient on the current price was invariably negative, though
small in magnitude. The use of a polynomial distributed lag could have forced the
negative coefficient on the price in the current period in my results, but Nores does not
use a constrained distributed lag.

James H. Cothern, extension economist, University of California, Davis, suggests
that this has occurred in the United States, and I assume it may be true in Argentina as
well.




significantly in response to a price increase, suggesting the slaughter of
younger and lighter animals in greater proportion at this time. It is
difficult to argue that this effect is created by animals changing categories.

An aggregate slaughter equation was also estimated for comparison
with the individual-category slaughter equations. It is presented in table
1. This equation performs very satisfactorily, but it does not yield the
detailed insights into the behavior of producers, and the information
covering the changing composition of slaughter, which the individual-category
equations provide; nor is the predictive ability of the aggregate
slaughter equation, in the limited context in which this is understood here,
as great. By way of comparison, the squared correlation coefficient
between the sum of the predicted values for the individual slaughter
equations and the actual total slaughter is .976, and the regression
coefficient of the actual total slaughter on the sum of the predicted slaughter
from the disaggregated equations is .996. Each of these results suggests
that the set of disaggregated equations explains the variance in slaughter
better than does a single aggregate equation. The same result holds
when the average-slaughter-weight equations are considered.
Disaggregation provides more information about the operation of the cattle
sector with respect to producer behavior and also provides a more accurate
indication of the quantitative and qualitative changes in the supply of
beef.
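
The comparison statistics described above can be computed as in the following sketch, which sums category-level predictions and compares the sum with actual total slaughter. It is a minimal sketch under stated assumptions: the function name `compare_with_actual`, the dictionary keys, and the tiny data set are hypothetical and are not the paper's series.

```python
import math


def compare_with_actual(actual_total, predicted_by_category):
    """Compare actual total slaughter with the sum of category predictions.

    predicted_by_category holds one list of yearly predictions per category.
    """
    predicted_total = [sum(year) for year in zip(*predicted_by_category)]
    n = len(actual_total)
    mean_a = sum(actual_total) / n
    mean_p = sum(predicted_total) / n
    cov = sum((a - mean_a) * (p - mean_p)
              for a, p in zip(actual_total, predicted_total))
    var_a = sum((a - mean_a) ** 2 for a in actual_total)
    var_p = sum((p - mean_p) ** 2 for p in predicted_total)
    rmse = math.sqrt(
        sum((a - p) ** 2 for a, p in zip(actual_total, predicted_total)) / n
    )
    return {
        "r_squared": cov * cov / (var_a * var_p),  # squared correlation
        "slope": cov / var_p,             # regression of actual on predicted
        "rmse": rmse,                     # root mean square error
        "relative_rmse": rmse / mean_a,   # error relative to mean slaughter
    }


# Hypothetical example: two categories, three years.
predicted = [[1.0, 2.0, 3.0], [1.0, 2.0, 3.0]]
actual = [3.0, 5.0, 7.0]
stats = compare_with_actual(actual, predicted)
```

With the figures reported in the paper's footnote (a root mean square error of 2,356 against a mean of 98,500 animals slaughtered), the last ratio is about 0.024.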

The effects of several other variables are briefly noted. First, the lag
distribution on the climatic variables is similar in sign and form to the
distribution on price variables. Climatic improvement reduces the
slaughter of every category except bulls in the first year, with the strongest
impact in the heifer and yearling equations. Good climate also reduces
the slaughter of steers and calves, and if conditions improve for more than
1 year in a row, cows are retained as well.

The variable measuring the percentage of total exports to Great Britain
reflects the fact that the British traditionally demanded fatter and
therefore older steers. The production of these animals increased the average
age of slaughter, thereby decreasing the proportion of the steer stock
reaching slaughter age each year.

Producers’ reactions toward risk can be measured by including in the
slaughter regression equations the variability of those product and input
prices which affect the relative profitability of rival activities (see
Behrman 1967). In this study the variability of cattle and grain prices in
real terms had no significant effect on the slaughter of cattle, and, for
simplicity, they have been omitted here. Government intervention in the
tenancy and labor markets was important, however. This intervention

The root mean square error of the series predicted using the disaggregated equations
is 2,356, while the mean number of animals actually slaughtered is 98,500. Thus, the
mean error is less than 2.4 percent, and the residual of largest magnitude, 4,430 in 1964/65,
involves an error of only 4.8 percent.




made grain production more risky than cattle production by increasing
the threat of expropriation and of labor strikes during the harvest. There
was an eventual breakdown in the traditional tenancy system,
accompanied by the exodus, voluntary and forced, of many rural workers.
Producers discarded their previous rotational system and switched out of
field crops into cattle production, which was less labor intensive. The
slaughter of certain types of animals was reduced during this period as
new producers entered production. The variable for the change in the
rural labor force is a proxy for tenancy disruptions in the agricultural
sector.

Although the theoretical model seems to imply an important role for
the interest rate, the latter actually plays a relatively small role. Cattle
raised for slaughter have a short life span, and their slaughter age is much
more heavily influenced by changes in the price of beef and feed inputs
(including climatic effects) than by changes in the rate of interest.
Moreover, changes in the real rate of interest charged on agricultural loans
have been dominated by changes in the rate of inflation, which averaged
24 percent during the post-World War II period. When the rate of
inflation was included in the slaughter equations, its coefficient was
insignificant. The residual pattern indicates, however, that large discrete
changes in the rate of inflation may have had a brief impact on two
occasions.


VIII 

Among the implications of the study, the following stand out. First, the
micro-models in this paper indicate that an econometric model should be
disaggregated to obtain a clear understanding of producer behavior and
improved estimates of future beef output. Animals of different age, sex,
and breeding ability have different economic functions within the herd,
and their productive values will accordingly be differentially affected by
exogenous shocks to the system. This theory has been subjected to detailed
empirical testing and found to be surprisingly well supported. The
equations for both disaggregated slaughter and slaughter weight behaved
consistently with the theory.

See Jarvis (1969, chap. 8) for a discussion of this, including the importance of the
labor force variable in the specification of the model.

See Jarvis (1969) for a discussion of the attempts to measure the effect of changes
in the rate of inflation. Nores (1972) has included in his slaughter equations the net
change in the loan balance to the cattle sector to capture the effects of credit conditions.
The estimated coefficient on this variable is negative and significant in the slaughter
equation for yearlings, cows, and bulls.

The relative beef/grain price is theoretically the primary determinant of both the
optimal size of the herd and the optimal slaughter age of a given animal (the high price
elasticity of consumer demand among different types of beef requires that relative prices




Second, the estimated equations yield solid evidence that cattle
producers have responded in an economic manner to changes in the
beef/grain relative price and to its rate of change. The instantaneous response
of slaughter to a price increase is negative for every animal category.
However, even though the lagged price coefficients (and their sum) are
negative in most of the individual-category slaughter equations, the
econometric model as a whole indicates that the long-run elasticity of
slaughter is positive. This fact has been frequently misunderstood. The
simple single-equation models estimated in most previous studies were
essentially designed to show the negative short-run response of slaughter.
However, because these models cannot show that a rising herd, through
higher calf crops, would lead to rising slaughter over time, they were often
misinterpreted to imply a negative long-run elasticity as well. This is
incorrect. The estimated price coefficients in my model (and most others)
give only the effect of price on the “transitory” component of slaughter,
not the “permanent” component. A reduction of slaughter one year
increases the size of the herd in the next (and therefore the permanent
component of slaughter), and it is the net effect of changes in both the
permanent and transitory components which yields the true effect of the
price in lagged periods. Indeed, the greatest impact on future
production comes from the increase in the calf crop, as females are withheld
to increase the size of the breeding herd, and the equation estimating the
number of calves born is crucial to the estimation of the long-run price
elasticity.
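
The distinction between the transitory and permanent components of slaughter can be illustrated with a deliberately stylized herd simulation. It is a minimal sketch under stated assumptions: the function name `simulate_slaughter`, the single undifferentiated herd, the calving rate, the size and duration of the withholding, and the starting herd are all hypothetical values chosen for illustration, not estimates from the paper's disaggregated model.

```python
def simulate_slaughter(years, birth_rate=0.25, withheld=0.05,
                       build_years=3, herd0=1000.0):
    """Slaughter path when producers withhold animals for a few years.

    In the baseline (withheld=0) the slaughter proportion equals the calving
    rate, so the herd and slaughter are constant. After a price increase the
    proportion slaughtered falls by `withheld` for `build_years` years (the
    transitory effect), the herd grows, and slaughter at the normal
    proportion is then permanently higher.
    """
    herd = herd0
    path = []
    for t in range(years):
        rate = birth_rate - withheld if t < build_years else birth_rate
        slaughter = rate * herd
        path.append(slaughter)
        herd = herd + birth_rate * herd - slaughter  # calves in, slaughter out
    return path


baseline = simulate_slaughter(6, withheld=0.0)
after_price_rise = simulate_slaughter(6)
```

In this stylized case slaughter falls below the baseline while the herd is being built up and settles above it afterward, which is the pattern of a negative short-run and a positive long-run response described in the text.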

I believe the results obtained should abolish doubts as to whether
Argentine producers respond to prices. These results, which have been
obtained using herd data which I believe are much improved over those


of animals be rather constant, which in turn permits the price of one category to be used
as a reasonable proxy for all in many econometric models). The climatic index, a proxy
for certain types of feed costs, may also be important, but the interest rate plays a minor
role in the slaughter decision because most animals are relatively short-lived. Several
measures of the sectoral terms of trade were included in preliminary estimations of the
slaughter equations, but these variables were always insignificant. This result may reflect
more upon the poor quality of the variables used than upon the actual effect of the
changing terms of trade on agricultural investment. In nearly all similar studies it has
been easier to obtain an intrasectoral supply response than an intersectoral response.

32 Yver (1971) has developed a method to determine the time profile of the net impact
of exogenous disturbances on the dependent variables in a model similar (but necessarily
more simple, for computational reasons) to the one presented here.

A discussion of the specification and estimation of this equation is contained in
Jarvis (1969, chap. 4; 1971). Models which estimate only the number of animals
slaughtered, without considering changes in the average weight of the slaughtered animal,
will also underestimate the long-run supply response because a price increase will cause
animals within each category to be fed to somewhat heavier weights, and more animals
will be withheld to older ages, e.g., slaughtered as steers rather than yearlings. This latter
effect is frequently forgotten by those who note the relative constancy of individual-category
slaughter weights, but it is a significant factor in total supply. The cumulative
elasticity of average dressed weight with respect to the beef/grain price is nearly 0.10.




previously available, are statistically quite significant and identify the
slaughter response of different animal categories. The results show that
producers systematically reallocate their portfolios in the expected
manner when the recursive effect of their decisions is strongly evident,
that is, with continuously operating markets for disposable productive
assets. And because much of the indicated response is an interactivity shift
within the agricultural sector between grains and livestock, the price
response shown by cattle producers implies a response by field crop
producers as well.

It has been argued that discriminatory and capricious government price
policy toward agriculture in the 1940s and 1950s, combined with rapid
inflation and government intervention in land tenancy markets, so
frustrated and confused producers in Argentina that they ceased to
respond to the usual market indicators of profitability. I find no support
for this argument. Producers did react to nonprice disturbances
during the period studied, such as when government intervention in the
tenancy market caused a reevaluation of the relative risk of different
activities and thereby a demand shift for different productive factors. But
given this shift, the sensitivity to market prices continued as before; that is,
prices seem to have remained an important short-run determinant of
production variation throughout the period.

Third, the stronger is producer response, the more slaughter will be
reduced in the short run. This fact is important for a country like
Argentina, which has frequently devalued its currency when the balance
of payments has been in crisis. The purpose of devaluation, which raises
proportionately the domestic price of tradable goods (in the absence of
export retentions, etc.), is to reduce the consumption of exportables, like
beef, while simultaneously stimulating internal production and
increasing external demand for the same. And the evidence suggests that
devaluation in Argentina has usually increased beef exports in the first
year because the decrease in domestic consumption and the increase in
foreign demand outweigh the decrease in slaughter (as producers attempt
to increase herds). However, because producer response leads to even
higher prices, partially offsetting the effect of the previous devaluation in
foreign markets and aggravating inflationary pressures in Argentina,

The asymmetry between these results and past studies of field crops may be due to
measurement difficulties. I have found, for example, that when corn production data are
disaggregated at the partido (county) level, many individual partidos show significant
price response even though their aggregate does not. The degree of response for the
individual partidos seems to be closely related to the level of the yields achieved and to the
type of competing agricultural activities in the area. One of the factors confounding
the aggregate results is the difficulty of separating those corn plantings which are desired
for grain harvest from those to be used for grazing cattle. In some partidos a rise in the price
of cattle is associated with a rise in the area planted to corn, but with a fall in the
percentage of the planted area harvested (as corn is planted for forage), while in other partidos
the beef price rise is associated with a decline in the area planted to corn.




beef exports frequently decline from their predevaluation levels in year 2
and even year 3. As a result, the government must be prepared to weather
a period of almost 3 years before devaluation can significantly improve
the balance of payments. The government generally has not been able to
wait so long, and economic deflation has usually been induced to bring
the balance of payments into temporary equilibrium.

Price response clearly does not imply that producers allocate resources
efficiently, even in the steady state. It suggests only that producers respond
in the expected direction and that agricultural policies should take this
fact into account. And the response encountered does not alter the fact
that Argentine agriculture stagnated for nearly 3 decades. As price
response is not the cause, per se, other explanations for this stagnation
must be sought.


References 

Behrman, Jere. “Price Elasticity of the Marketed Surplus of a Subsistence Crop.”
J. Farm Econ., vol. 49 (February 1967).

Colomé, Rinaldo A. “Funciones de oferta agropecuaria de la region Pampeana
en el periodo 1940-1960.” Mimeographed. Universidad Nacional de Cordoba,
July 1966.

de Janvry, Alain. “A Socioeconomic Model of Induced Innovations for Argentine
Agricultural Development.” Q.J.E., vol. 87, no. 3 (August 1973).

Diaz-Alejandro, Carlos F. Exchange Rate Devaluation in a Semi-industrialized Country:
The Experience of Argentina, 1955-1961. Cambridge, Mass.: M.I.T. Press, 1965.

------. Essays on the Economic History of the Argentine Republic. New Haven, Conn.:
Yale Univ. Press, 1970.

Freire, Remy. “Price Incentives in Argentine Agriculture.” Mimeographed.
Development Advisory Service, Harvard Univ., June 1966.

Hall, Robert E. “Polynomial Distributed Lags.” Working Paper no. 7,
Massachusetts Inst. Tech., Dept. Econ., July 1967.

In my thesis (Jarvis 1969) I suggested that the principal cause of Argentine
agricultural stagnation during the 1940s and 1950s was not lack of price response but the
small amount of technical change which made its way into the Pampas. In response to
changes in product and factor relative prices, producers shifted their resources among
different economic activities and shifted substantially their use of factor inputs, largely by
substituting cattle and machinery for migrating labor. Producers also adopted new
techniques and inputs when these were available and profitable, including new cattle
breeds, new seeds, and new machinery. Nonetheless, the total number of new innovations
adopted was small relative to the experience of other countries possessing rich,
well-developed agricultural sectors, and little increase in total factor productivity seems to
have occurred in the Pampas. Government policy, which inhibited the introduction and
dissemination of technical change during this period, must be held chiefly responsible.
High tariffs and import prohibitions were imposed on many agricultural inputs, and the
public agricultural research and extension system was effectively dismantled.
Diaz-Alejandro (1970) also has suggested that government policies were most negative in their
discouragement of technical change. However, de Janvry (1973) has argued that the large
and politically influential agricultural producers have not encouraged government
research in yield-increasing techniques, even in recent years, because these developments
would not have been in their social and economic interest, and these producers have thus
contributed to the lack of available technology.




Jarvis, Lovell S. “Supply Response in the Cattle Industry: The Argentine Case,
1937/38-1966/67.” Ph.D. dissertation, Massachusetts Inst. Tech., 1969.

------. “An Example of the Use of Economic Models to Construct Unavailable
Data Series: The Estimation of Disaggregated Cattle Herd Stocks for Argentina,
1937-1967.” Paper presented at Winter Meetings of Econometric Society, New
Orleans, December 27-29, 1971. Published in Spanish as “Un ejemplo del uso
de modelos económicos para la construcción de datos no disponibles: la
estimación de la existencia de vacuno desagregado en Argentina, 1937-1967.”
Económica (Buenos Aires), no. 1 (1973).

Nores, Gustavo. “Quarterly Structure of the Argentine Beef Cattle Economy: A
Short Term Model, 1960-1970.” Ph.D. dissertation, Purdue Univ., 1972.

Otrera, Wylian R. “An Econometric Model for Analyzing Argentine Beef Export
Potentials.” Ph.D. dissertation, Texas A & M Univ., 1966.

Reca, Lucio G. “The Price and Production Duality within Argentine Agriculture,
1923-1965.” Ph.D. dissertation, Univ. Chicago, 1967.

Williams, Hadley Cogswell. “The Role of Agriculture in Argentine Development:
A Problem of Farmer Response.” B.A. thesis, Harvard Univ., 1966.

Yver, Raul E. “The Investment Behavior and the Supply Response of the Cattle
Industry in Argentina.” Ph.D. dissertation, Univ. Chicago, 1971.



Rational Expectations and 
Bayesian Analysis 


Richard M. Cyert and Morris H. DeGroot

Carnegie-Mellon University


The approach in this paper is the development of models that describe
the process by which rational expectations may be developed within a
market. The concept of Bayesian learning is introduced. Consistent and
rational expectations are introduced in models where the firms cannot
immediately move to equilibrium. Three different models are developed
which demonstrate the interaction of Bayesian learning and expectations
in achieving a market equilibrium. These models are dynamic and
describe the transition process toward equilibrium. Two of the models
involve unknown parameters about which the firms learn. The third is
a control theory model explicitly involving adjustment costs.


The concept of rational expectations was introduced in an important
article by Muth (1961). Muth’s hypothesis was that the mean expectation
of firms with respect to some phenomenon, say price, was equal to the
prediction that would be made by the relevant economic theory. Muth
gives his reasons for believing in the hypothesis, proceeds to develop it
further, and does some testing as to the general consistency of the
hypothesis. Our objective in this paper is to build on Muth’s basic concept by
providing some insight into the processes by which the
rational-expectations hypothesis can, in fact, be realized. We will introduce the
concept of Bayesian learning into expectations and then demonstrate for
three different models how the concept might work in achieving a market
equilibrium.

This research was supported in part by the National Science Foundation under grant
GS-32514. We are indebted to William Brock, David Cass, Michael Lovell, Robert
Lucas, and Edward Prescott for helpful comments. We would also like to thank an
unknown referee for excellent comments that resulted in a significant improvement
of the paper.



The Process of Generating Rational Expectations 

Muth sees his hypothesis developing out of a process in which each
economic actor (he generally uses firms) has an expectation (prediction)
about the value of some economic variable. He makes it clear that each
expectation is not perfect (in the sense of being accurate), nor are all the
expectations the same. Out of this process Muth hypothesizes that the
weighted arithmetic mean of these expectations is equal to the prediction
of the relevant economic model, generally the equilibrium value of the
model. It is our view that a process has to be developed if the rational
hypothesis is to be a scientific truth rather than a religious belief.

The attempt to develop process models immediately opens us to the
criticism of developing ad hoc models. We acknowledge that there may be
a large number of models that could potentially describe the process to
equilibrium. Our position is that, while the models have a certain amount
of face validity, our major contribution is the introduction of an explicit
learning process described in Bayesian terms. The notion of developing
models to describe the transition process toward equilibrium of a system
disturbed by some random shocks may be questioned by some economists.
The development of comparative statics and the neglect of dynamic
analysis is in part a reflection of such attitudes in the profession. Yet
without well-developed process models, the concept of rational
expectations is essentially a black box.

Muth seems to assume an independence among the firms making the
estimates. Empirically, however, it is more reasonable to assume that the
expectations are correlated. The management of each firm in a given
market operates in essentially the same kind of environment and has much
of the same kind of information. In most cases, the managers have also
experienced the same set of historical inputs from having been in the
particular industry for some time. In addition, the existence of trade
associations will tend to make similar information available to managers
in the same industry, although managers can obviously obtain information
from other sources. It is also possible that even with similarity of
information the same input can be interpreted somewhat differently when the
subjective probability distributions are computed by each manager.
However, even if all firms do not initially have the same priors, the
feedback from the market will tend to modify the priors to the extent that
similarity becomes a reasonable assumption at some point. Thus, we will
postulate the same prior probability distributions for the decision makers
in our models.

Initially, Muth makes the assumption of independent shocks within his
model. He assumes that the error term is randomly distributed, with no
serial correlation. This model always leads to the trivial conclusion that
the mean expectation is equal to the equilibrium value. Thus, the
existence of a cobweb is antithetical to Muth’s notion, and in his paper he
argues that the cobweb model is not a good description of the data. As an
alternative explanation, Muth has recourse to the fact that a dynamic
system hit with random shocks exhibits cycles with a stable period.
Process models such as we develop may be a middle ground for explaining
the phenomena. They have neither to assume irrationality and a lack of
learning, as is true of the traditional cobweb model, nor to rely solely on
random processes for an explanation of the real world.

Muth himself recognizes the fact that a model assuming rational
expectations does not accommodate a dynamic process showing the path
to equilibrium. He attempts to accommodate such a process by
introducing the notion of serially correlated disturbances. He assumes that
these disturbances can be represented as linear combinations of normally
and independently distributed random variables and that both the actual
price and the expected price can also be represented as linear combinations
of these independent variables with appropriate weights. This approach
gives Muth’s work greater flexibility. However, in applications, the
notion of independently and identically distributed disturbances is, for a
variety of reasons, the most generally used. As a result, most models
incorporating rational expectations move directly to the equilibrium
value of the model without specifying an adequate process to produce
the result.

Learning and Uncertainty 

The model that we will follow will utilize Bayesian learning in describing
the process by which expectations, once formed, lead to an equilibrium.
Rather than assume that market expectations lead directly to the
equilibrium value, we will assume that learning takes place in the market and
that this learning has the effect of continually modifying the prior
probability distribution with which the firms start. Through this process
we will show that the market will typically converge to an equilibrium.
The learning in our analysis comes from the feedback resulting from
the actions taken by the firm. If a firm has a prior distribution for a
particular price and supplies the appropriate quantity, its prior will be
modified and a new expectation formed when it observes the results in
the market. The new prior in turn leads to another decision. The form of
learning that takes place is learning about the unknown values of the
parameters in the particular market in which the firm is operating. The
first step is the formation of a prior probability distribution about the
values of the important parameters in the market. The learning process
then continues through feedback from the market and continuing
modification of the prior (Cyert and DeGroot 1970, 1971).

We propose to deal with models in which there is uncertainty. However,
the term “uncertainty” takes on a completely different meaning as one
moves from classical statistics to a Bayesian approach. The term has
been generally used to describe a situation in which the outcome of a
decision maker’s action is not precisely predictable because of the
existence either of parameters with unknown values or of random error
terms. The traditional approach for analyzing uncertainty has been to
assume a probability distribution for the error term but not for the
unknown values of the parameters. This procedure leads ultimately to
the need for some choice criterion such as the minimax or
Neyman-Pearson approach (Modigliani 1949, Arrow 1951, Savage 1954). When
we move into the Bayesian world, these choice criteria no longer have
relevance. To the Bayesian, all uncertainty can be represented by
probability distributions. Thus, the approach is to assign a prior distribution
to the unknown values of all the parameters wherever they appear in the
model (DeGroot 1970). Learning then takes the form of changing
conditional distributions for the outcomes in future periods on the basis
of past outcomes. The structure of the changes will vary depending on
whether they arise from updating the posterior distributions of
parameters or from serially correlated error terms. The only conditions under
which the learning process described in the preceding paragraph is not
present are represented by models that consist of error terms independently
distributed over time and no other parameters. Our models will consist
of both unknown values of parameters and independently distributed
error terms. The kind of learning we have described will be possible in
these models. In fact, as we shall now discuss, this kind of learning is
possible even when the form of the model is unknown to the economic
agents.


An Inconsistent Model 

We classify the first model that will be discussed as an inconsistent model
because the firms do not know the form of the model of the process
determining the price and, in fact, base their decision on an incorrect
model. Through learning from the feedback of market information, the
firms can reach an equilibrium even though they have the wrong model.
However, since the model that the firm is using is incorrect, this process
may converge only slowly or may not converge at all.

We feel that the construction of inconsistent models is not the most
fruitful way to progress in this field. If the firms have models that diverge
drastically from reality, it seems reasonable to assume that the
management would recognize this condition and change the model. The firms
would continue searching for a model that produced predictions that
coincide more closely with actual observations. In fact, the term “rational
expectations” coined by Muth actually describes models that are
consistent. The model under discussion, even though inconsistent, does have
some interesting properties.

This first model involves a parameter whose exact value is unknown to
the firm. The firm assigns a prior distribution to the value of the
parameter, and after the price in each period is observed, the firm updates
this distribution in accordance with Bayes’s theorem. However, since the
price in each period is influenced by the firm’s expectations, and these
expectations are at least partly incorrect, the time series of prices that are
generated could be highly unstable, even though other conditions of the
model indicate a convergence to equilibrium should prevail.

Assume that the firm’s model for the price $p_{t+1}$ in period $t + 1$, given
the price $p_t$ in period $t$, is of the following form:

$$p_{t+1} = a p_t + v_{t+1}, \qquad t = 0, 1, 2, \ldots \tag{1}$$

Here the value of the coefficient $a$ is fixed throughout the process but is
unknown, and $v_1, v_2, \ldots$ form a sequence of independent and identically
distributed random error terms. Suppose also that the firm believes that
the common distribution of each of the error terms is normal with mean 0
and known precision $r$. (The precision of a normal distribution is the
reciprocal of its variance.)

If the firm has a given posterior distribution for the value of $a$ after it
has observed the price $p_t$, then this distribution becomes the new prior
and the firm can apply Bayes’s theorem to determine the posterior
distribution of $a$ after the price $p_{t+1}$ has been observed. In particular, it
follows (DeGroot 1970, p. 167) that if the posterior distribution of $a$ at
the end of period $t$ is normal with mean $m_t$ and precision $h_t$, then the
posterior distribution at the end of period $t + 1$ will be normal with mean
$m_{t+1}$ and precision $h_{t+1}$, where

$$m_{t+1} = \frac{h_t m_t + r p_t p_{t+1}}{h_t + r p_t^2} \tag{2}$$

and

$$h_{t+1} = h_t + r p_t^2. \tag{3}$$

It follows from these relations that at the end of period $t$, the firm’s
expectation $E_t(p_{t+1})$ of the price in period $t + 1$ will be

$$E_t(p_{t+1}) = E_t(a) p_t + E_t(v_{t+1}) = m_t p_t, \tag{4}$$

since $E_t(a) = m_t$ and $E_t(v_{t+1}) = 0$.
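The updating rule in equations (2)-(4) can be sketched in a few lines of
Python. This is only an illustration under stated assumptions: the function
names `update_posterior` and `expected_price` are hypothetical and do not
come from the paper, and the numerical inputs are illustrative values chosen
to match the initial conditions discussed later ($p_0 = m_0 = r = 1$,
$h_0 = 0$).

```python
def update_posterior(m, h, p_prev, p_new, r=1.0):
    """Apply equations (2) and (3) once (hypothetical helper).

    m, h   : posterior mean and precision of a at the end of period t
    p_prev : price p_t
    p_new  : newly observed price p_{t+1}
    r      : known precision of the error term
    """
    h_new = h + r * p_prev ** 2
    if h_new == 0:
        # With a diffuse prior (h = 0) the update needs a nonzero p_t.
        raise ValueError("diffuse prior (h = 0) needs a nonzero p_t")
    m_new = (h * m + r * p_prev * p_new) / h_new
    return m_new, h_new


def expected_price(m, p):
    """Equation (4): the firm's expectation of next period's price."""
    return m * p


# Illustrative values: diffuse prior (h_0 = 0), p_0 = 1, observed p_1 = -0.539.
m1, h1 = update_posterior(m=1.0, h=0.0, p_prev=1.0, p_new=-0.539)
forecast = expected_price(m1, -0.539)
print(m1, h1, forecast)
```

With a diffuse prior the first observation determines the posterior mean
entirely, which is why $m_0$ matters only for fixing the first price.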

We shall assume that in fact, however, the actual price $p_{t+1}$ depends on
the firm’s expectation $E_t(p_{t+1}) = m_t p_t$. Specifically, we shall assume
that the market equations take the following form in any period $t$:

$$C_t = d_1 - \beta p_t \quad \text{(demand)}, \tag{5}$$

$$Q_t = d_2 + \gamma E_{t-1}(p_t) + u_t \quad \text{(supply)}, \tag{6}$$

$$Q_t = C_t \quad \text{(market equilibrium)}. \tag{7}$$

In these relations, $Q_t$ is the number of units produced in period $t$, $C_t$ is
the amount consumed in that period, and $u_t$ is a random variable
representing the uncontrollable factors which affect the amount produced in
period $t$. It is assumed that $d_1$, $d_2$, $\beta$, and $\gamma$ are given constants. We have
taken the same model as Muth to facilitate comparisons between his
work and our own.¹

By solving the market equations, we obtain the relation

$$p_t = \frac{d_1 - d_2}{\beta} - \frac{\gamma}{\beta} E_{t-1}(p_t) - \frac{1}{\beta} u_t. \tag{8}$$

We shall assume for simplicity, as did Muth, that $d_1 = d_2$ (so that the
equilibrium price is 0), and we shall let $v_t = -(1/\beta)u_t$. Then we have

$$p_t = -\frac{\gamma}{\beta} E_{t-1}(p_t) + v_t. \tag{9}$$

Thus, the price $p_{t+1}$ is actually determined by the relation

$$p_{t+1} = -\frac{\gamma}{\beta} E_t(p_{t+1}) + v_{t+1} \tag{10}$$

or equivalently

$$p_{t+1} = -\frac{\gamma}{\beta} m_t p_t + v_{t+1}. \tag{11}$$

It is again assumed that $v_1, v_2, \ldots$ are independent and identically
distributed, each having a normal distribution with mean 0 and
precision $r$.
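A minimal sketch of how the pieces fit together: actual prices follow
equation (11), while the firm keeps revising its beliefs by equations (2)
and (3). The function name `simulate_prices` and the argument `ratio`
(standing for $\gamma/\beta$) are assumptions made for this illustration. The
shocks used below are the first five values of $v_t$ listed in table 1, so
the output can be compared with the first rows of that table.

```python
def simulate_prices(shocks, p0=1.0, m0=1.0, h0=0.0, r=1.0, ratio=1.0):
    """Generate prices from equation (11) while updating (m_t, h_t).

    `ratio` stands for gamma/beta. Returns one (m_t, h_t, p_t) tuple per
    period. Assumes p0 != 0 when h0 == 0 so that the first update is defined.
    """
    p, m, h = p0, m0, h0
    path = []
    for v in shocks:
        p_next = -ratio * m * p + v              # equation (11)
        h_next = h + r * p ** 2                  # equation (3)
        m = (h * m + r * p * p_next) / h_next    # equation (2)
        p, h = p_next, h_next
        path.append((m, h, p))
    return path


# First five shocks v_1, ..., v_5 as reported in table 1.
table1_shocks = [0.461, 0.645, 1.066, -0.132, -0.182]
for t, (m, h, p) in enumerate(simulate_prices(table1_shocks), start=1):
    print(t, round(m, 3), round(h, 3), round(p, 3))
```

Because the tabulated shocks are rounded to three decimals, the recomputed
values can differ from the table in the last digit.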

Thus, although the firm believes that prices are determined in
accordance with (1), and forms its expectations about the value of $a$ in
accordance with (2), prices are actually determined in accordance with
(11). We have, therefore, a relatively sophisticated cobweb process. As
in other economic processes, the model becomes consistent at equilibrium.
Thus, if the firm becomes convinced that $a = 0$, so that $m_t = 0$, then the
model is no longer inconsistent, because equations (1) and (11) become
identical, $p_{t+1}$ becoming equal to the error term $v_{t+1}$.

¹ The existence of a random variable in the supply implies a random variable in the
production function of the individual firms. However, the existence of a random variable
in the production function does not in any sense imply that the firms are not attempting
to maximize profits. It does mean that the firms must make their production decisions
under uncertainty. We are assuming that they do this in an optimum fashion, and equation
(6) is the aggregate result of such decisions. For a discussion of some related problems, see
Grossman (1972).

In the conventional cobweb analysis, the attainment of equilibrium
is dependent on the relative slopes of the demand and supply curves.
In terms of the parameters of our model, where the slope of the demand
curve is $-(1/\beta)$ and the slope of the supply curve is $1/\gamma$,

$$\beta > \gamma: \quad \text{convergence}, \tag{a}$$

$$\beta = \gamma: \quad \text{oscillation}, \tag{b}$$

$$\beta < \gamma: \quad \text{divergence}. \tag{c}$$

These conditions do not hold for our model. It can be seen from equation
(11) that the condition for convergence is that $(\gamma/\beta)|m_t|$ ultimately
becomes less than 1.
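The contrast between the two criteria can be made concrete with a small
sketch. The function names below are hypothetical, and the second function
only checks the one-period condition $(\gamma/\beta)|m_t| < 1$ at a given
$m_t$; whether that condition "ultimately" holds depends on the whole
learning path.

```python
def conventional_cobweb(beta, gamma):
    """Classify the traditional cobweb by conditions (a)-(c)."""
    if beta > gamma:
        return "convergence"
    if beta == gamma:
        return "oscillation"
    return "divergence"


def learning_step_contracts(beta, gamma, m_t):
    """Check the one-period condition (gamma/beta) * |m_t| < 1 from (11)."""
    return (gamma / beta) * abs(m_t) < 1


# With beta = gamma the conventional analysis predicts oscillation, yet the
# learning model contracts once the posterior mean m_t is close to 0.
print(conventional_cobweb(1.0, 1.0), learning_step_contracts(1.0, 1.0, 0.054))
```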


Numerical Results 

To get a feeling for the operation of the model, we made several Monte
Carlo runs. A given run is carried out by specifying the values of $\gamma/\beta$
and $r$ to be used throughout the run, as well as the initial price $p_0$, the
mean $m_0$, and precision $h_0$ of the prior distribution of $a$, generating a
sequence $v_1, v_2, \ldots, v_n$ of independent and identically distributed errors
for a fixed number of periods $n$ and computing the corresponding sequence
$p_1, p_2, \ldots, p_n$ of prices.

All of the runs were made using the value $h_0 = 0$. That is, it was
assumed that the prior distribution of $a$ was an improper, diffuse prior
over the real line, although in order to determine the price $p_1$ it was still
necessary to specify the value of $m_0$.
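The run procedure just described might be sketched as follows. This is an
assumption-laden illustration, not the authors' program: the function name
`monte_carlo_run`, the use of Python's `random.Random` generator, and the
`seed` argument are all choices made here, so the numbers it produces will
not reproduce the published tables.

```python
import random


def monte_carlo_run(n=100, ratio=1.0, r=1.0, p0=1.0, m0=1.0, h0=0.0, seed=0):
    """One Monte Carlo run of the inconsistent model (hypothetical helper).

    `ratio` stands for gamma/beta. Each row is
    (t, m_t, h_t, p_t, E_{t-1}(p_t), v_t).
    """
    rng = random.Random(seed)
    sigma = r ** -0.5                 # precision r means variance 1/r
    p, m, h = p0, m0, h0
    rows = []
    for t in range(1, n + 1):
        v = rng.gauss(0.0, sigma)
        expectation = m * p                      # equation (4)
        p_next = -ratio * expectation + v        # equation (11)
        h_next = h + r * p * p                   # equation (3)
        m = (h * m + r * p * p_next) / h_next    # equation (2)
        p, h = p_next, h_next
        rows.append((t, m, h, p, expectation, v))
    return rows


rows = monte_carlo_run(seed=1)
print(rows[-1])
```

Repeating the call with different seeds gives independent runs; keeping the
seed fixed while changing `p0` reuses the same random variates.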

As a first example, for $\gamma/\beta = r = p_0 = m_0 = 1$, the sequence given
in table 1 was generated for a process lasting $n = 100$ periods. As can be
seen from table 1, the value of $m_t$ is close to 0 after 100 periods, and the
prices $p_t$, therefore, tend to differ little from the random error terms $v_t$.
It seems noteworthy, however, that even at $t = 100$, the values of $m_t$
are still exhibiting small fluctuations. In a second example, the initial
price was changed to $p_0 = 10$, and the process was again followed for
100 periods using the same values of the random variates $v_t$. The value of
$m_t$ fluctuated near -0.03 after 100 periods, and thus again $p_t$ differed
little from the random error terms.

It is of particular interest that we are getting convergence in two cases
that the conventional analysis classifies as oscillating. This effect comes
from the Bayesian learning built into the model. The strength of the firms’
beliefs in the correctness of their expectations can be seen in the growth
of the precision $h_t$ in table 1. (In the second example $h_t = 336$ after 100
periods.) The large values of $h_t$ indicate that the variance of the posterior
distribution is very small. The firms are beginning to treat their
expectations as being based on essentially certain knowledge of the value of $a$.


TABLE 1

Initial Conditions $p_0 = m_0 = r = \gamma/\beta = 1$

  $t$     $m_t$      $h_t$      $p_t$    $E_{t-1}(p_t)$    $v_t$

    1    -0.539      1.000     -0.539        1.000         0.461
    2    -0.566      1.290      0.355        0.290         0.645
    3    -0.198      1.416      1.267       -0.201         1.066
    4    -0.043      3.021      0.119       -0.250        -0.132
    5    -0.050      3.035     -0.177       -0.005        -0.182
  ...
   96     0.084     76.458      0.101        0.031         0.132
   97     0.085     76.468      0.695        0.008         0.703
   98     0.085     76.951      0.063        0.059         0.122
   99     0.083     76.955     -1.618        0.005        -1.612
  100     0.054     79.572      1.309       -0.135         1.174


TABLE 2

Initial Conditions $p_0 = m_0 = r = 1$, $\gamma/\beta = 0.1$

Ten Runs of 100 Periods

  Run    $m_{100}$    $h_{100}$    $p_{100}$    $E_{99}(p_{100})$

    1      0.046       76.307       1.186          -0.119
    2      0.030      105.143      -2.054          -0.010
    3     -0.174      104.052      -0.598          -0.071
    4     -0.094       97.175       2.160          -0.021
    5      0.125      104.670      -1.969          -0.032
    6     -0.004      100.390      -1.049           0.003
    7     -0.005       92.691      -0.969           0.016
    8     -0.081       80.874      -0.473           0.036
    9     -0.017       95.349      -0.136          -0.010
   10     -0.028      104.113      -0.397           0.001

These results derive from single runs of the process and therefore do
not necessarily represent average behavior of the process. In table 2 the
results of 10 independent runs for $p_0 = m_0 = r = 1$ are given, and in
every run the model behaves as would be expected from conventional
analysis.

As the value of $\gamma/\beta$ is increased beyond 1, the model produces an
increased number of explosive solutions. For $\gamma/\beta = 1.2$, we get
convergence for all 10 runs. For $\gamma/\beta = 1.5$, however, five of the 10 runs
diverge in the sense that price $p_t$ begins oscillating wildly, being positive
for two periods, then negative for two periods, as its absolute value
increases.

Essentially the same kind of results prevails for different initial
conditions of $p_0$ and $m_0$. For example, with $p_0 = 10$ and $m_0 = 1$, we get
convergence of all 10 runs (100 iterations each) for $\gamma/\beta = 1.5$, in contrast
to the previous result for $p_0 = m_0 = 1$. But $\gamma/\beta = 1.75$ leads to all 10
runs exploding.


TABLE 3

Initial Conditions $p_0 = m_0 = 10$, $r = \gamma/\beta = 1$, $v_t = 0$

  $t$     $m_t$        $h_t$                       $p_t$

    1    -10          $10^{2}$                    $-10^{2}$
    2      9.802      $1.010 \times 10^{4}$       $-10^{3}$
    3     -9.606      $1.010 \times 10^{6}$       $9.802 \times 10^{3}$
    4      9.406      $9.709 \times 10^{7}$       $9.416 \times 10^{4}$
    5     -9.203      $8.963 \times 10^{9}$       $-8.857 \times 10^{5}$
   10      8.115      $3.085 \times 10^{19}$      $-4.601 \times 10^{10}$
   15     -6.873      $2.810 \times 10^{28}$      $1.185 \times 10^{15}$
   20      5.422      $4.334 \times 10^{36}$      $1.177 \times 10^{19}$
   25     -3.584      $4.823 \times 10^{43}$      $-2.702 \times 10^{22}$
   30      0.9246     $3.732 \times 10^{48}$      $-2.684 \times 10^{24}$
   35     -0.0773     $1.761 \times 10^{49}$      $3.556 \times 10^{20}$
   40     -0.0773     $1.761 \times 10^{49}$      $9.817 \times 10^{14}$
   45     -0.0773     $1.761 \times 10^{49}$      $2.709 \times 10^{9}$
   50     -0.0773     $1.761 \times 10^{49}$      $7.475 \times 10^{3}$
   55     -0.0773     $1.761 \times 10^{49}$      0.0206
   60     -0.0773     $1.761 \times 10^{49}$      0.0000

An analytic study of the convergence or divergence of this process
appears to be difficult. In order to perceive the essential nature of the
process when it is stripped of all stochastic elements, and to gain a better
understanding of the Monte Carlo runs just described, calculations were
carried out for a model in which the error term $v_t$ in equation (11) was
taken to be 0 in every period $t$.

When $\gamma/\beta = 1$, it is found that the price $p_t$ converges quickly to 0 for
each of the following three sets of initial values: (i) $p_0 = m_0 = 1$,
(ii) $p_0 = 10$, $m_0 = 1$, (iii) $p_0 = m_0 = 2$. When $\gamma/\beta = 1$ and the
initial values are $p_0 = m_0 = 10$, the results are as shown in table 3. This
process appears to be exploding during the first 30 periods. The price $p_t$
spirals wildly outward, and again it is positive for two periods, then
negative for two periods, as its absolute value increases. The absolute
value reaches a maximum of $2.684 \times 10^{24}$ at period $t = 30$. After that
period, however, the price $p_t$ moves rapidly toward 0 and the process
converges.

To assess the significance of the error term in this process, a Monte
Carlo run was also carried out for these same initial values $p_0 = m_0 = 10$
and $\gamma/\beta = r = 1$, but with the error terms $v_t$ again included in equation
(11). Even with these error terms present, the process was found to behave
similarly to the process represented in table 3. The price $p_t$ reached its
maximum absolute value of $1.60 \times 10^{24}$ in period $t = 29$ and then began
to decrease in magnitude. When $t = 29$, the values of $m_t$ and $h_t$ were
found to be $m_t = 0.771$ and $h_t = 1.67 \times 10^{48}$. All these values are in
approximate agreement with the values given in table 3.


Our conclusion is that the expectational model brings the market to
equilibrium under varying initial values, including some values when the
constraints on the slope condition derived from conventional analysis
indicate that divergence will occur. It is clear that, if a mechanism were
built into the model to allow the firm to modify its expectational models
when they were clearly wrong, convergence could be attained with large
ratios of $\gamma/\beta$ and at a faster rate than we have observed in the current
model. Thus, one approach to further research of this kind (utilizing
simulation) is to find learning mechanisms that will allow the economic
units to change expectational models in a rational fashion.


A Consistent Model 

We will continue to work with the market equations described by
equations (5)-(7), and we will now derive analytic solutions for a model
that is free of any of the inconsistencies previously discussed. We will
assume that the values of some or all of the parameters $d_1$, $d_2$, $\beta$, and $\gamma$
appearing in the demand and supply equations are unknown. Moreover,
the distribution of $u_t$ may involve other parameters with unknown values.
A consistent model means that the firm’s expectations must be based
on this mechanism. Therefore, we can derive an explicit expression for
these expectations by taking the expected value of both sides of equation
(8). Thus,

$$E_{t-1}(p_t) = E_{t-1}\left(\frac{d_1 - d_2}{\beta}\right) - E_{t-1}\left(\frac{\gamma}{\beta}\right) E_{t-1}(p_t) - E_{t-1}\left[\left(\frac{1}{\beta}\right) u_t\right]. \tag{12}$$

It then follows from this relation that

$$E_{t-1}(p_t) = \frac{E_{t-1}[(d_1 - d_2)/\beta] - E_{t-1}[(1/\beta) u_t]}{1 + E_{t-1}(\gamma/\beta)}. \tag{13}$$

Equation (13) represents the actual expectations of the firm in the market.
The market price will differ from this price because of the firm’s
incomplete knowledge of the values of the relevant parameters and random
terms.

The economic process will evolve as follows. In each period, the firms
form their expectation of the price in the next period from equation (13).
The actual price is then generated in accordance with equation (8). The
observed actual price leads the firms to change their expectations of the
values of the parameters in equation (13) and, hence, to change their
expectation of the price in the following period. The next price is then
generated from (8), and the process continues in this way.



As a simple example, suppose that in the market equations (5)-(7) the
values of $\beta$ and $\gamma$ are known but that the value of $D = d_1 - d_2$ is not
known with certainty. Suppose also that the random variables $u_t$ are
independent from period to period and that each $u_t$ has a normal
distribution with mean 0 and known precision $\tau$. (This assumption of known
$\tau$ can be relaxed by the introduction of a prior distribution for $\tau$ when its
value is unknown.) It follows from (13) that

$$E_{t-1}(p_t) = \frac{E_{t-1}(D)}{\beta + \gamma}. \tag{14}$$

Suppose now that the posterior distribution of $D$ at the end of period
$t - 1$ is normal with mean $m_{t-1}$ and precision $h_{t-1}$. Then $E_{t-1}(p_t) =
m_{t-1}/(\beta + \gamma)$. Furthermore, equation (8) can be rewritten in the form

$$\beta p_t + \gamma E_{t-1}(p_t) = D - u_t. \tag{15}$$

Therefore, it follows from Bayes’s theorem that the posterior
distribution of $D$ at the end of period $t$ will be a normal distribution with
mean

$$m_t = \frac{h_{t-1} m_{t-1} + \tau[\beta p_t + \gamma E_{t-1}(p_t)]}{h_{t-1} + \tau} \tag{16}$$

and precision $h_t = h_{t-1} + \tau$. Substituting (14) into (16), we get

$$m_t = \frac{\{h_{t-1} + [\tau\gamma/(\beta + \gamma)]\} m_{t-1} + \tau\beta p_t}{h_{t-1} + \tau}. \tag{17}$$

It can be shown from these relations that

$$\operatorname{plim}_{t \to \infty} m_t = D,$$

and, hence,

$$\operatorname{plim}_{t \to \infty} E_{t-1}(p_t) = \frac{D}{\beta + \gamma}. \tag{18}$$

Therefore, as $t \to \infty$, the expected price in each period becomes stabilized
at the value $D/(\beta + \gamma)$. It follows from (8) that the distribution of the
actual price in each period also becomes stabilized as a normal
distribution with mean $D/(\beta + \gamma)$ and precision $\beta^2\tau$.
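The evolution of the consistent model can be illustrated with a short
sketch of equations (8), (14), (15), and (16). The function name
`simulate_consistent` and all numerical values are assumptions chosen for
this illustration. Feeding in shocks that are all 0 isolates the learning
dynamics: the posterior mean then satisfies
$m_t = (h_0 m_0 + t\tau D)/(h_0 + t\tau)$, which tends to $D$.

```python
def simulate_consistent(D, beta, gamma, tau, m0, h0, shocks):
    """Consistent model with unknown D = d1 - d2 (hypothetical helper).

    Each row is (E_{t-1}(p_t), p_t, m_t, h_t).
    """
    m, h = m0, h0
    history = []
    for u in shocks:
        expected = m / (beta + gamma)                # equation (14)
        price = (D - gamma * expected - u) / beta    # equation (8)
        signal = beta * price + gamma * expected     # equals D - u, eq. (15)
        m = (h * m + tau * signal) / (h + tau)       # equation (16)
        h = h + tau
        history.append((expected, price, m, h))
    return history


# Illustrative values (not from the paper): D = 6, beta = 2, gamma = 1.
history = simulate_consistent(D=6.0, beta=2.0, gamma=1.0, tau=1.0,
                              m0=0.0, h0=1.0, shocks=[0.0] * 99)
expected, price, m, h = history[-1]
print(m, expected, 6.0 / (2.0 + 1.0))
```

Replacing the zero shocks with normal draws of precision $\tau$ adds noise
around the same path; the limit in equation (18) is unchanged.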

Muth² distinguishes between two concepts, the prediction of the model
(theory) $E_{t-1}(p_t)$ and the aggregate expectation of the firms $p_t^e$. It is
necessary for him to make the special assumption that $E_{t-1}(p_t) = p_t^e$,
which he calls the hypothesis of rational expectations. In our model and
from the Bayesian point of view, this relation arises naturally and is
nothing more than a necessary condition for the internal consistency of the
model we have presented here. In the above example, it is an immediate
consequence of the hypothesis of rational expectations that $E_{t-1}(p_t) =
p_t^e = D/(\beta + \gamma)$. In contrast, the consistent model we have presented
shows the process leading to equation (18). If the firm is using a correct
model, incorporating as many unknown parameters as might be needed,
then equations (12) and (13) must be satisfied. Indeed, it can be seen in
equations (12) and (13) that it is in the Bayesian framework that the
rational-expectations approach finds its natural setting.

² Muth uses simply the notation $E(p_t)$, but it is clear that the meaning is the same.


Sequences and Expectations

Up to this point we have dealt with the problem of rational expectations
essentially in the way that Muth posed it. The essential characteristic of
that problem is that the firms are attempting to find the equilibrium
price of output at every decision point. Another, and more sophisticated,
approach to the problem is to view the firms in the market as attempting
to reach equilibrium in a specified number of stages or time periods. The
logic of moving in a sequence of stages stems from the fact that large
and rapid changes in output are costly and, therefore, a smooth
adjustment is sought. Posing the problem in this way enables us to utilize control
theory and, thereby, to determine the optimum output the firms should
produce at each point. These outputs then play the role of the equilibrium
output in Muth’s definition of rational expectations. In this formulation,
a rational sequence of expectations leads to the optimum sequence of
outputs as determined from control theory.


The Control Model³

In order to illustrate these ideas, we will work with the competitive model.
We will assume that all but one of the firms is operating at the long-run
equilibrium price. The lone firm not yet in equilibrium can be viewed as
a new entrant or a firm that has been badly managed and has new
management. It is operating in a market with a fixed price, the long-run
equilibrium price, and is attempting to maximize profits (minimize losses)
in reaching the long-run equilibrium output.

It is assumed that at the beginning of the process, the firm’s output is
at a certain given level $Q_0$. Then for $j = 1, \ldots, n$,

$$Q_j = Q_{j-1} + u_j + e_j, \tag{19}$$

where $u_j$ is the adjustment that the firm makes to its output $Q_{j-1}$ when
moving into period $j$, and $e_j$ is a random factor affecting the output $Q_j$
with mean 0 and variance $\sigma_j^2$. It is assumed that the random factors
$e_1, \ldots, e_n$ are independent. If there is no random factor in any particular
period, then $\sigma_j^2 = 0$ for that period.

³ The notation in this section is independent of that used in previous models in the
paper.

In a general formulation there would be a specific sequence of targets
or goals $Q_1^*, \ldots, Q_n^*$ with the property that the firm would like to have
its output $Q_j$ in period $j$ close to the target $Q_j^*$. The cost of being away
from the target in period $j$ is assumed to be $\alpha_j(Q_j - Q_j^*)^2$, and the cost
of making an adjustment $u_j$ in the output when moving into period $j$ is
assumed to be $\beta_j u_j^2$, where $\alpha_j > 0$ and $\beta_j > 0$ are given constants.

In our particular example the targets $Q_1^*, \ldots, Q_n^*$ are all equal to the
same value $Q^*$, which is the long-run equilibrium output for this industry.
The cost $\alpha_j(Q_j - Q^*)^2$ reflects the distance of the firm’s output in a given
period from $Q^*$ and includes the usual costs of production. The cost
$\beta_j u_j^2$ reflects such items as hiring and firing costs, overtime pay,
supervision cost increases, and other such costs not included in the first term.
The symmetric quadratic form of these costs provides, under the usual
differentiability conditions, an effective second-order approximation to a
wide variety of actual cost functions.

Since the two costs we have defined are separate, the total cost in a
given period is the sum of the two, and the total cost of the process is the
sum of all these costs for $j = 1, \ldots, n$. Since we assume that the firm’s
utility function is linear, the problem is to choose the sequence of
adjustments $u_1, \ldots, u_n$ in order to maximize profits (minimize losses). Given
the fixed market price, the firm achieves this objective by minimizing the
overall expected cost

$$\sum_{j=1}^{n} E[\alpha_j(Q_j - Q^*)^2 + \beta_j u_j^2]. \tag{20}$$

The optimal sequence of adjustments can be derived in this problem
by the method of backward induction (DeGroot 1970, pp. 411-14) and
has the following properties. The optimal adjustment $u_j$ in period $j$ is a
linear function of the previous output $Q_{j-1}$, that is, $u_j = a_j Q_{j-1} + b_j$,
where the constants $a_j$ and $b_j$ depend on $Q^*$ and the values of $\alpha_j, \ldots, \alpha_n$
and $\beta_j, \ldots, \beta_n$. However, these optimal adjustments do not depend on
the variances $\sigma_1^2, \ldots, \sigma_n^2$.
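A minimal sketch of the backward induction for the constant-target case is
given below. The recursion is the standard one for a scalar quadratic
problem and is derived here as an assumption, since the text cites DeGroot
(1970) without reproducing the formulas: if the cost-to-go after period $j$
is $K_j(Q_j - Q^*)^2$ plus a constant, then with $A_j = \alpha_j + K_j$ the
minimizing adjustment is $u_j = -[A_j/(A_j + \beta_j)](Q_{j-1} - Q^*)$ and
$K_{j-1} = \beta_j A_j/(A_j + \beta_j)$, starting from $K_n = 0$. The function
name `backward_induction` is hypothetical.

```python
def backward_induction(alphas, betas, q_star):
    """Return [(a_1, b_1), ..., (a_n, b_n)] for u_j = a_j * Q_{j-1} + b_j.

    alphas[j-1] and betas[j-1] are the cost weights alpha_j and beta_j for
    period j. The variances of the random factors only add constants to the
    expected cost, so they do not appear in the recursion.
    """
    n = len(alphas)
    K = 0.0                       # quadratic weight of the cost-to-go, K_n = 0
    coeffs = [None] * n
    for j in range(n - 1, -1, -1):
        A = alphas[j] + K
        a = -A / (A + betas[j])
        coeffs[j] = (a, -a * q_star)
        K = betas[j] * A / (A + betas[j])
    return coeffs


# Illustrative two-period problem with unit weights and target Q* = 10.
for j, (a, b) in enumerate(backward_induction([1.0, 1.0], [1.0, 1.0], 10.0), 1):
    print(j, a, b)
```

In this illustration the firm closes a larger share of the gap in the first
period than in the last, because an early adjustment also lowers the
deviation costs of every later period.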

In the problem we are considering, we shall assume that the coefficients
$\alpha_j$ and $\beta_j$ which enter the cost function are of the form $\alpha_j = \alpha\rho^j$ and
$\beta_j = \beta\rho^j$, where $\alpha$ and $\beta$ are given positive constants and $\rho$ is a discount
factor ($0 < \rho < 1$). Thus, the total expected cost of the process is

$$\sum_{j=1}^{n} \rho^j E[\alpha(Q_j - Q^*)^2 + \beta u_j^2]. \tag{21}$$





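It can be shown that if we let the number of periods $n \to \infty$, the optimal
adjustment $u_j$ converges to the following value, for $j = 1, 2, \ldots$:

$$u_j = \gamma(Q^* - Q_{j-1}), \tag{22}$$

where $\gamma$ is a certain constant in the interval $0 < \gamma < 1$ specified by the
relation

$$\gamma = \frac{\sqrt{(\alpha + \beta - \rho\beta)^2 + 4\alpha\rho\beta} + \alpha + \rho\beta - \beta}{\sqrt{(\alpha + \beta - \rho\beta)^2 + 4\alpha\rho\beta} + \alpha + \rho\beta + \beta}. \tag{23}$$

With this choice of the adjustment $u_j$, the output in period $j$ will be
generated by the model

$$Q_j = (1 - \gamma)Q_{j-1} + \gamma Q^* + e_j. \tag{24}$$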
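As a numerical cross-check, the constant $\gamma$ of equation (23) can be
compared with the stationary limit of a discounted quadratic recursion. The
recursion used here, $K \leftarrow \beta A/(A + \beta)$ with
$A = \alpha + \rho K$ and $\gamma = A/(A + \beta)$, is an assumption of this
sketch (a standard scalar derivation), not a formula quoted from the paper;
the function names and parameter values are likewise illustrative.

```python
import math


def gamma_closed_form(alpha, beta, rho):
    """Evaluate the constant gamma of equation (23)."""
    s = math.sqrt((alpha + beta - rho * beta) ** 2 + 4 * alpha * rho * beta)
    return (s + alpha + rho * beta - beta) / (s + alpha + rho * beta + beta)


def gamma_by_iteration(alpha, beta, rho, sweeps=500):
    """Iterate the assumed stationary recursion until it settles."""
    K = 0.0
    for _ in range(sweeps):
        A = alpha + rho * K
        K = beta * A / (A + beta)
    A = alpha + rho * K
    return A / (A + beta)


def expected_path(q0, q_star, gamma, periods):
    """Expected outputs from equation (24) with the random factor set to 0."""
    path, q = [], q0
    for _ in range(periods):
        q = (1 - gamma) * q + gamma * q_star
        path.append(q)
    return path


g = gamma_closed_form(alpha=1.0, beta=2.0, rho=0.9)
print(g, gamma_by_iteration(1.0, 2.0, 0.9))
print(expected_path(q0=0.0, q_star=10.0, gamma=g, periods=5))
```

A larger adjustment-cost weight $\beta$ relative to $\alpha$ lowers $\gamma$, so
the firm closes a smaller fraction of the gap to $Q^*$ each period.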


Thus, with this derivation we can determine the optimal sequence of 
outputs for the firm to follow in adjusting to its ultimate target Q* The 
values , Q„ determine the least-cost path and help to determine 

another definition of rational expectations Thus, since the firm cannot 
reach the equilibrium value immediately, as is assumed in the Muth 
formulation, rational expectations in each period should be couched in 
terms of Q,, , Q„ More specifically, if wc return to the cobweb 

model which has occupied us in tins paper, the firm’s expectations about 
price in cac h period should lead to the appropriate output in the optimal 
sequence if the expectations are rational as defined 

Formally, will be a funttion of the firm’s expectation of 

the pricein period j based on all observed prices and outputs up to the 
end of period j — 1 Wc shall lety denote this function, so that 

Qj =/[£,-i(/'j)] + (25) 

where Cj again denotes a landom factor with mean 0 and variance tij 
which affects the output in period j 

We now define the expectation $E_{j-1}(p_j)$ to be rational if this expectation yields a model for $Q_j$ which agrees with the control theory model that was derived from the optimal sequence of adjustments. In other words, $E_{j-1}(p_j)$ is a rational expectation if

$$f[E_{j-1}(p_j)] = (1 + a_j)Q_{j-1} + b_j. \tag{26}$$

For the special problem with constant targets, discounted costs, and an infinite number of periods, $E_{j-1}(p_j)$ is a rational expectation if

$$f[E_{j-1}(p_j)] = (1 - \gamma)Q_{j-1} + \gamma Q^*. \tag{27}$$




For example, suppose as in equation (6) that $f$ is simply an increasing linear function of $E_{j-1}(p_j)$, as follows:

$$f[E_{j-1}(p_j)] = c_1 E_{j-1}(p_j) + c_2. \tag{28}$$

Then $E_{j-1}(p_j)$ is a rational expectation if and only if

$$E_{j-1}(p_j) = \frac{1}{c_1}[(1 - \gamma)Q_{j-1} + \gamma Q^* - c_2]. \tag{29}$$
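The linear case can be illustrated with a short sketch: given last period's output, the target, and the adjustment constant, (29) returns the price expectation that makes the linear response (28) reproduce the control-theory output in (27). The function names and the numerical values are hypothetical and serve only as an illustration.

```python
def optimal_output(q_prev, q_star, gamma):
    """Deterministic part of the control-theory model, as in (27)."""
    return (1 - gamma) * q_prev + gamma * q_star

def linear_response(expected_price, c1, c2):
    """Output response f(E) = c1*E + c2 from (28)."""
    return c1 * expected_price + c2

def rational_expectation(q_prev, q_star, gamma, c1, c2):
    """Price expectation defined by (29); requires c1 > 0 (f increasing)."""
    if c1 <= 0:
        raise ValueError("c1 must be positive for an increasing linear f")
    return (optimal_output(q_prev, q_star, gamma) - c2) / c1

if __name__ == "__main__":
    # Hypothetical values: previous output 4, target 10, gamma 0.5, f(E) = 2E + 1.
    e = rational_expectation(4.0, 10.0, 0.5, 2.0, 1.0)
    print(e, linear_response(e, 2.0, 1.0), optimal_output(4.0, 10.0, 0.5))
```

Feeding the expectation from (29) back through (28) should return exactly the output prescribed by (27), which is the sense in which the expectation is "rational" in this section.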

The sequence of expectations generated in this way has the property that the expectation is updated in each period $j$ after the output $Q_j$ in that period has been observed or, equivalently, after the price $p_j$ in that period has been observed. This updating is not “Bayesian” in the usual sense of the word, simply because there are no unknown parameters in this model about which the firm must learn. Furthermore, the firm’s expectations are not consistent or correct for the cobweb model in our earlier sense, because the actual expected price in period $j$, given the firm’s choice of $Q_j$ as determined by the market equations, will not be equal to $E_{j-1}(p_j)$. In what sense then are these expectations rational? They are rational in the sense that they lead the firm to an optimal output in each period, that is, they lead to a sequence of outputs that minimizes the total expected loss or, equivalently, maximizes the total expected profit.

As a result of the application of control theory, it is possible to extend the definition of rational expectations to a process-oriented model. There is no need to postulate a model in which the equilibrium price is attained by special assumptions about the average expectation. Given the costs of moving to the equilibrium position, it is possible, as we have done, to determine the points on an optimum path. A set of rational expectations should then lead the firm along this path. The net effect will be to dampen the fluctuation of the cobweb model and carry the firms toward the equilibrium position.


Conclusions 

Our basic thrust in this paper has been the development of models that describe the process by which rational expectations may be developed within a market. We accept the fundamental notion that phenomena based on nonrationality such as that exhibited in the cobweb model cannot exist for any period of significant duration. However, we have also pointed out that no attention is given to the process by which rational expectations may develop in a market. In particular, the learning process has been ignored and we have attempted to remedy this deficiency by introducing the concept of Bayesian learning. We have also added the concept of consistent as well as rational expectations. Finally, we have tackled the problem of expectations under conditions in which the firm cannot immediately move to an equilibrium position.*

The work on expectations has its ultimate significance in its impact on the construction of dynamic models. The development of dynamic models generally hinges on using some form of expectations as the means of enabling the decisions of one period to be made rationally. More generally, expectations in a dynamic model become the basis for relating a current period to future periods. We have indicated in this article some new approaches for incorporating expectations in dynamic models involving complex types of adjustments.


References 

Arrow, Kenneth J. “Alternative Approaches to the Theory of Choice in Risk-taking Situations.” Econometrica 19 (October 1951): 404-37.
Cyert, R. M., and DeGroot, M. H. “Bayesian Analysis and Duopoly Theory.” J.P.E. 78 (September/October 1970): 1168-84.
-----. “Interfirm Learning and the Kinked Demand Curve.” J. Econ. Theory 3 (September 1971): 272-87.
DeGroot, Morris H. Optimal Statistical Decisions. New York: McGraw-Hill, 1970.
Grossman, Sanford. “A Bayesian Approach to Static, Stochastic Equilibria.” Unpublished paper, H. G. B. Alexander Res. Found., Graduate School Bus., Univ. Chicago, August 1972.
Lucas, Robert E., Jr., and Prescott, Edward C. “Investment under Uncertainty.” Econometrica 39 (September 1971): 659-81.
Modigliani, Franco. “Liquidity and Uncertainty: Discussion.” A.E.R., Papers and Proc. 39 (May 1949): 201-8.
Muth, John F. “Rational Expectations and the Theory of Price Movements.” Econometrica 29 (July 1961): 315-35.
Savage, L. J. The Foundations of Statistics. New York: Wiley, 1954.


* It should be noted that we have retained Muth’s approach of defining rational expectation in terms of conditions on just the mean value of a distribution and have attempted to expand the concept as described. Lucas and Prescott (1971) have defined rational expectation of a variable to mean that the actual values and the anticipated values of the variable have the same probability distribution.



Competitive Firm and the Theory of Input 
Demand under Price Uncertainty 


Raveendra N. Batra and Aman Ullah

Southern Methodist University 


In this paper, we examine the behavior of the competitive firm faced with making input-hiring decisions under conditions of price uncertainty. Unlike Sandmo and Baron, among others, we show categorically that a marginal increase in uncertainty stimulates a decline in the firm’s output, provided the absolute risk aversion is decreasing. Furthermore, the implications of a change in expected price remain unchanged, but those of a change in input price need not be the same as in the certainty case. Specifically, the input demand function is downward sloping only if the production function is well behaved.


The purpose of this paper is to develop systematically the theory of input demand for a competitive firm facing price uncertainty. For the sake of simplicity, we postulate a two-input model and examine the implications of uncertainty for the firm’s output and input demand under the assumption that the competitive firm, facing given input prices, seeks to maximize the expected utility from profits. Earlier contributors to the theory of the firm have either approached the question from the side of product markets,¹ or, in examining the firm’s behavior from the side of factor markets, they have assumed that the firm seeks to maximize expected profits and is thus risk neutral.² The focus of this paper is on the expected utility approach, which permits a more general formulation of the firm’s attitude toward risk.

We are indebted to J. Hadar and W. Russell for useful discussions on the subject matter of this paper.

¹ See, for example, McCall 1967; Horowitz 1970, chap. 12; Sandmo 1971; and Leland 1972.

² See, for example, Smith 1970; Tisdell 1968, chap. 3; and Rothenberg and Smith 1971. One exception is that of Horowitz, who uses the expected utility approach to analyze the implications for the firm’s decision making of uncertainty in the production function (not in the product price) in terms of a long-run model allowing for variability in the use of both capital and labor. Apart from the fact that Horowitz is concerned with different questions, his analysis is incomplete because he does not get involved in the comparative-statics properties of his model. It may be mentioned in passing that in another chapter, Horowitz anticipates some of the results derived recently by Sandmo (1971), especially the results concerning the output level of the risk evader and the implications of a change in fixed costs.

For the main tools of analysis, we draw upon the recent seminal work by Sandmo (1971), who has examined under price uncertainty the behavior of the competitive firm from the side of the product market. However, in contrast to Sandmo’s results, which are obtained from a short-run model of the firm where the capital stock is given, our results are derived from a long-run model where all factors of production are variable.³ Furthermore, Sandmo (1971) is unable to derive categorical effects of a marginal increase in uncertainty, whereas we show that an increase in uncertainty leads to a decline in the firm’s output, provided absolute risk aversion is decreasing.

I. Assumptions and the Basic Model 

Unless otherwise specified, the following assumptions will be maintained throughout the paper.

1. The firm is a price taker in the probabilistic sense. That is to say, the firm is not large enough to influence its subjective probability distribution about the sales price, which is assumed to be a random variable. Furthermore, the prices of the two inputs, capital and labor, are given to the firm.

2. The decisions concerning the volume of output and hence the utilization of inputs must be made prior to the knowledge of the market price.

3. The firm seeks to maximize expected utility from profits, and its attitude toward risk can be described by a von Neumann-Morgenstern utility function, which among other things requires that the firm’s decisions are made by one individual.

Let $U$ stand for utility and $\pi$ for profits. Then the firm’s utility function is given by

$$U = U(\pi), \tag{1}$$

where $U'(\pi) > 0$ and $U''(\pi) \lesseqgtr 0$, depending on whether the firm is risk averse, risk neutral, or a risk preferrer.⁵ Let $q$ = output, $K$ = capital, and $L$ = labor. Then the production function of the firm is given by

$$q = f(K, L). \tag{2}$$

³ Actually, the short-run results of Sandmo can be derived as special cases of our results.

⁴ This assumption is central to much of the recent work in the theory of the firm under uncertainty (see, for example, McCall 1967, 1971; Sandmo 1971; and Leland 1972). Acceptance of this utility function implies acceptance of the axioms underlying the existence of the von Neumann-Morgenstern utility function. For a lucid analysis of these axioms, see De Groot (1970) and Horowitz (1970, chap. 12).

⁵ Arrow (1965) has shown that if the utility function is to satisfy the von Neumann-Morgenstern axioms, it must be bounded from above. This means that as $\pi$ approaches infinity, the utility function must become concave, so that at least in the range close to the upper bound of the utility function, $U''(\pi) < 0$. This also suggests that for the firm to be risk neutral [$U''(\pi) = 0$] or a risk preferrer [$U''(\pi) > 0$], $\pi$ should be restricted to a finite value.

The only assumption that we make about the production function is that $f_K$ and $f_L$, the marginal products of capital and labor, respectively, are positive and that the isoquants are convex to the origin, so that $2f_K f_L f_{KL} - f_L^2 f_{KK} - f_K^2 f_{LL} > 0$, which, it may be noted, does not necessarily imply that $f_{ii} < 0$ ($i = L, K$).⁶ The firm’s total cost of production is given by $wL + rK$, so that its profit is given by

$$\pi = pq - wL - rK, \tag{3}$$

where $p$ is product price, which is assumed to be a random variable with density function $f(p)$ and mean $E(p) = \mu$. We assume that $p$ is nonnegative.

In view of (2) and (3), the expected utility from profit is furnished by

$$E\{U[pf(K, L) - wL - rK]\}, \tag{4}$$

where $E$ is the expectations operator. The firm chooses input quantities so as to maximize expected utility from profits. The first-order conditions for the maximum are

$$\frac{\partial E(U)}{\partial L} = E[U'(\pi)(pf_L - w)] = 0 \tag{5}$$

and

$$\frac{\partial E(U)}{\partial K} = E[U'(\pi)(pf_K - r)] = 0. \tag{6}$$


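The second-order conditions for the maximum are given by

$$\frac{\partial^2 E(U)}{\partial L^2} = A_1 = E[U''(\pi)(pf_L - w)^2 + pf_{LL}U'(\pi)] < 0, \tag{7}$$

$$\frac{\partial^2 E(U)}{\partial K^2} = A_2 = E[U''(\pi)(pf_K - r)^2 + pf_{KK}U'(\pi)] < 0, \tag{8}$$

and

$$\frac{\partial^2 E(U)}{\partial L^2}\,\frac{\partial^2 E(U)}{\partial K^2} - \left[\frac{\partial^2 E(U)}{\partial L\,\partial K}\right]^2 > 0, \tag{9*}$$

$$A_1 A_2 - B_1^2 = D > 0, \tag{9}$$

where

$$B_1 = \frac{\partial^2 E(U)}{\partial L\,\partial K} = E[U''(\pi)(pf_K - r)(pf_L - w) + pf_{KL}U'(\pi)]. \tag{10}$$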


⁶ See, for example, Hadar (1970, chap. 3, pp. 26-27) on this point.





Before we proceed further with our analysis, it is necessary first to briefly examine the second-order conditions for the attainment of maximum expected utility. In the certainty case, utility maximization requires that $f_{ii} < 0$ and $f_{LL}f_{KK} - f_{KL}^2 > 0$ ($i = L, K$), which means that the production function is strictly concave.⁷ We will now show that if the firm is risk averse, strict concavity of the production function is sufficient but not necessary to ensure expected utility maximization.

Expanding (9*) by using (7), (8), and (10), we obtain

$$\begin{aligned} D = {} & \{E(U''a^2)E(U''b^2) - [E(U''ab)]^2\} \\ & + \{E(pU')[f_{KK}E(U''a^2) + f_{LL}E(U''b^2) - 2f_{KL}E(U''ab)]\} \\ & + \{[E(pU')]^2(f_{LL}f_{KK} - f_{KL}^2)\}, \end{aligned} \tag{11}$$

where $a = pf_L - w$, $b = pf_K - r$, and $U' = U'(\pi)$.⁸ From the first-order conditions given by (5) and (6), since $f_L$ and $f_K$ are nonrandom and since $w$ and $r$ are given,

$$\frac{w}{r} = \frac{f_L E[U'(\pi)p]}{f_K E[U'(\pi)p]} = \frac{f_L}{f_K}. \tag{12}$$

Using this in $a$ and $b$, we get

$$a = (pf_L - w) = \frac{f_L}{f_K}(pf_K - r) = \frac{f_L}{f_K}\,b, \tag{13}$$

so that $a^2 = b^2 f_L^2/f_K^2$. Using this, the first term in (11) reduces to zero, and $D$ becomes $D = -E(pU')E[U''b^2/f_K^2](2f_L f_K f_{KL} - f_L^2 f_{KK} - f_K^2 f_{LL}) + [E(pU')]^2(f_{LL}f_{KK} - f_{KL}^2)$. For second-order conditions to be satisfied, $A_1$ and $A_2$ from (7) and (8) should each be negative, whereas $D$ should be positive. If the firm is risk averse, then $U''(\pi) < 0$, in which case $A_1$ and $A_2$ may be negative even if $f_{LL}$ and $f_{KK}$ are positive. This immediately suggests that the production function need not be strictly concave for (7) and (8) to be satisfied. This also holds true for (9), because $D$ may be positive even if $(f_{LL}f_{KK} - f_{KL}^2) < 0$, provided, of course, that isoquants are convex to the origin so that $(2f_L f_K f_{KL} - f_L^2 f_{KK} - f_K^2 f_{LL}) > 0$. Thus, if the firm is risk averse, the concavity of the production function is no longer necessary for the firm to be at the optimum. However, concavity is sufficient because it implies $f_{ii} < 0$ as well as convexity of the isoquants toward the origin.

If the firm is risk neutral, then concavity is both necessary and sufficient for (7), (8), and (9) to be satisfied, because here $U''(\pi) = 0$. However, if the firm is a risk preferrer, so that $U''(\pi) > 0$, then the concavity of the production function becomes a necessary but not a sufficient condition for the firm to be at the optimum.

⁷ See, for instance, Henderson and Quandt 1971, pp. 61-69.

⁸ Whenever the equation is too long, we will write $U'$ for $U'(\pi)$.




Suppose the firm is risk averse. Then the convexity of the isoquants toward the origin along with (12) imply that, even with uncertainty about the product price, the firm minimizes the unit cost of production for whatever output it chooses to produce. As with the certainty case, the marginal rate of substitution between the factors equals the factor price ratio at the optimum, even when the price is seen by the firm as a random variable. This is very interesting, because in the following section we show that the risk-averse firm produces an output at which the expected value of the marginal product of each factor exceeds the given input price. In other words, the firm produces an output at which expected price exceeds marginal cost, but the level of output is one where the average cost is minimum.


II. Optimal Input Demand and Output under Uncertainty 

In this section, we will show that under price uncertainty, the risk-averse competitive firm hires smaller quantities of inputs and thus produces lower output than in the case where $p$ is known with certainty. The problem may be formulated in terms of this question: What are the optimal input demand and output under uncertainty as compared with the case where the product price is known with certainty to be equal to the mean of the probability distribution?

The first-order conditions (5) and (6) can be rewritten as

$$E[U'(\pi)pf_L] = E[U'(\pi)w] \tag{5*}$$

and

$$E[U'(\pi)pf_K] = E[U'(\pi)r]. \tag{6*}$$

Subtracting $E[U'(\pi)\mu f_L]$ from both sides of (5*), we obtain

$$E[U'(\pi)f_L(p - \mu)] = E[U'(\pi)(w - \mu f_L)]. \tag{14}$$

Applying the expectations operator to both sides of the profit equation (3) yields $E(\pi) = E(p)q - wL - rK$ or $E(\pi) - \mu q = -(wL + rK)$. Substituting the expression for $-(wL + rK)$ in (3) then furnishes $\pi = E(\pi) + (p - \mu)q$. If the utility function is strictly concave, so that $U''(\pi) < 0$, then $U'(\pi) \le U'[E(\pi)]$ for $p \ge \mu$, or by multiplying through by $f_L(p - \mu)$, $U'(\pi)f_L(p - \mu) \le U'[E(\pi)]f_L(p - \mu)$ for all $p$. Taking expectations on both sides and noting that $U'[E(\pi)]$ is a given number and that $f_L$ is nonrandom, we obtain

$$E[U'(\pi)f_L(p - \mu)] \le 0. \tag{15}$$

Using this result in (14) implies that $E[U'(\pi)(w - \mu f_L)] \le 0$, and since marginal utility is always positive, $w \le \mu f_L$. Similarly, we can show that $r \le \mu f_K$.





In other words, under uncertainty equilibrium, the expected marginal value productivity of each factor exceeds its price, or, what is the same thing, $E(p)$ exceeds marginal cost. By contrast, the certainty equilibrium is defined by the equality between marginal value productivity of each factor with its price. Furthermore, under certainty conditions, $f_{ii} < 0$ is necessary to satisfy the second-order conditions for utility maximization. Thus, if $f_{ii} < 0$, it is clear that under uncertainty, the optimal quantity demanded of each input is lower than in the certainty case. It then follows that the optimal output produced under uncertainty will also be lower than the certainty output.
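The comparison between the uncertainty and certainty optima can be illustrated numerically. The sketch below is only an example under stated assumptions: a Cobb-Douglas production function with decreasing returns, logarithmic utility of wealth plus profit (risk averse, with decreasing absolute risk aversion), a two-point price distribution with mean 1, and a simple coordinate-ascent search. None of these functional forms, parameter values, or function names comes from the paper.

```python
import math

A_K, A_L = 0.3, 0.4                 # hypothetical Cobb-Douglas exponents
W, R = 0.2, 0.2                     # hypothetical wage and rental rate
WEALTH = 5.0                        # hypothetical wealth; keeps log utility defined
MU = 1.0                            # mean price in both scenarios
RISKY = [(0.5, 0.5), (1.5, 0.5)]    # (price, probability) pairs with mean MU
CERTAIN = [(MU, 1.0)]               # price known to equal its mean

def output(K, L):
    return K ** A_K * L ** A_L

def expected_utility(K, L, prices):
    q = output(K, L)
    return sum(prob * math.log(WEALTH + p * q - W * L - R * K)
               for p, prob in prices)

def argmax_1d(g, lo=1e-6, hi=10.0, tol=1e-10):
    """Golden-section search for the maximizer of a unimodal function."""
    ratio = (math.sqrt(5) - 1) / 2
    while hi - lo > tol:
        c = hi - ratio * (hi - lo)
        d = lo + ratio * (hi - lo)
        if g(c) >= g(d):
            hi = d
        else:
            lo = c
    return (lo + hi) / 2

def optimal_inputs(prices, sweeps=100):
    """Coordinate ascent over (K, L); the objective here is concave."""
    K, L = 1.0, 1.0
    for _ in range(sweeps):
        K = argmax_1d(lambda k: expected_utility(k, L, prices))
        L = argmax_1d(lambda l: expected_utility(K, l, prices))
    return K, L

def marginal_products(K, L):
    q = output(K, L)
    return A_K * q / K, A_L * q / L     # (f_K, f_L)

if __name__ == "__main__":
    for label, prices in (("certain", CERTAIN), ("risky", RISKY)):
        K, L = optimal_inputs(prices)
        f_K, f_L = marginal_products(K, L)
        print(label, K, L, output(K, L), f_L / f_K, MU * f_L - W)
```

In this example the risky optimum should use less of both inputs than the certain one, the ratio $f_L/f_K$ should stay equal to $w/r$ as in (12), and $\mu f_L$ should exceed $w$, in line with the argument following (15).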

Until now, our analysis has been concerned with the implications for optimal input demand and output of what Sandmo calls the overall impact of uncertainty. We will now consider the effects of a marginal increase in uncertainty by defining the increased variability of the density function of the price in terms of a “mean preserving spread.”⁹ Let us define $p^*$ as $p^* = \gamma p + \theta$, where $\theta$ and $\gamma$ are the shift parameters which initially equal zero and unity, respectively. Then the mean preserving spread type of shift in the density function of $p^*$ leaves the mean $E(p^*)$ unchanged, that is, $dE(p^*) = dE(\gamma p + \theta) = \mu\,d\gamma + d\theta = 0$, or

$$\frac{d\theta}{d\gamma} = -\mu. \tag{16}$$
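A small sketch may help fix the idea of this shift. Integrating (16) from the initial point $\gamma = 1$, $\theta = 0$ gives $\theta = \mu(1 - \gamma)$, so the transformed price $\gamma p + \theta$ keeps the mean $\mu$ while its variance is scaled by $\gamma^2$. The discrete price distribution and the function names below are hypothetical.

```python
def mean(prices):
    """Mean of a discrete distribution given as (value, probability) pairs."""
    return sum(p * prob for p, prob in prices)

def variance(prices):
    m = mean(prices)
    return sum(prob * (p - m) ** 2 for p, prob in prices)

def mean_preserving_spread(prices, gamma):
    """Return the distribution of p* = gamma*p + theta with theta = mu*(1 - gamma).

    theta follows from d(theta)/d(gamma) = -mu in (16) with theta = 0 at gamma = 1.
    """
    theta = mean(prices) * (1 - gamma)
    return [(gamma * p + theta, prob) for p, prob in prices]

if __name__ == "__main__":
    base = [(0.5, 0.25), (1.0, 0.5), (1.5, 0.25)]   # hypothetical price distribution
    spread = mean_preserving_spread(base, 1.2)
    print(mean(base), mean(spread))
    print(variance(base), variance(spread))
```

A value of $\gamma$ slightly above 1 therefore represents a marginal increase in uncertainty with the expected price held fixed.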


Using $p^*$ instead of $p$, $\pi$ and the first-order conditions are replaced by $\pi = (p\gamma + \theta)f(K, L) - wL - rK$, $E\{U'(\pi)[(p\gamma + \theta)f_L - w]\} = 0$, and $E\{U'(\pi)[(p\gamma + \theta)f_K - r]\} = 0$. Differentiating these conditions totally with respect to $\gamma$, and using (16), we obtain

$$A_1\frac{dL}{d\gamma} + B_1\frac{dK}{d\gamma} = H_1 \tag{17}$$

and

$$B_1\frac{dL}{d\gamma} + A_2\frac{dK}{d\gamma} = H_2, \tag{18}$$

where $A_1$, $A_2$, and $B_1$ have been defined before, and where

$$-H_1 = E[qU''(\pi)(pf_L - w)(p - \mu)] + E[U'(\pi)f_L(p - \mu)] \tag{19}$$

and

$$-H_2 = E[qU''(\pi)(pf_K - r)(p - \mu)] + E[U'(\pi)f_K(p - \mu)], \tag{20}$$


⁹ Defining a change in uncertainty in terms of a change in the probability distribution while keeping its mean constant is quite common with economic theorists nowadays (see, for example, Sandmo 1970, 1971, 1972; Rothenberg and Smith 1970; and Rothschild and Stiglitz 1970, 1971).



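remembering that initially $\gamma = 1$ and $\theta = 0$. Solving (17) and (18) simultaneously, noting that $H_1 = (f_L/f_K)H_2$ and using (9), we obtain

$$\frac{dL}{d\gamma} = \frac{H_2[(f_L/f_K)A_2 - B_1]}{D} = \frac{H_2 E[U'(\pi)p](f_L f_{KK} - f_K f_{KL})}{Df_K} \tag{21}$$

and

$$\frac{dK}{d\gamma} = \frac{H_2 E[U'(\pi)p](f_K f_{LL} - f_L f_{KL})}{Df_K}. \tag{22}$$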

The signs of $dL/d\gamma$ and $dK/d\gamma$ depend on the signs of $H_2$ and of $(f_L f_{KK} - f_K f_{KL})$ and $(f_K f_{LL} - f_L f_{KL})$, given the fact that $f_K > 0$ and, from (9), $D$ is positive. Now the sign of $H_2$ from (20) depends on the signs of (i) $E[U'(\pi)(p - \mu)]$, which has already been shown to be negative under risk aversion, and (ii) $E[U''(\pi)(pf_K - r)(p - \mu)]$. In the next section, we show that under the hypothesis of decreasing absolute risk aversion, $E[U''(\pi)(pf_K - r)]$ is positive. Thus

$$E[U''(\pi)(pf_K - r)(p - \mu)] = E\left[U''(\pi)\frac{(pf_K - r)^2}{f_K}\right] + E[U''(\pi)(pf_K - r)]\left(\frac{r}{f_K} - \mu\right), \tag{23}$$

where $E\{U''(\pi)[(pf_K - r)^2/f_K]\} < 0$ if firms are risk averse and, as shown before, $(r/f_K) \le \mu$. Therefore, with $E[U''(pf_K - r)]$ positive under decreasing absolute risk aversion, (23) is clearly negative. All this discussion suggests that the right-hand side of (20) is negative, so that $H_2 > 0$.

Once $H_2$ is demonstrated to be positive, then for $dL/d\gamma$ to be negative, it is clear that $(f_L f_{KK} - f_K f_{KL})$ must be negative. Similarly, for $dK/d\gamma < 0$, $(f_K f_{LL} - f_L f_{KL}) < 0$. Given that $f_{ii} < 0$, then a sufficient condition for the marginal impact of uncertainty to be in line with the overall impact is that $f_{KL} > 0$, which, of course, is satisfied for several well-known production functions.¹⁰ However, if $(f_L f_{KK} - f_K f_{KL})$ and $(f_K f_{LL} - f_L f_{KL})$ are positive because $f_{KL} < 0$, then the marginal impact of uncertainty will be opposite to the overall impact.

However, although the marginal and overall effects of uncertainty on the firm’s input utilization may not be the same, it still turns out that the marginal and overall impacts of uncertainty for the firm’s output run in the same direction. Differentiating the production function given by (2) with respect to $\gamma$ and using (21) and (22), we obtain $dq/d\gamma = f_L(dL/d\gamma) + f_K(dK/d\gamma) = -\{(2f_L f_K f_{KL} - f_L^2 f_{KK} - f_K^2 f_{LL})E[U'(\pi)p]H_2\}/(Df_K)$. It can be easily seen that with convex isoquants and $H_2 > 0$, $dq/d\gamma < 0$. Thus, an increase in uncertainty leads to a decline in the firm’s output, if absolute risk aversion is decreasing. Note that Sandmo (1971), among many others, has been unable to obtain the predictable effects of a change in variance alone on the risk-averse firm’s output.

¹⁰ A production function is said to be “well behaved” if $f_{ii} < 0$ and $f_{KL} > 0$. Thus, our results necessarily hold for well-behaved production functions.


III. The Supply Function

This section is devoted to the analysis of the firm’s supply function. Specifically, we wish to determine the firm’s output response to a change in the expected price. The problem may be formulated as this: Let price be written as $p + \theta$, where $\theta$ again is the shift parameter. A change in $\theta$ shifts the probability distribution to the left or to the right and changes the expected value of price without altering the shape of the probability distribution.

Differentiating (5) and (6) with respect to $\theta$ and evaluating the derivatives at $\theta = 0$, we obtain

$$A_1\frac{\partial L}{\partial\theta} + B_1\frac{\partial K}{\partial\theta} = -E[U''(\pi)(pf_L - w)q + f_L U'(\pi)] = F_1 \tag{24}$$

and

$$B_1\frac{\partial L}{\partial\theta} + A_2\frac{\partial K}{\partial\theta} = -E[U''(\pi)(pf_K - r)q + f_K U'(\pi)] = F_2. \tag{25}$$

The solution of this system yields

$$\frac{\partial L}{\partial\theta} = \frac{A_2 F_1 - B_1 F_2}{D}, \tag{26}$$

$$\frac{\partial K}{\partial\theta} = \frac{A_1 F_2 - B_1 F_1}{D}. \tag{27}$$

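Since $A_1$ and $A_2$ are negative and $D > 0$, our problem now is to pin down the signs of $B_1$, $F_1$, and $F_2$. Using (13) in (10) gives us

$$B_1 = E[U''(\pi)(pf_K - r)^2(f_L/f_K) + f_{KL}U'(\pi)p] = E[U''(\pi)(pf_L - w)^2(f_K/f_L) + f_{KL}U'(\pi)p]. \tag{28}$$

Utilizing (12) and the new expression for $B_1$ and the expressions for $F_1$ and $F_2$ in (26) and (27), we obtain

$$\frac{\partial L}{\partial\theta} = \frac{E(U'p)\{E(U') + (q/f_K)E[U''(pf_K - r)]\}(f_K f_{KL} - f_L f_{KK})}{D} \tag{29}$$

and

$$\frac{\partial K}{\partial\theta} = \frac{E(U'p)\{E(U') + (q/f_L)E[U''(pf_L - w)]\}(f_L f_{KL} - f_K f_{LL})}{D}. \tag{30}$$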


We will show later in this section that $E[U''(pf_K - r)]$ and $E[U''(pf_L - w)]$ are positive under the hypothesis of decreasing absolute risk aversion or increasing absolute risk preference. If we further assume that $f_{ii} < 0$ and $f_{KL} > 0$, it is clear that $\partial L/\partial\theta > 0$ and $\partial K/\partial\theta > 0$. These results continue to hold if the firm is risk neutral so that $U'' = 0$. In other words, a rise in the expected price will induce the firm to hire more inputs. However, if $f_{KL} < 0$, then the results are not clear even if $f_{ii} < 0$. The expression for the change in output can be obtained by using (29) and (30). Thus, noting that $(pf_K - r)/f_K = (pf_L - w)/f_L$,

$$\frac{\partial q}{\partial\theta} = \frac{E(U'p)\{E(U') + (q/f_K)E[U''(pf_K - r)]\}(2f_L f_K f_{KL} - f_L^2 f_{KK} - f_K^2 f_{LL})}{D}. \tag{31}$$

Evidently, risk neutrality, decreasing absolute risk aversion, or increasing absolute risk preference are sufficient to ensure that a rise in the expected price will induce the firm to increase its output. It is interesting to note that the effects of an increase in price on input demand and output under price uncertainty are the same as the corresponding results available in the certainty case.¹¹

It is high time that we define the concepts of the firm’s attitude toward risk. Following Pratt (1964) and Arrow (1965), let us define the firm’s risk attitude ($R_a$) as $R_a(\pi) = -[U''(\pi)]/[U'(\pi)]$, where $R_a(\pi)$ is commonly called the absolute risk-aversion function.¹² For risk-averse firms, $R_a(\pi)$ is positive, for risk preferrers negative, and for risk-indifferent firms zero. For risk-averse firms, Arrow suggests that $R_a(\pi)$ is decreasing in $\pi$. This is the hypothesis of decreasing absolute risk aversion. However, if $R_a(\pi) < 0$, it seems reasonable that $-R_a(\pi)$ be an increasing function of $\pi$. If a firm prefers risk, then its risk preference should normally increase with the rise in profits. This we call the hypothesis of increasing absolute risk preference.
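The absolute risk-aversion function can be evaluated numerically for any smooth utility function. The sketch below approximates $R_a(\pi) = -U''(\pi)/U'(\pi)$ with central finite differences and applies it to three illustrative utility functions chosen here as assumptions (they are not from the paper): a logarithmic utility, for which $R_a$ is positive and decreasing in $\pi$; a negative exponential utility, for which $R_a$ is a positive constant; and a convex exponential utility, for which $R_a$ is negative.

```python
import math

def absolute_risk_aversion(utility, profit, h=1e-4):
    """Approximate R_a(pi) = -U''(pi)/U'(pi) by central finite differences."""
    first = (utility(profit + h) - utility(profit - h)) / (2 * h)
    second = (utility(profit + h) - 2 * utility(profit) + utility(profit - h)) / h ** 2
    return -second / first

def log_utility(profit):
    """Hypothetical risk-averse utility; R_a = 1/(5 + pi), decreasing in pi."""
    return math.log(5.0 + profit)

def exponential_utility(profit):
    """Hypothetical risk-averse utility with constant R_a = 0.5."""
    return -math.exp(-0.5 * profit)

def convex_utility(profit):
    """Hypothetical risk-preferring utility; R_a = -0.5."""
    return math.exp(0.5 * profit)

if __name__ == "__main__":
    for pi in (0.0, 1.0, 2.0):
        print(pi,
              absolute_risk_aversion(log_utility, pi),
              absolute_risk_aversion(exponential_utility, pi),
              absolute_risk_aversion(convex_utility, pi))
```

Only the first of these satisfies the hypothesis of decreasing absolute risk aversion used in the text; the second is the boundary case of constant absolute risk aversion.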

Following the line of reasoning suggested by Sandmo, we will now show that under these hypotheses $E[U''(\pi)(pf_K - r)] > 0$. Let $\bar\pi$ be the profit level when $p = r/f_K$. Since $(pf_K - r)/f_K = (pf_L - w)/f_L$, $p$ also equals $w/f_L$. If $R_a(\pi) > 0$ and $R_a'(\pi) < 0$, then $-[U''(\pi)]/[U'(\pi)] = R_a(\pi) \le R_a(\bar\pi)$ for $p \ge r/f_K$, where $R_a(\bar\pi)$ is a given number. Multiplying both sides by $-U'(\pi)(pf_K - r)$, we obtain $U''(\pi)(pf_K - r) \ge -R_a(\bar\pi)U'(\pi)(pf_K - r)$ for all $p$. Taking expectations on both sides and noting (6), $E[U''(\pi)(pf_K - r)] > -R_a(\bar\pi)E[U'(\pi)(pf_K - r)] = 0$. The same proof also applies to the case where $-R_a(\pi) > 0$ and $-R_a'(\pi) > 0$. Here $[U''(\pi)]/[U'(\pi)] = -R_a(\pi) \ge -R_a(\bar\pi)$ for $p \ge r/f_K$, so that $U''(\pi)(pf_K - r) \ge -R_a(\bar\pi)U'(\pi)(pf_K - r)$ for all $p$ and hence $E[U''(\pi)(pf_K - r)] > -R_a(\bar\pi)E[U'(\pi)(pf_K - r)] = 0$. Therefore, $E[U''(\pi)(pf_K - r)]$ is positive.

¹¹ In the certainty case, it can be easily seen that $\partial L/\partial p = [p(f_K f_{KL} - f_L f_{KK})]/D$, $\partial K/\partial p = [p(f_L f_{KL} - f_K f_{LL})]/D$, and $\partial q/\partial p = [p(2f_L f_K f_{KL} - f_L^2 f_{KK} - f_K^2 f_{LL})]/D$. The effects of a change in $p$ on the input demand and output under certainty as well as uncertainty are the same in the sense that, in both situations, the effects are determined by the signs of the same expressions. Now, if the same expressions have different signs under certainty versus uncertainty, then that is a different matter. However, if we assume concave production functions, then the effects of price changes in both situations are similar.

¹² Also, see McCall 1971 and Sandmo 1971.


IV. The Input Demand Functions 

We now turn to the effect of a change in the price of any input on the firm’s demand for the two factors. For the sake of simplicity, suppose there occurs a change in the wage rate but $r$ remains unaltered. Differentiating the first-order conditions (5) and (6) totally with respect to $w$ alone and remembering that $dr = 0$, we obtain $A_1(\partial L/\partial w) + B_1(\partial K/\partial w) = C_1$ and $B_1(\partial L/\partial w) + A_2(\partial K/\partial w) = C_2$, where $C_1 = LE[U''(\pi)(pf_L - w)] + E[U'(\pi)]$ and $C_2 = LE[U''(\pi)(pf_K - r)]$. Following exactly the procedure established in the earlier section, we find that

$$\frac{\partial L}{\partial w} = \frac{A_2 E(U')f_K + LE(U'p)E[U''(pf_K - r)](f_L f_{KK} - f_K f_{KL})}{Df_K},$$

$$\frac{\partial K}{\partial w} = \frac{-A_1 E(U')f_K + E(U'p)\{LE[U''(pf_L - w)] + E(U')\}(f_K f_{LL} - f_L f_{KL})}{Df_L}, \tag{32}$$

and

$$\frac{\partial q}{\partial w} = \frac{E(U'p)\{LE[U''(pf_K - r)](f_L^2 f_{KK} + f_K^2 f_{LL} - 2f_L f_K f_{KL}) + f_K E(U')(f_L f_{KK} - f_K f_{KL})\}}{Df_K}. \tag{33}$$

The results, as usual, depend on our hypotheses of the firm’s attitude toward risk as well as the signs of $f_{ii}$ and $f_{KL}$. In the certainty case, $\partial L/\partial w < 0$, whereas the signs of $\partial K/\partial w$ and $\partial q/\partial w$ depend on the sign of $f_{KL}$.¹³ These results continue to hold when the firm is risk neutral, for then $\partial L/\partial w < 0$, whereas the sign of $\partial q/\partial w$ depends, among other things, on the sign of $f_{KL}$. In the case of $\partial K/\partial w$, once it is realized that with risk neutrality $A_1$ now reduces to $f_{LL}E[U'(\pi)p]$, it can be clearly seen that the sign of $\partial K/\partial w$ depends only on the sign of $f_{KL}$.

The results, however, are not so clear and interpretable when the firm is not risk indifferent. If we assume that $f_{KL} > 0$ and $f_{ii} < 0$, then under the hypothesis of decreasing absolute risk aversion or increasing absolute risk preference, which ensures that $E[U''(\pi)(pf_K - r)] > 0$, $\partial L/\partial w < 0$ and $\partial q/\partial w < 0$. However, the sign of $\partial K/\partial w$ is still indeterminate because the second term in (32) is negative, whereas the first term, with $A_1 < 0$ from (7), is positive.

¹³ In the certainty case, $\partial L/\partial w = (pf_{KK})/[p^2(f_{LL}f_{KK} - f_{KL}^2)]$, $\partial K/\partial w = -(pf_{KL})/[p^2(f_{LL}f_{KK} - f_{KL}^2)]$, and $\partial q/\partial w = [p(f_L f_{KK} - f_K f_{KL})]/[p^2(f_{LL}f_{KK} - f_{KL}^2)]$.

The differences between these results and those available under certainty must now be evident. The main differences are:

i) Under uncertainty, the sign of $f_{KL}$ also enters into the determination of the sign of $\partial L/\partial w$, whereas in the certainty case $f_{KL}$ plays no such role, and as a result the sign of $\partial L/\partial w$ is unambiguously negative.

ii) Under certainty, the sign of $\partial K/\partial w$ is determined solely by the sign of $f_{KL}$, whereas under uncertainty the sign of $\partial K/\partial w$ is ambiguous.

The crux of this discussion is that with well-behaved production functions (so that $f_{ii} < 0$ and $f_{KL} > 0$) and with our hypotheses concerning the firm’s risk attitude, a rise in the price of any input induces the firm to reduce the demand for that input and lower its output. However, the firm’s response toward the demand for the other input is indeterminate.


V. Conclusions

Using a two-input, long-run equilibrium model of the competitive firm facing price uncertainty, we have derived the following results in the foregoing sections:

1. The concavity of the production function is both necessary and sufficient to ensure that the firm reaches an optimum only if the firm is risk neutral. If the firm is risk averse, concavity is sufficient but no longer necessary, whereas if the firm is a risk preferrer, the opposite is true, provided the firm’s objective is to maximize expected utility from profits.

2. The risk-averse firm utilizes smaller quantities of inputs and thus produces a lower output than a firm operating under certainty, even though the former minimizes the unit cost of production of whatever output it chooses to produce. Furthermore, an increase in uncertainty leads to a decline in the firm’s output if its absolute risk aversion is decreasing in profits.

3. The marginal and overall impacts of uncertainty need not be the same for input demand, although for the output level, they continue to be the same.

4. The presence of uncertainty makes no difference to the effects of an increase in price on input demand and the output.

5. In the certainty case, a rise in the price of an input induces the firm to lower the utilization of that input. In the uncertainty case, however, we need a well-behaved production function to ensure this result. Furthermore, under uncertainty the effects on the demand for the other input are unpredictable.



References 




Arrow, K. J. Aspects of the Theory of Risk Bearing. Helsinki: Jahnssonin Saatio, 1965.
De Groot, M. H. Optimal Statistical Decisions. New York: McGraw-Hill, 1970.
Hadar, J. Mathematical Theory of Economic Behavior. Reading, Mass.: Addison-Wesley, 1971.
Henderson, H., and Quandt, R. E. Microeconomic Theory: A Mathematical Approach. 2d ed. New York: McGraw-Hill, 1971.
Horowitz, I. Decision Making and Theory of the Firm. New York: Holt, Rinehart & Winston, 1970.
Leland, H. E. “Theory of the Firm Facing Uncertain Demand.” A.E.R. (June 1972), pp. 278-91.
McCall, J. J. “Competitive Production for Constant Risk Utility Functions.” Rev. Econ. Studies 34 (October 1967): 417-20.
-----. “Probabilistic Microeconomics.” Bell J. Econ. and Management Sci. (Autumn 1971), pp. 403-33.
Pratt, J. W. “Risk Aversion in the Small and in the Large.” Econometrica 32 (January/April 1964): 122-36.
Rothenberg, T. J., and Smith, K. R. “The Effect of Uncertainty on Resource Allocation in a General Equilibrium Model.” Q.J.E. 85 (August 1971): 440-59.
Rothschild, M., and Stiglitz, J. E. “Increasing Risk: I. A Definition.” J. Econ. Theory 2 (September 1970): 225-43.
-----. “Increasing Risk: II. Its Economic Consequences.” J. Econ. Theory 3 (March 1971): 66-84.
Sandmo, A. “The Effect of Uncertainty on Saving Decisions.” Rev. Econ. Studies 37 (July 1970): 353-60.
-----. “Competitive Firm under Price Uncertainty.” A.E.R. (March 1971), pp. 65-73.
-----. “Discount Rates for Public Investment under Uncertainty.” Internat. Econ. Rev. 13 (June 1972): 287-302.
Smith, K. R. “The Effect of Uncertainty on Monopoly Price, Capital Stock and Utilization of Capital.” J. Econ. Theory 1 (June 1970): 48-59.
Tisdell, C. A. The Theory of Price Uncertainty, Production and Profit. Princeton, N.J.: Princeton Univ. Press, 1968.



Lifetime Earnings in a Professional Labor 
Market: Academic Economists 


George E. Johnson and Frank P. Stafford

University of Michigan


It is hypothesized that persons who are more able and who enter the
profession earlier in life purchase, through foregone earnings, more
training options upon receipt of the Ph.D. and hence have more precipitous
earnings profiles than those who are less able and who enter later.
By using National Science Foundation Register data from 1964 to 1970,
earnings of economists and other academics are compared by age cohort
both cross-sectionally and over time. Defining junior and senior faculty
as the two inputs producing instructional services and assuming a
homothetic production function, a substitution elasticity of about five
is estimated.


I. Introduction

Analysis of the determinants of the level and structure of academic
earnings is interesting both in its own right and for what it reveals about
the behavior of professional labor markets. This paper focuses principally
on the question of differences in the pattern of earnings over the work
cycle of academics. In particular, we are interested in the way certain
indirect measures of “quality” and of the time path of investments in
human capital influence earnings at various levels of experience. The
empirical analysis of this paper is primarily limited to academic economists,
but most of our major findings have been verified in other
disciplines. The data sources are the 1964 and 1970 National Registers


This research was sponsored by the National Science Foundation (GS-31639) and in
cooperation with the American Association of University Professors. We would like to
thank Ronald Oaxaca, Sherwin Rosen, T. W. Schultz, and P. O. Steiner for comments
on earlier versions of the paper. Opinions expressed in this paper are the authors’. We
would like to thank William Hershey and Robert Gitter for providing able assistance in
this project. Operation with the large and varied data sets used in this paper was greatly
facilitated by use of a computer software language, Micro, developed by the Labor
Market Information Systems Project, directed by Malcolm Cohen.







collected by the National Science Foundation (NSF),¹ and while the
Register data have already been used to analyze the earnings of economists
(Tolles and Melichar 1968), the orientation of their research was
much broader in coverage and did not focus on the questions with which
we deal.

The time period of our analysis corresponds to a slowing down in the
growth rate of faculty salaries, as has been demonstrated by American
Association of University Professors (AAUP) studies.² Economists
experienced a similar though less pronounced downturn toward the latter
part of the period, with average price-adjusted growth rates by rank
falling off from over 3 percent in 1965-66 and 1966-67 to only minor
growth in 1968-71 (Boddy 1971). Moreover, to the extent that rates of
promotion also slowed down, the relation between years of experience
and earnings would have been lessened to a greater degree than suggested
by rank-specific salary changes. The impact of this slowed rate of growth
of salaries will be reflected in our 1970 data but will probably have a
greater effect on the structure of earnings at some later date. Yet there
were substantial changes in the structure of earnings which were already
present in mathematics and physics by 1970.

The plan of the paper is as follows. In Section II we develop the basic
model of the determinants of lifetime academic salaries and estimate its
parameters on the 1970 data set. In Section III, we add an indirect
measure of quality differences among graduates (whether or not the
respondent attended a highly ranked graduate school) and estimate its
effect on academic salary at various stages of the work cycle. Section IV
characterizes lifetime earnings of economists in different types of
institutions. Section V presents comparisons of earnings between economists
and practitioners of a selection of other disciplines, and Section VI offers
a summary of our major findings and their implications.


II. Effect of Professional Experience on the Salaries of Academic 
Economists 

Although it is widely recognized that actual labor-market experience is
a much more important determinant of earnings capacity than age and
is often more important than formal schooling, most microdata sets
contain age but not years of actual experience in present occupation.
This requires that years of potential labor-force experience, defined as age


¹ We would like to thank James Brown of the National Science Foundation for his
support in providing these data from the National Register Files.

² See American Association of University Professors (1972). There it is shown that the
real rate of growth of all faculty salaries declined as follows: 1966-67, 3.2 percent;
1967-68, 1.4 percent; 1968-69, 1.1 percent; 1969-70, -0.6 percent; 1970-71, -1.[?]
percent.






Fig. 1. $T_1$ = years worked by late starter; $T_2$ = years worked by early starter;
A = earnings of late starter; B + C = earnings of early starter.

minus estimated age at graduation from school, be employed as a
proxy for actual experience. For most professional occupations this is
not an entirely satisfactory procedure. For a variety of reasons (e.g.,
military service, childbearing, indecision) individuals get started on their
professional careers at many different ages, and the skills acquired in past
pursuits are usually not transferable to their professions. The NSF data
do include the respondent’s report of years of professional experience in
present occupation, and this will allow us to estimate directly, and
possibly more accurately, the relation between earnings and experience.

There are theoretical reasons for expecting the observed relation
between earnings and experience to vary systematically with the age at
which individuals begin their professional experience. First, one way in
which a professional forms human capital is to start out in a job in which
he can learn a great deal about his trade. Jobs with a relatively low
training component must pay relatively high salaries, for an individual
who takes one of these is presumably sacrificing future earnings (Mincer
1962; Rosen 1972). A late starter in a particular occupation will have less
time than an early starter to capture the future gains from holding a job
with a high training component and thus will be less likely to take such a








job. This argument is summarized geometrically in figure 1. A late starter
expects to work $T_1$ years, the early starter $T_2$ years. While the late
starter is willing to sacrifice the future earnings represented by B in order
to get A, the early starter finds it to his advantage to sacrifice A in order
to get B plus C in the future. In other words, the late starter has less time
remaining to make use of acquired skills, and thus the discounted present
value of marginal benefits is lower (Ben-Porath 1967; Stephan 1971).

Utilizing the years-of-reported-experience variable, we were able to
infer that individuals’ definitions of “years of professional experience”
varied considerably. For example, if we examine the relation between
years of professional experience and age at which highest degree was
attained, we find that some people claimed many more years of professional
experience than elapsed years since Ph.D. (highest degree).
Now for some individuals, this difference may truly reflect professional
experience, for example, a year or two spent completing the finishing
touches on a dissertation while fully employed. On the other hand, for
others these years probably reflect little more than time since receipt of a
B.A.: any labor-force experience no matter how remotely pertinent to
current profession. To the extent that the latter is true, this predegree
experience would result in their being the “late starters” portrayed in
figure 1. Whether or not the predegree experience reflects primarily
elapsed time or true professional experience is thus an empirical question.
To proceed in our investigation, we first estimated a simple quadratic in
years of reported experience for academic economists who have received
the doctorate and worked full time.³ We estimated the parameters of the
following basic model:

$$\ln SAL = \alpha_0 + \alpha_1 X + \alpha_2 X^2 + \alpha_3 FEM + \varepsilon,$$

where SAL = 9-10 month academic salary, X = years of reported professional
experience, and FEM = 1 if female, 0 otherwise.

From previous research the expectation is that $\alpha_1 > 0$ and both $\alpha_2$
and $\alpha_3 < 0$. The empirical results for 3,040 academic economists with
this specification are given as

$$\ln SAL = \underset{(0.0013)}{0.0333}\,X - \underset{(0.000033)}{0.000518}\,X^2 - \underset{(0.018)}{0.16?}\,FEM + \underset{(0.0097)}{4.727} \tag{1}$$

$R^2 = 0.363$, standard error of estimate = 0.209.

³ For an extensive justification of the use of a quadratic in experience to explain the
logarithm of earnings, see Mincer (1973).

⁴ The salary of females relative to males is thus $SAL_F/SAL_M = e^{\alpha_3}$, which is less than
one if $\alpha_3 < 0$, and the proportionate wage disadvantage of females is
$(SAL_M - SAL_F)/SAL_M = 1 - e^{\alpha_3}$.

To test the hypothesis that much of the experience of some respondents
is predegree activity of nominal value in current occupation, we
disaggregated total reported experience through use of the year-of-Ph.D.
variable. The model then was

$$\ln SAL = \beta_0 + \beta_1 XPO + \beta_2 (XPO)^2 + \beta_3 XPR + \beta_4 (XPR)^2 + \beta_5\, XPO \times XPR + \beta_6 FEM + \varepsilon,$$

where XPO = postdegree experience and XPR = predegree experience
(X = XPO + XPR). Our expectation is that $\beta_1, \beta_3 > 0$; $\beta_1 > \beta_3$;
$\beta_2, \beta_4 < 0$; $\beta_5 < 0$; and that $\beta_1 + 2\beta_2 > \beta_3 + 2\beta_4$ over the range of
0-10 years of XPO and XPR (89 percent have XPR of 10 years or less).
The results for the specification with reported experience disaggregated
as we have suggested are

$$
\begin{aligned}
\ln SAL ={}& \underset{(0.0013)}{0.0441}\,XPO - \underset{(0.000036)}{0.000771}\,(XPO)^2 + \underset{(0.0021)}{0.0098}\,XPR - \underset{(0.000091)}{0.000182}\,(XPR)^2 \\
& - \underset{(0.000100)}{0.000656}\,XPO \times XPR - \underset{(0.016)}{0.139}\,FEM + \underset{(0.009)}{4.735}
\end{aligned}
\tag{2}
$$

$R^2 = 0.457$, standard error of estimate = 0.193.

The null hypothesis that disaggregation of total experience into
predegree and postdegree experience does not add to the explanation of
salaries is decisively rejected (F = 175.0), and we employ this functional
form throughout the remainder of the paper.⁵
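The reported F-statistic can be cross-checked from the two R² values alone. The following is a minimal sketch, assuming that equations (1) and (2) were both fitted to the same 3,040 observations, that equation (2) has seven estimated parameters including the intercept, and that aggregating experience imposes three restrictions on equation (2) (equal linear terms, equal squared terms, and an interaction coefficient equal to twice the squared-term coefficient). The function name and argument names are illustrative and do not come from the paper.

```python
def f_stat_from_r2(r2_restricted, r2_unrestricted, n_obs, n_params, n_restrictions):
    """F-statistic for nested OLS models with the same dependent variable.

    n_params counts the coefficients of the unrestricted model,
    including the intercept.
    """
    numerator = (r2_unrestricted - r2_restricted) / n_restrictions
    denominator = (1.0 - r2_unrestricted) / (n_obs - n_params)
    return numerator / denominator


if __name__ == "__main__":
    # R^2 of equation (1) vs. equation (2); sample size as reported in the text.
    f_value = f_stat_from_r2(0.363, 0.457, n_obs=3040, n_params=7, n_restrictions=3)
    print(round(f_value, 1))
```

Under these assumptions the statistic comes out at roughly 175, in line with the value quoted in the text.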

The results of equation (2) show that the rate of salary increase is more
than four times larger for postdegree experience than for predegree
experience (0.0441 in comparison to 0.0098) at 0 years of each type of
experience. The substantial (negative) interaction term between years
of post- and predegree experience implies that the longer the period of
predegree experience, the flatter the postdegree experience profile. This
result is fully consistent with our early starter-late starter distinction
represented in figure 1.⁶ In addition to disaggregated reported experience,
we also introduced a variable representing age at receipt of Ph.D. While

⁵ Since the different profile shapes for early and late starters are critical, we also
disaggregated our sample into five XPR categories: 0, 1-2, 3-5, 6-10, 11+. This test
provided the same information about profile shapes as eq. (2), and a test on this
disaggregation yielded F = 162.1. The basic result in eq. (2) obtains in all other disciplines
we studied: physics, anthropology, mathematics, sociology, and biology.

⁶ Equation (2) was estimated for a subgroup of professional economists for whom
predegree experience would be expected to be more important, namely, business
economists. (These were excluded from our previous analysis as mentioned in n. 9.)
The corresponding coefficients of 0.0407 on XPO and 0.0154 on XPR for this group,
compared to 0.0441 and 0.0098 in eq. (2), give some additional support to the notion
that pre- and postdegree experience represent different rates of professional development
in different academic areas. It should be noted at this point that the coefficient on FEM
declines when experience is disaggregated because women have more years of predegree
experience. For a more complete discussion of this and related issues, see Johnson and
Stafford (1974).





TABLE 1

Predicted Salaries of Academic Economists by Years of Post-Ph.D. Experience for
Selected Levels of Pre-Ph.D. Experience

                                          XPO
XPR                    0        5       10       15       20       25       30       35
0                 11,390   13,920   16,380   18,550   20,210   21,180   21,360   20,730
5                 11,910   14,330   16,580   18,470   19,790   20,410   20,250   19,330
10                12,350   14,610   16,640   18,230   19,210   19,490   19,020   17,860
SAL(0)/SAL(10)     0.922    0.953    0.985    1.018    1.052    1.087    1.123    1.161
                 (0.013)  (0.011)  (0.010)  (0.011)  (0.014)  (0.019)  (0.025)  (0.031)

Note: Standard errors of the estimated values of SAL(0)/SAL(10) are given in
parentheses. The formula used to calculate them is not legible in the source.


the estimated coefficient on this variable was on the margin of statistical
significance, it was of small quantitative importance and did not hold in
other subsamples in our analysis. Accordingly, we adhere to our simpler
specification in equation (2).

The economic significance of the results in equation (2) can be inferred
by noting that the mean years of predegree experience is 5.3 years in our
sample of full-time Ph.D. academics and has a standard deviation of 4.1
years.⁷ Using these figures as the basis for approximate comparisons in
table 1, we have presented profiles for individuals with different years of
predegree experience at 5-year intervals of postdegree experience. A
person with XPR = 0 would be typically someone who went straight
through graduate school and began at about age 25, whereas a person
with XPR = 10 would have entered the profession at, say, age 35 with a
Ph.D. The observed starting-salary differential in favor of the late starter
of $900 per year can be interpreted as reflecting his choice of lower
training options relative to the early starter; however, the accumulated
learning differentials eventually result in the early starter's overtaking
the late starter (at XPO = 10-15) and surpassing him in earnings from
there on.
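The profiles behind table 1 can be regenerated directly from the fitted equation. The sketch below is illustrative only: it assumes that SAL is measured in hundreds of dollars (an inference from the size of the intercept, not something the text states) and uses the coefficients of equation (2) as best they can be read, with the (XPO)² coefficient taken as 0.000771. The names `predicted_salary` and `peak_xpo` are hypothetical helpers.

```python
import math

# Coefficients as read from equation (2). The salary units and the
# (XPO)^2 coefficient are assumptions of this sketch.
B0 = 4.735
B_XPO, B_XPO2 = 0.0441, -0.000771
B_XPR, B_XPR2 = 0.0098, -0.000182
B_INTERACT = -0.000656
B_FEM = -0.139
UNITS = 100.0  # assumed: SAL is recorded in hundreds of dollars


def predicted_salary(xpo, xpr, female=0):
    """Predicted 9-10 month salary in dollars for given experience."""
    log_sal = (B0 + B_XPO * xpo + B_XPO2 * xpo ** 2
               + B_XPR * xpr + B_XPR2 * xpr ** 2
               + B_INTERACT * xpo * xpr + B_FEM * female)
    return UNITS * math.exp(log_sal)


def peak_xpo(xpr):
    """Postdegree experience at which the fitted profile peaks."""
    # Set d(ln SAL)/d(XPO) = B_XPO + 2*B_XPO2*XPO + B_INTERACT*XPR to zero.
    return (B_XPO + B_INTERACT * xpr) / (-2.0 * B_XPO2)


if __name__ == "__main__":
    for xpr in (0, 5, 10):
        row = [round(predicted_salary(xpo, xpr), -1) for xpo in range(0, 40, 5)]
        print(xpr, row)
```

Under these assumptions the late starter (XPR = 10) is still ahead at XPO = 10 and behind at XPO = 15, which matches the overtaking range given in the text.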

The empirical importance of disaggregating experience is summarized
in table 1. Yet the question of interpretation of the difference still remains
open. This is because the age of entry implied in XPR probably relates to
ability as well as to the time period over which benefits are expected to
accrue. One extreme would be to attribute the entire difference in profile
shapes to differences in ability. There is empirical evidence that the more
able students, at least by conventional measures, are those who receive
sufficient stipend support to allow them to study full time rather than

⁷ The distribution is skewed partly because for a few persons the implied negative years
of predegree experience were set to zero.





enroll part time and hold outside employment which prolongs their stay
in graduate school (see Warkov 1965; Mooney 1968). On the other hand,
some students spend extra time completing a thesis which is expected to
have a very high payoff. While there probably are, on net, differences in
ability in favor of early starters, a tentative interpretation is that the
shapes reflect primarily differences in remaining work life and associated
differences in the expected span of remaining labor-force years. In this
regard, it is interesting to see if for the estimated relation in (2) the implied
rate of return from choosing more or less on-the-job training is in line
with plausible discount rates. Suppose the individuals’ abilities were
equally distributed across the implicit age-of-entry variable (XPR) and
that arbitrage took place across the market for training options. Then,
under a discounted lifetime income-maximizing hypothesis, we would
expect the rate of return to choosing the training option path (described
as “early starters” in fig. 1) to equal some reasonable discount rate such
as the rate of return on alternative investments, say, the long-run rate
of return on physical capital, which can be given as 8 percent from studies
of the rate of return in manufacturing (Stigler 1964). From equation (2)
the (marginal) rate of return to choosing the training option for someone
entering the profession at age 30 (after 5 years of predegree experience)
can be calculated as 7.0 percent.⁸ Given an assumed upward shift of 2
percent per annum in both earnings profiles (owing to generalized
productivity growth and ignoring rising marginal taxes), the implied rate
of return to choosing training options would be about 9 percent. While
this is a somewhat hazardous calculation because of unspecified ability
differences, the calculation does suggest that a training-options
interpretation is at least plausible.
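The rate-of-return calculation described above amounts to finding the discount rate at which the discounted earnings gap between two neighboring entry ages sums to zero. The sketch below illustrates that idea with plain bisection. It is not the authors' procedure in detail: the 35-year horizon is an assumption (the source does not give a legible upper limit), the coefficients are those of equation (2) as best they can be read, and salaries are left in the regression's own units since only the sign and relative size of the gap matter. All function names are hypothetical.

```python
import math


def salary(xpo, xpr):
    """Fitted salary from equation (2), males, in the regression's units."""
    log_sal = (4.735 + 0.0441 * xpo - 0.000771 * xpo ** 2
               + 0.0098 * xpr - 0.000182 * xpr ** 2
               - 0.000656 * xpo * xpr)
    return math.exp(log_sal)


def npv_gap(rate, xpr_early=4, xpr_late=6, horizon=35):
    """Discounted sum of the earnings gap between earlier and later starters."""
    return sum((salary(t, xpr_early) - salary(t, xpr_late)) / (1.0 + rate) ** t
               for t in range(horizon + 1))


def internal_rate(lo=0.0, hi=1.0, tol=1e-10):
    """Bisection for the rate at which npv_gap crosses zero on [lo, hi]."""
    f_lo, f_hi = npv_gap(lo), npv_gap(hi)
    if f_lo * f_hi > 0:
        raise ValueError("npv_gap does not change sign on the interval")
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        f_mid = npv_gap(mid)
        if f_lo * f_mid <= 0:
            hi = mid            # root lies in [lo, mid]
        else:
            lo, f_lo = mid, f_mid  # root lies in [mid, hi]
    return (lo + hi) / 2.0


if __name__ == "__main__":
    print(f"implied rate of return: {internal_rate():.4f}")
```

The earlier starter gives up earnings in the first dozen or so years and gains afterward, so the gap changes sign once and the bisection bracket is well defined. The result can be compared with the 7.0 percent quoted in the text, bearing in mind that it depends on the assumed horizon.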

Utilizing the earnings function given above, we rely on experience-earnings
profiles as estimated from cross-sectional data to investigate the
role of graduate school quality on lifetime earnings, to compare economics
with other disciplines over time, and to develop other topics. Given this
use of cross-section data, it is well to keep in mind some of the limitations
of such approximations to the lifetime profiles. Not only will the profiles
shift up over time (as we will discuss in Section V), but there may be
considerable heterogeneity of individuals distributed by age (and hence
by experience) as a result of factors such as promotion policy. To

⁸ The rate of return to choosing the training options associated with the typical
individual with XPR = 5 was calculated by finding the discount rate (r) which satisfies

$$\sum_{XPO=0} \left[ SAL(XPO, XPR = 4) - SAL(XPO, XPR = 6) \right] (1 + r)^{-XPO} = 0.$$

⁹ Younger economists as of 1970 (say, with less than 10 years experience on the job)
had acquired a knowledge of theory and econometrics which, by and large, economists
with, say, 20 years of postdegree experience did not acquire. If such skills are earning
quasi rents as of 1970, our profiles will generally be too flat relative to a long-run
equilibrium profile.





TABLE 2

Predicted Salaries of Ph.D. Economists by School of Degree and Years of
Postdegree Experience

                          Years of Postdegree Experience*
School           0        5       10       15       20       25       30       35
Chicago     12,610   15,490   18,290   20,770   22,690   23,830   24,070   23,370
Columbia    11,230   14,280   17,230   19,720   21,410   22,070   21,580   20,020
Cornell     11,820   14,060   16,200   18,090   19,580   20,520   20,850   20,520
Harvard     11,690   14,670   17,500   19,850   21,390   21,920   21,350   19,760
Yale        10,720   14,590   18,340   21,300   22,860   22,680   20,780   17,600

* Years of predegree experience assumed to be 5.


emphasize these kinds of difficulties, we have estimated, for graduates of
several individual schools, regressions of earnings on our experience and
other variables in the form of equation (2) and have presented predicted
salaries from these regressions (table 2). While the estimated profiles for
most of the schools look reasonable, the Yale profile is curious, peaking
out at 20 years of postdegree experience. In part this reflects the fact that
Yale graduates have had very few years of predegree experience (2.5 years),
and to fit the early (steep) part of the profile the quadratic form imparts
some distortion to the end of the experience-earnings profile.

III. Effect of Quality of Graduate School on Salaries 

Although it is customary to assume that labor is homogeneous, it is
commonly recognized that members of particular occupations vary widely
with respect to ability, and this is true of professional labor markets as
well. In this section we investigate how one proxy for ability, the “quality”
of the graduate school attended, influences lifetime earnings of academic
economists. It would be desirable, of course, to have direct measures of
each individual’s productivity (e.g., number and quality of publications,
teaching effectiveness, and the like), but these are difficult to obtain for a
large sample survey and are not available in the NSF data set. The NSF
data do include the name of the graduate school the respondent attended,
so we were able to match this information with the periodic surveys of the
effectiveness of graduate programs. We then constructed a one-zero
dummy variable denoting receipt of doctorate from a highly ranked
graduate school, in particular the top 10 (RNKD). The influence of this

¹⁰ Our procedure was as follows: Those respondents who received their doctorates
prior to 1940 were assigned the ranking from the Hughes study of 1920; for graduates
in the 1940-59 period, the rankings of the 1957 Keniston study; and for 1960-70 graduates,
the 1964 Cartter ranking. In this way the individual’s rank depends on both the school
attended and his vintage. Those schools with top-10 ranks were combined into a single
dummy variable, RNKD. The figure 10 was chosen arbitrarily, determined in part by
the availability of data in other academic professions. These data are found in Cartter
(1966).








variable can be attributed to two potentially separate factors, the
effectiveness of the graduate program itself and the selectivity of the program with
respect to admissions. Although this raises a very interesting question
in itself, there is no conceivable way we could identify the individual
influence of each factor with the present data set, and we thus employ
RNKD primarily to reduce the heterogeneity of the sample.

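Introduction of the quality of graduate school variable yielded the
following results:

$$
\begin{aligned}
\ln SAL ={}& \underset{(0.0013)}{0.04??}\,XPO - \underset{(0.000036)}{0.000???}\,(XPO)^2 + \underset{(0.0021)}{0.0111}\,XPR - \underset{(0.000090)}{0.000203}\,(XPR)^2 \\
& - 0.000679\,XPO \times XPR - 0.136\,FEM + 0.057\,RNKD + \underset{(0.009)}{4.717}
\end{aligned}
\tag{3}
$$

$R^2 = 0.468$, standard error of estimate = 0.192.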

This indicates that receipt of a doctorate from a graduate school ranked
in the top 10 increases an economist’s 9-month salary by an average of
about 6 percent.¹¹ Of greater interest are the results in which RNKD is
interacted with XPO and (XPO)². These are

$$
\begin{aligned}
\ln SAL ={}& \underset{(0.0015)}{0.0362}\,XPO - \underset{(0.000043)}{0.000583}\,(XPO)^2 + \underset{(0.0021)}{0.0095}\,XPR - \underset{(0.000089)}{0.000170}\,(XPR)^2 \\
& - \underset{(0.000099)}{0.000578}\,XPO \times XPR - \underset{(0.016)}{0.133}\,FEM - \underset{(0.015)}{0.059}\,RNKD \\
& + \underset{(0.0024)}{0.020}\,RNKD \times XPO - \underset{(0.000074)}{0.00051}\,RNKD \times (XPO)^2 + \underset{(0.013)}{4.756}
\end{aligned}
\tag{4}
$$

$R^2 = 0.481$, standard error of estimate = 0.189.

These results suggest that there are sharp differences in the shapes of the
salary/experience profiles between economists who received their graduate
training from the top 10 graduate schools and those economists who
attended less prestigious graduate schools. The predicted starting salaries
of the latter are over 5 percent greater than the starting salaries of the
former, but the rate of growth of salaries is much greater in the early
stages of the work cycle for graduates of the top 10 schools than for
others.¹² The same pattern has been found in physics, anthropology,
sociology, and mathematics. In biology this pattern was not upheld, and
alternate measures of school quality (e.g., 1-0, whether graduated from
Berkeley or Harvard) did not yield any indication that graduate school
attended influenced lifetime earnings. The results for economists are

¹¹ The percentage wage advantage accruing to graduates of ranked schools is
$100 \times (e^{b} - 1)$, where b is the coefficient on RNKD in the regression explaining log SAL.

¹² The F-statistic on the two interaction terms is 37.7, which is highly significant.





TABLE 3

Predicted Salaries of Academic Economists for Ph.D.’s by Rank of Graduate
School and by Years of Postdegree Experience (in Dollars)

                             Years of Postdegree Experience*
School Rank          0        5       10       15       20       25       30       35
Top-10 rank     11,460   14,550   17,490   19,910   21,460   21,900   21,160   19,360
Other           12,150   14,140   15,980   17,540   18,700   19,350   19,460   19,000
Top/other†       0.943    1.029    1.091    1.135    1.148    1.131    1.087    1.019
               (0.015)  (0.010)  (0.011)  (0.013)  (0.014)  (0.015)  (0.017)  (0.022)

* Years of predegree experience assumed to be 5.
† Estimated standard error of ratio in parentheses.


portrayed in table 3 in which predicted salaries for XPR = 5 (i.e.,
starting age = 30) are shown. The salary ratio of an individual with 25
years of postdegree experience (in this case about age 55) to a new entrant
is 1.91 for graduates of schools ranked in the top 10 versus 1.59 for
graduates of other schools. While the RNKD variable does interact
with postdegree experience terms, it does not interact substantially with
the predegree experience terms (F = 2.1). This is not surprising, for the
payoff to a more elaborate graduate program is likely to accrue to what
one does after completion of the degree.
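The top/other salary ratio in table 3 follows from the RNKD terms of equation (4) alone, because every other term cancels when the two groups are compared at the same experience. The sketch below is a hedged illustration: the coefficient on RNKD (-0.059) is legible in the source, but the two interaction coefficients are only partly legible and are taken here as 0.020 and -0.00051, which are assumptions chosen to be consistent with the ratio row of table 3. The function names are hypothetical.

```python
import math

B_RNKD = -0.059          # coefficient on RNKD, as read from equation (4)
B_RNKD_XPO = 0.020       # assumed: RNKD x XPO coefficient (partly legible)
B_RNKD_XPO2 = -0.00051   # assumed: RNKD x (XPO)^2 coefficient (partly legible)


def top10_to_other_ratio(xpo):
    """Ratio of predicted salaries, top-10 graduates over all others."""
    return math.exp(B_RNKD + B_RNKD_XPO * xpo + B_RNKD_XPO2 * xpo ** 2)


def crossover_xpo():
    """Earliest postdegree experience at which the two profiles cross."""
    # Solve B_RNKD + B_RNKD_XPO*x + B_RNKD_XPO2*x^2 = 0 for the smaller root.
    a, b, c = B_RNKD_XPO2, B_RNKD_XPO, B_RNKD
    disc = math.sqrt(b * b - 4.0 * a * c)
    return min((-b + disc) / (2.0 * a), (-b - disc) / (2.0 * a))


if __name__ == "__main__":
    for xpo in range(0, 40, 5):
        print(xpo, round(top10_to_other_ratio(xpo), 3))
    print("crossover at XPO =", round(crossover_xpo(), 1))
```

With these values the ratio starts below one, crosses one after a little more than three years, and is largest at about 20 years of postdegree experience, which is the pattern the table shows.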

From our comparison of postdegree experience profiles of Ph.D.
graduates from ranked and other programs, we infer that the differential
shapes reflect the fact that more highly trained and/or able persons engage
in on-the-job training more extensively than less able or less highly
trained individuals. Since this inference is wholly dependent on the shape
of the earnings profiles, we ran separate regressions for individuals who
attended schools ranked 1-5, 6-10, 11-16, and all others as an additional
piece of evidence on the relation between a partial measure of skill and the
life-cycle path of earnings. The results indicate that the salary/experience
profiles get progressively less steep from the 1-5 group to the unranked
schools. We will not report the regression results (which are based on the
form of eq. [2]), but for individuals who began their professional
experience at about age 30 (XPR = 5), the estimated ratio of a person with
20 years of postdegree experience to a new entrant is 1.91 for the 1-5-ranked
schools, 1.79 for 6-10, 1.70 for 11-16, and 1.57 for those ranked 17
or lower. At the same time, the average academic salary differentials
between graduates of the most and least prestigious institutions are not
very large. As table 3 shows, Ph.D. graduates from the top 10 schools
start out at a salary (6 percent) less than graduates from schools other
than the top 10.¹³ Salaries are roughly equal at about 3 years of professional

¹³ A similar result has been noted by Francis M. Boddy in his annual surveys of the
larger (Ph.D.-granting) departments (Boddy 1971). A widening experience differential
across ability groups is reported in Hause (1971).






experience, and the differential slowly increases to a 15 percent advantage
after 20 years of experience. The average percentage salary advantage
accruing to graduates from the top 10 universities is about 8 percent over
a working life. Because the largest relative benefits come relatively late,
however, the relative present values at the time of entry to the work force
are closer than is implied by this 8 percent figure.

One interesting comparison is between the results we have portrayed
and those reported in the Tolles and Melichar volume (Tolles and
Melichar 1968). Using 1966 median salary and reported years of
experience, the estimated ratio of peak to beginning salary is about 1.67 for
Ph.D.’s in academic employment. While this generally compares to the
average for our 1970 data, according to our profiles for XPR = 5, the
disaggregation of academically employed into different values of RNKD
and XPR provides a measure of the extent to which profiles vary in terms
of shape. At one extreme there are early starters (XPR = 0) from
schools ranked 1-5 who have a ratio of peak to entry salary of 2.14, and
at the other extreme late starters from schools ranked 17 or lower who have a
ratio of peak to entry salary of 1.48. That is, the various parts of the
academic labor market for Ph.D.’s vary widely with respect to the extent
to which there is a strong dependence between years of experience and
salary as well as large differentials by degree level.¹⁴

IV. Determinants of Academic Salaries in Different Types of
Institutions

To a certain extent the differential effects of RNKD on salaries may be
attributed to differences in type of employment between economists
from ranked and unranked graduate schools. It is also well known that
salaries at such institutions are relatively low for lower ranks and
relatively high for higher ranks. Hence adjusting for the type of institution
at which the individual is currently employed may explain a large part of
the relation between salaries and graduate school rank. A human-capital
interpretation of academic earnings would predict that there would be
systematic differences in the earnings structure between institutions which
specialize primarily in teaching versus those which specialize in research.
The argument is that research and thesis-supervision skills generally
require a more extended period to develop than teaching skills so that
schools with a research orientation will have earnings profiles which fan

¹⁴ The differential between Ph.D. and master’s degree holders is 11 percent at 0 years
of experience and rises to 51 percent at 23 years of experience for graduates of top-ranked
schools. The corresponding figures are 33 and 32 percent, respectively, for graduates of
other programs. The rise in this differential by years of experience provides additional
evidence on the life cycle of acquired skill and earnings wherein “more able” individuals
train for longer periods of time and thus realize a greater experience (and age)
dependency to their salaries.



[Table 4. Regression results by type of institution. The table contents are not recoverable from the source.]









out in comparison with schools which emphasize teaching. To see if this
is the case, we created six subsamples of academic economists: (1) those
Ph.D. economists who are employed by the schools in the top 20 of the
1964 Cartter rank, (2) a selection of 34 small private colleges of reputedly
high quality,¹⁵ (3) schools in the Deep South (Alabama, Arkansas,
Louisiana, Mississippi), (4) schools on the Great Plains (the Dakotas,
Wyoming, Nebraska, Kansas, Iowa, and Oklahoma), (5) all schools in
the University of California system, and (6) all schools in the California
State College system.

The regression results for each of these subsamples (as well as eq. [2]
renumbered as [5] for comparative purposes) are presented in table 4.
We have also reported the means of the variables by school type in table 5
and predicted salaries by years of postdegree experience in table 6. There
are several notable results.

First, there is a wide range in the values of the estimated parameters
on the postdegree experience variables. The estimated coefficient on XPO,
which is the rate of growth of salaries in cross-sectional terms at XPO = 0,
ranges from 0.020 for the Deep South subsample to 0.055 for the top-20
subsample. Accordingly, the estimated ratio of salaries of individuals with
XPO = 20 to those with XPO = 0 varies from 1.23 for the Deep South to
2.00 for the top 20. Substantially the same pattern across school types was
obtained in the other disciplines for which we developed such comparisons
(biology, mathematics, and physics) and over time (1964 and 1970). To some
extent this difference in profile steepness across schools reflects
selective-retention policies with respect to junior faculty and is another example
of cohort heterogeneity mentioned in Section II. Yet (as developed
below) we know that differences in profile steepness relate to ranking of
graduate school, and graduate school attended influences type of school
of employment. As argued in our interpretation of graduate school
quality, the profile shapes represent differences in the life cycle of learning
on the job and renting out one’s accumulated skills.

Second, the estimated coefficient on RNKD is not significantly different
from zero in each of the six subsamples; indeed, the point estimates are
negative in four of the six. However, in the school groups with a large
fraction of individuals who hold Ph.D.’s from the top graduate schools,
the estimated experience-earnings profiles are much steeper than in the
schools which generally hire from other than the top 10 graduate schools.
These results suggest that the principal reason for the positive value of
$\partial^2(\log SAL)/\partial(XPO)\,\partial(RNKD)$ observed in equation (4) is that

* Amherst, Antioch, Bates, Beloit, Bowdoin, Bryn Mawr, Carleton, Colgate,
Connecticut College, Davidson, Denison, Furman, Goucher, Hamilton, Haverford,
Harpur, Kalamazoo, Middlebury, Mills, Mt. Holyoke, Oberlin, Occidental, Reed,
Rollins, Sarah Lawrence, Smith, Stephens, Swarthmore, Trinity, Wellesley, Wesleyan,
Williams, Wooster, and Vassar.





TABLE 5

Means of Variables in 1970 Subsamples

               Entire    Top     Small    Deep    Great   University of  California
Variable       Sample   Twenty  Private   South   Plains   California      State

SAL            15,890   18,560   15,980  14,750   14,830     16,270        15,300
INC*           20,570   24,5??   19,070  19,020   19,950     22,170        19,020
RES†             0.60     0.82     0.56    0.51     0.54       0.92          0.55
XPO               9.7     11.0     11.1     9.2      8.8        9.7           9.7
XPR               4.5      2.5      3.9     6.4      4.3        2.2           5.3
FEM               .05      .03      .03     .02     0.00        .02           .13
RNKD              .33      .65      .50     .05      .08        .70           .25
Sample size     3,040      511       66     123       74        109            64

* Total annual professional income.
† Proportion reporting research as a major work activity.







TABLE 6

Predicted 9-Month Academic Salaries of Economists for Selected Groups of Schools, 1970

       Entire    Top     Small    Deep    Great   University of  California
XPO    Sample   Twenty  Private   South   Plains   California      State

 0     11,770   11,680   11,230  13,310   11,800     11,520        10,870
 5     14,090   14,850   13,260  14,520   13,950     14,340        13,900
10     16,250   18,060   15,070  15,460   15,870     17,200        16,750
15     18,050   21,010   16,490  16,080   17,410     19,910        19,050
20     19,310   23,370   17,380  16,330   18,400     22,230        20,430
25     19,890   24,870   17,640  16,190   18,730     23,940        20,660
30     19,750   25,320   17,230  15,680   18,370     24,860        19,710
35     18,880   24,650   16,200  14,840   17,360     24,910        17,730

Note: XPR = 0.


graduates of the most prestigious schools are more likely to receive
appointments in universities with a research orientation rather than at
schools which emphasize teaching.

Third, the results for the school types are also useful in interpreting
the breakdown of total experience into its pre- and postdegree components.
The estimated coefficients on XPR and XPR² in the six subsamples,
like those on RNKD, vary rather widely, but on average the
estimated effect on SAL in the subsamples is smaller than for the entire
sample. Notice that across the subsamples the mean value of XPR is
inversely related to the steepness of the estimated age-earnings profile.
One possible explanation of these results is that individuals who have a
large amount of predegree experience are often those who had difficulties
finishing their doctoral dissertations and took jobs at schools which
specialize in teaching prior to completion of their degrees. Although they
eventually finish their degrees, they still specialize in teaching (because








they probably were not strongly attracted to research in the first place),
so that their research abilities, and hence salaries, did not grow very
quickly. If this interpretation is correct, then the XPR variable, like the
RNKD variable, reflects factors by which individuals are sorted into
employment at different types of institutions.*

Fourth, the average values of RNKD and XPR for the small-private
subsample are fairly close to those for the top-20 sample. Using RNKD
and XPR as proxies for ability, the differences in the level and shape of
earnings profiles between the two can be taken as indicative of lifetime
earnings differences between those who choose to emphasize research
versus those who emphasize teaching. The estimated salary advantage of
individuals in the top 20 over those from the small-private group starts
out at 4 percent at XPO = 0 and rises monotonically to 52 percent at the
time of retirement. There are, of course, a number of difficulties with
interpreting this differential as solely attributable to differences in research
behavior (e.g., nonpecuniary advantages and disadvantages at both sets
of schools at different stages of the work cycle), but we expect that the
differential would behave qualitatively in this fashion.
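
As a rough check on the 4 and 52 percent figures, the sketch below recomputes the top-20 advantage over the small-private group from the predicted salaries in table 6. The salary values are as read from the table (a few digits are hard to make out in the scan, so they should be treated as assumptions), and the function and variable names are purely illustrative.

```python
# Predicted 9-month salaries by years of postdegree experience (XPO),
# as read from table 6. Some digits are uncertain in the scan.
XPO = [0, 5, 10, 15, 20, 25, 30, 35]
TOP_20 = [11_680, 14_850, 18_060, 21_010, 23_370, 24_870, 25_320, 24_650]
SMALL_PRIVATE = [11_230, 13_260, 15_070, 16_490, 17_380, 17_640, 17_230, 16_200]


def percent_advantage(high, low):
    """Percentage by which each salary in `high` exceeds its counterpart in `low`."""
    return [100.0 * (h / l - 1.0) for h, l in zip(high, low)]


advantage = percent_advantage(TOP_20, SMALL_PRIVATE)

for years, pct in zip(XPO, advantage):
    print(f"XPO = {years:2d}: top-20 advantage = {pct:5.1f} percent")
```

On these inputs the differential widens at every step of the profile, which is the qualitative behavior the paragraph describes.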

In the preceding sections we have presented the basic features of lifetime
earnings patterns of academic economists. Qualitatively, the results
obtained for economists were found to hold with virtually no exceptions
in the other disciplines which we have studied, and some comparisons
have been given. We now turn to comparison of the structure of earnings
in economics to that of other disciplines and over time.

V. Earnings in Economics and Other Disciplines, 1964–70

The structure of economists’ and sociologists’ earnings with respect to
postdegree experience was subject to rather minor change from 1964 to
1970, but in biology, mathematics, and particularly in physics there were
substantial changes in the structure of earnings. To make these interfield
comparisons, we estimated equation (2) for five disciplines in both the
1964 and 1970 NSF Register data. The estimated experience-earnings
function of economists shifted up over this 6-year interval by a constant

* These conclusions are supported by the following regression equation
(standard errors in parentheses; [...] marks a coefficient estimate that is
illegible in the scan):

$$TOP20 = \underset{(.017)}{[\ldots]}\,TOP5 + \underset{(.021)}{[\ldots]}\,TOP(6\text{-}10) + \underset{(.027)}{[\ldots]}\,TOP(11\text{-}16) - \underset{(.003)}{.024}\,XPR + \underset{(.0002)}{.0007}\,(XPR)^2 + \underset{(.013)}{[\ldots]},$$

$R^2 = .10$, standard error of estimate = .38.

Here TOP20 = whether employed in top 20, TOP5 = whether attended a school
ranked 1-5, TOP(6-10) = whether attended a school ranked 6-10, and TOP(11-16) =
whether attended a school ranked 11-16. The excluded group is those graduating from
all other schools.




proportionate amount (about 50 percent) for all relevant values of XPO.
In 1964 the predicted 9-month salary of the “typical” economist (XPO =
12 and XPR = 4) was $11,320; in 1970 the predicted salary of this same
individual (XPO = 18 and XPR = 4) was $18,770, an increase of 66
percent over the 6-year period. This 66 percent increase can be
attributed to three sources: (1) price inflation, which was 25 percent
during this period; (2) real wage growth of economists, which was about
20 percent; and (3) a movement up (or down) the cross-sectional
experience-earnings profile. Sources (1) and (2) account for the 50 percent
upward shift in the profile from 1964 to 1970 (1.25 × 1.20 = 1.50),
and this typical individual would be expected to gain 11 percent in salary
due to moving from a postdegree experience level of 12 to 18. Thus, for
the typical academic economist in the beginning period, approximately
0.35 of his money wage gains from 1964 to 1970 were due to real wage
growth for the profession as a whole, 0.45 were due to price inflation, and
0.20 were due to an improved position in the life cycle.
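
The arithmetic of this decomposition can be sketched as follows. The three factors and the two salaries are the figures quoted above. The rule used to split the total into shares (each factor's share of the log change) is an assumption made for illustration, since the text does not state how the 0.35, 0.45, and 0.20 were computed; it happens to land close to them.

```python
import math

# Figures quoted in the text for the "typical" economist.
salary_1964 = 11_320  # XPO = 12, XPR = 4
salary_1970 = 18_770  # XPO = 18, XPR = 4

factors = {
    "price inflation": 1.25,
    "real wage growth": 1.20,
    "life-cycle movement": 1.11,  # moving from XPO = 12 to XPO = 18
}

# The three sources compound multiplicatively.
implied_ratio = math.prod(factors.values())
observed_ratio = salary_1970 / salary_1964

# Assumption: apportion the total by each factor's share of the log change.
total_log = sum(math.log(f) for f in factors.values())
shares = {name: math.log(f) / total_log for name, f in factors.items()}

print(f"implied ratio  = {implied_ratio:.3f}")
print(f"observed ratio = {observed_ratio:.3f}")
for name, share in shares.items():
    print(f"{name:20s} share = {share:.2f}")
```

The implied and observed ratios agree to within rounding of the quoted percentages.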

These results for economists and the other disciplines are presented in
table 7. The percentage rates of salary increase for “typical” members
of each discipline (XPO = 12 in 1964 and XPO = 18 in 1970) were as
follows: sociology, 56; biology, 63; physics, 63; and mathematics, 66.
However, the relative gains of young and old cohorts differ rather
dramatically across the disciplines. When we examine table 7C, the
percentage change of predicted salaries of new entrants (XPO = 0)
varies from 50 percent in economics to 26 percent in physics. Upon
deflating these by the rise in consumer prices, we see that the starting
real salary of Ph.D. economists rose by 20 percent but remained virtually
unchanged for Ph.D. physicists. This differential growth is also reflected
in cohort comparisons. New entrants (XPO = 0) in economics and
sociology realized a rise in nominal salary of 87 percent and 84 percent,
respectively, after 6 years of experience, but new entrants in physics and
mathematics realized a 72 percent and 80 percent rise in nominal salary,
respectively. In contrast, senior scholars (XPO = 24 in 1964) in economics
and sociology realized a rise in nominal salary of 48 and 46 percent,
respectively, after 6 more years of experience, whereas senior scholars in
physics and mathematics realized a 48 and 51 percent rise in nominal
salary, respectively.
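
The cohort comparisons follow one group of people from the 1964 profile to the 1970 profile six years further along, rather than comparing the two profiles at the same XPO. A minimal sketch of that calculation is given below, using predicted salaries as read from table 7 (some digits are uncertain in the scan). The helper names are hypothetical, and because the table entries are rounded, the results can differ from the percentages quoted in the text by a point or so.

```python
# Predicted salaries from table 7, panels B (1964) and A (1970), keyed by XPO.
SALARY_1964 = {
    "economics":   {0: 7_720, 24: 13_340},
    "sociology":   {0: 7_280, 24: 12_250},
    "physics":     {0: 7_540, 24: 13_700},
    "mathematics": {0: 7_700, 24: 13_480},
}
SALARY_1970 = {
    "economics":   {0: 11_560, 6: 14_340, 30: 19_790},
    "sociology":   {0: 10_650, 6: 13_370, 30: 17_730},
    "physics":     {0: 9_470, 6: 12_940, 30: 20_330},
    "mathematics": {0: 10_560, 6: 13_850, 30: 20_400},
}
INFLATION = 1.25  # rise in consumer prices, 1964-70, as stated in the text


def cohort_gain(field, xpo_1964):
    """Nominal percent gain for the cohort with `xpo_1964` years in 1964."""
    later = SALARY_1970[field][xpo_1964 + 6]
    earlier = SALARY_1964[field][xpo_1964]
    return 100.0 * (later / earlier - 1.0)


def real_entry_change(field):
    """Percent change in the real starting salary (XPO = 0), 1964 to 1970."""
    nominal = SALARY_1970[field][0] / SALARY_1964[field][0]
    return 100.0 * (nominal / INFLATION - 1.0)


for field in SALARY_1964:
    print(f"{field:12s} new entrants: {cohort_gain(field, 0):5.1f}%  "
          f"senior scholars: {cohort_gain(field, 24):5.1f}%  "
          f"real starting salary: {real_entry_change(field):5.1f}%")
```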

What factors explain these interfield differences in changes in the
structure of compensation? Although we have a very limited sample (the
data points are very expensive), we can offer some comments on
supply-and-demand influences on the compensation of different experience
cohorts in the fields. In table 8 we have summarized information on


An XPR of 4 was used because it is closer to the mean XPR across the six fields
studied.



TABLE 7

Academic Salaries for Selected Disciplines by Years of Postdegree Experience,
1964 and 1970

A. Predicted Salaries, 1970

Postdegree
Experience   Economics  Sociology  Biology  Physics  Mathematics  Anthropology

 0            11,560     10,650     9,790    9,470     10,560       10,460
 6            14,340     13,370    12,420   12,940     13,850       13,260
12            16,850     15,760    14,850   16,330     16,940       15,980
18            18,770     17,460    16,740   19,030     19,330       18,290
24            19,800     18,150    17,790   20,470     20,560       19,900
30            19,790     17,730    17,830   20,330     20,400       20,580
36            18,750     16,260    16,850   18,640     18,870       20,210

B. Predicted Salaries, 1964

Postdegree
Experience   Economics  Sociology  Biology  Physics  Mathematics

 0             7,720      7,280     6,980    7,540      7,700
 6             9,610      9,010     8,700    9,710      9,800
12            11,320     10,550    10,260   11,660     11,660
18            12,630     11,690    11,460   13,080     12,970
24            13,340     12,250    12,100   13,700     13,480
30            13,350     12,150    12,110   13,390     13,090
36            12,640     11,410    11,420   12,200     11,880

C. Percentage Shift in Estimated Profiles, 1964-70

Postdegree
Experience   Economics  Sociology  Biology  Physics  Mathematics

 0              49.7       46.3      40.3     25.6       37.1
 6              49.2       48.4      42.8     33.3       41.3
12              48.9       49.4      44.7     40.1       45.2
24              48.4       48.2      47.0     49.4       52.5
30              48.2       45.9      47.2     51.8       55.8
36              48.3       42.5      47.5     52.8       58.8


senior/junior salary differentials (relative wages), the proportion of
Ph.D.’s with 14 or fewer years of experience (relative supplies), and the
rate of growth of undergraduate students enrolled (shifts in demand). If
one interprets junior and senior faculty as the two inputs used in producing
instructional services and assumes a homothetic production function, then
by regressing relative supplies on relative wages an estimated substitution
elasticity of 5.6 is obtained.* This suggests that junior and senior faculty

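* The estimated regression equation (standard errors in parentheses) was

$$\log\frac{L(XPO \ge 14)}{L(XPO < 14)} = \underset{(.28)}{2.02} - \underset{(.76)}{5.66}\,\log\frac{SAL(XPO \ge 14)}{SAL(XPO < 14)},$$

where $R^2 = .95$.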









TABLE 8

Senior/Junior Salary Differentials, Relative Supplies, and Enrollment Growth

               Salary (XPO = 24)*    L(XPO ≥ 14)†    % Change in B.A.’s,
Field          Salary (XPO = 6)      L(XPO < 14)          1964-70

Economics            1.38               1.27                66
Sociology            1.36               1.44               177
Biology              1.43               1.04                78
Mathematics          1.48               0.75                48
Physics              1.58               0.64                 6

* Salary estimates from table 7A.
† National Science Foundation, American Science Manpower, table A-[?].


are quite substitutable in the academic production process. Another
possibility is that differences in relative salaries by age are caused by
different rates of growth of demand for different disciplines. When the
growth variable is added to the regression explaining relative demands,
its estimated coefficient is positive but statistically insignificant (t = .41),
and the estimated value of the elasticity of substitution between older and
younger faculty falls only to 4.9. Yet there is certainly a strong simple
correlation between undergraduate demand and relative salaries (r =
−.83).
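
The substitution-elasticity estimate can be approximated from the five rows of table 8 alone. The sketch below is an illustration under stated assumptions, not the authors' computation: it regresses the natural log of relative supplies on the natural log of the relative wage, using the two-decimal values printed in the table, with a hand-written least-squares helper (`ols` is a hypothetical name). Because the inputs are rounded, the slope comes out near −5.6 rather than exactly the reported −5.66.

```python
import math

# (relative wage, relative supply) by field, as printed in table 8:
# SAL(XPO = 24) / SAL(XPO = 6) and L(XPO >= 14) / L(XPO < 14).
TABLE_8 = {
    "economics":   (1.38, 1.27),
    "sociology":   (1.36, 1.44),
    "biology":     (1.43, 1.04),
    "mathematics": (1.48, 0.75),
    "physics":     (1.58, 0.64),
}


def ols(x, y):
    """Simple least squares of y on x; returns (intercept, slope, r_squared)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope, sxy ** 2 / (sxx * syy)


log_wage = [math.log(w) for w, _ in TABLE_8.values()]
log_supply = [math.log(s) for _, s in TABLE_8.values()]

intercept, slope, r_squared = ols(log_wage, log_supply)
print(f"log(relative supply) = {intercept:.2f} {slope:+.2f} log(relative wage)")
print(f"R^2 = {r_squared:.2f}; implied substitution elasticity = {-slope:.1f}")
```

With only five observations the estimate is sensitive to each field, which is consistent with the caution expressed in the text about the very limited sample.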

Another factor to be considered in the differential changes in the salary/
experience profiles is the differential acceleration in Ph.D. output in the
fields, which is measured by the ratio of senior to junior scholars. By this
measure physics and mathematics enjoyed the biggest boom in the 1960s,
and this would encourage rapid salary growth since established scholars
train Ph.D. and master’s students. As a result, the salary structure in
1970 will reflect (1) the large gains realized during the decade by those
in the discipline prior to the early 1960s, coupled with (2) the slack in
demand in the undergraduate sector which lowers the compensation for
new positions. In addition, a large increase in the supply of new Ph.D.’s
should bid up the implicit price for training options (particularly in the
short run) and lower salaries for those in the early career stages. To
summarize, these factors (namely, changes in relative supplies,
differential growth in demand for graduate training during the 1960s, and
differential growth in undergraduate enrollment) have operated to
effect differential changes in the salary structure across the fields. Given
our small sample of fields, the relative importance of the different factors
cannot be isolated.

In our discussion of earnings across fields we have focused solely on the
academic sector. This allows us to proceed without measures for a number
of nonpecuniary factors which are difficult to quantify and also allows us
to compare standardized 9-month salaries. If comparisons are made







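TABLE 9

Gross Professional Income by Sector of Employment and Discipline, 1970

A. Annual Earnings in Academics, 1970

Columns: XPO, ECON, SOC, MATH, BIO, ANTHR, PHYS.
[Most entries of this panel are missing from the scan. Surviving values, whose
columns cannot be identified: XPO = 15: 23,340; 19,420; 20,770. XPO = 25:
25,900; 20,770; 24,470; 25,540.]

B. Annual Earnings in Government, 1970*

XPO     ECON      SOC      MATH      BIO      PHYS

 5     19,320   17,990   18,850   14,890   17,850
15     24,5?0   22,240   24,900   20,080   24,680
25     20,390   23,120   27,050   24,010   27,860

C. Annual Earnings in Business, 1970†

Columns: XPO, ECON, MATH, BIO, PHYS.
[Most entries of this panel are missing from the scan. Surviving values, whose
columns cannot be identified: XPO = 5: 24,810. XPO = 15: 33,770. XPO = 25:
37,110; 29,430.]

* Insufficient sample in ANTHR.
† Insufficient sample in SOC and ANTHR.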


across the disciplines in other employment sectors (i.e., government and
business), we would expect that the relative salaries of the different
disciplines would be quite similar to academics because of within-field
migration which would tend to equalize net advantages. In table 9 we
present a comparison of gross professional income (12-month earnings) in
academics, government, and business in 1970.

Annual earnings in both academic and government employment (at
XPO = 15, for example) are rather comparable in economics, physics,
and mathematics, but are lower in biology and sociology.

In business employment, Ph.D. economists do markedly better than the
other disciplines. However, business earnings are generally highest if one
develops managerial skills, and these are generally learned at the expense
of more narrowly defined professional abilities in the disciplines. The
evidence which we have on this point is that for economics and physics,
the fields with the greatest business employment, there is a reverse
temporal pattern to the earnings advantage of graduating from a highly
ranked school relative to the pattern observed in academics: initially the
earnings advantage of graduates of highly ranked programs is substantial
(economics, 14 percent; physics, 6 percent), but the earnings advantage
converges to zero at high levels of postdegree experience.












VI. Summary and Conclusion 

In this paper we have employed the NSF National Register data to
examine the earnings of academic economists from a number of points of
view. Among our major conclusions are the following:

1. An individual who completes his doctorate at a relatively early
point in the life cycle has a markedly different earnings pattern from an
individual who begins his career at a later point in the life cycle. Early
starters take positions with lower starting salaries but which afford greater
human-capital accumulation possibilities than the positions typically
taken by late starters. Consequently, the ratio of peak to beginning salaries
is much greater for early starters than for late starters (1.88 for economists
who received the Ph.D. at about age 25 compared to 1.58 for economists
who received the Ph.D. at about age 35). The estimated rate of return to
on-the-job training for economists was found to be 7 percent.

2. A second way of differentiating the academic labor force is by the
quality of the graduate school attended. We found that those economists
who earned the doctorate from one of the top 10 graduate schools receive
lower initial salaries than economists from less prestigious schools, but the
rate of salary increase tends to be much greater for the former than the
latter. The predicted ratio of peak to beginning salaries is 1.91 for
graduates of the top schools and 1.60 for other economists. It would be
interesting to know to what extent these differences in life cycle-human
capital accumulation patterns can be attributed to the content of graduate
schools versus the greater selectivity of the top graduate schools, but this
question could not be answered with the NSF data set.

3. We also analyzed the determinants of the salaries of economists
employed in selected types of schools: the top 20 Cartter-ranked schools,
a set of selective small private colleges, all institutions in the Deep South,
all institutions on the Great Plains, the California State College system,
and the University of California system. As would be expected, the more
research-oriented school types exhibit much steeper experience-earnings
profiles than the less research-oriented school types. It is also true that the
research mills have a higher proportion of their faculties who are early
starters and who received the doctorate from a top university. (They are
also less likely to employ women than are institutions which stress
teaching.) These facts suggest the institutional reasons for our results
with respect to early versus late starters and economists from the better
versus other graduate schools.

4. The salaries of academic economists in 1970 were not much different
from those of physicists, mathematicians, and anthropologists and only a
little higher than those of sociologists. One exception to this is the fact that
economists earn relatively higher starting salaries than anyone else in our
selection. However, gross earnings were greater for economists than for




any of the other disciplines studied. These same findings also applied in
1964.

5. The analysis of earnings in various disciplines also provides a
suggestion of a finite elasticity of substitution between labor of different
ages. Specifically, disciplines in which a large fraction of the occupation
has high amounts of experience are likely to be characterized by relatively
flat experience-earnings profiles as compared to disciplines in which a
small fraction of the occupation has high amounts of experience. The
estimated value of the elasticity of substitution between labor with 14 or
more years of experience and labor with fewer than 14 years of
experience is about five. This implies that to some extent the demand for
educated labor is age-specific, and models of the dynamics of professional
labor markets (Freeman 1971) should take this into account.


References 

American Association of University Professors. “Coping with Adversity: Report
on the Economic Status of the Profession.” American Assoc. Univ. Professors Bull.
58 (Summer 1972): 178-2??.

Ben-Porath, Yoram. “The Production of Human Capital and the Life Cycle of
Earnings.” J.P.E. 75 (July/August 1967): 352-65.

Boddy, Francis M. “Recent Behavior of Economists’ Salaries.” A.E.R. 61 (May
1971): 316-20.

Cartter, A. M. An Assessment of Quality in Graduate Education. Washington:
American Council on Educ., 1966.

Freeman, Richard B. The Market for College-Trained Manpower. Cambridge,
Mass.: Harvard Univ. Press, 1971.

Hause, John. “Ability and Schooling as Determinants of Lifetime Earnings or
If You’re So Smart, Why Aren’t You Rich?” A.E.R. 61 (May 1971): 289-98.

Johnson, George E., and Stafford, Frank P. “The Earnings and Promotion of
Women Faculty.” A.E.R. (1974), in press.

Mincer, Jacob. “On-the-Job Training: Costs, Returns, and Some Implications.”
J.P.E. 70 (suppl.; October 1962): 50-79.

------. Schooling, Experience, and Earnings. New York: Nat. Bur. Econ. Res., 1973.

Mooney, Joseph D. “Attrition among Ph.D. Candidates: An Analysis of a
Cohort of Recent Woodrow Wilson Fellows.” J. Human Resources 3 (Winter
1968): 47-62.

Rosen, Sherwin. “Learning and Experience in the Labor Market.” J. Human
Resources 7 (Summer 1972): 326-42.

Stephan, Paula E. “Life Cycle Training Paths Subsequent to Formal Education.”
Ph.D. dissertation, Dept. Econ., Univ. Michigan, 1971.

Stigler, George J. Capital and Rates of Return in Manufacturing. New York: Nat. Bur.
Econ. Res., 1964.

Tolles, N. Arnold, and Melichar, Emanuel. “Studies of the Structure of
Economists’ Salaries and Incomes.” A.E.R., vol. 58 (December 1968, pt. 2).

Warkov, Seymour, et al. Graduate Student Finances, 1963. Chicago: Nat. Opinion
Res. Center, 1965.




Frank Knight in Retrospect 


Frank Knight and the Tradition of Liberalism 

Scott Gordon 

Indiana University and Queen’s University


Frank Knight was a very complex and subtle thinker, especially so in
the area of political philosophy, and any appreciation of his views in short
compass (or perhaps even at length) is bound to produce simplifications
of the kind that would have made him groan. In the 60 years of his study
and meditation upon man as a political animal (i.e., a rational one, not
a behavioristically social one like the termites or ants!), he generated
far more questions than answers, more problems than solutions, and he
was more distressed by ingenuousness in political philosophy than by
error or evil. He had the uncommon gift (and the curse!) of the compound
eye; human society appeared to his perception through many angles
of view which compelled a projection that was a mosaic of great richness,
complexity, and, ultimately, mystery. We may label him a “liberal”
(in the mid-nineteenth-century meaning of that term) because he did so
describe himself, but his own discussion of liberalism was more concerned
with its weaknesses than its strengths. No one, indeed, has searched out
its subterranean defects with more tenacity.

The essence of the liberal philosophy, in Knight’s view, was freedom,
not as an instrument (for utilitarian efficiency) or even as a human
preference, but as an ethical value in itself. This raises many philosophical
problems, which are compounded by the fact that Knight was unwilling
to accept any specific theory of ethics or even any of the extant general
approaches to ethical theory. He rejected alike all reductionist theories
(including the reduction of all values to freedom) and all deontological
statements of absolute values. The closest he would come to advancing
an ethical position of his own was to evince a belief (or a hope) that man
is capable of attaining higher levels of social existence, to become more
“civilized” in some sense. Yet he rejected altogether the idea that one
may describe some ideal civilization and set it up as a goal, the ceaseless



search for the better being superior to any conceivable “best.” It would
ill serve the purpose of even a brief essay on Knight’s political philosophy
to focus exclusively upon his liberalism, since the problem of freedom
was intimately connected in his thought with a number of other issues
that are equally fundamental. We will do better to circle around our
target, as Knight himself always did, in order to note the vital
intersections between freedom and other facts and values.

We may begin where Knight did, with uncertainty. His doctoral
dissertation, Risk, Uncertainty, and Profit (written in 1914–16, published
in 1921), can be (and has been) read as a technical study on the pure
theory of profit as a category of income distribution. More broadly, it
can be read as an essay on the nature of a private enterprise economy.
But it is more than these. If one does not neglect those passages which
may appear to the economist as obiter dicta, it becomes clear that Knight
was tackling one of the most fundamental problems of epistemology and
ethics. In Knight’s view, uncertainty is not only, or “merely,” the key
to the understanding of an economy of private entrepreneurship whose
motive force is profit; it is the most basic characteristic of human existence,
which has profound philosophical and practical significance and produces
much of the array of political and ethical problems which man faces as a
participant in social organization. A world lacking in uncertainty
(even one in which “risk” were present) would not be one of free human
beings at all, but of determined automata. In such a world, the steady
forces of biological evolution would inexorably fix all survival-value
behavior patterns in the genetic inheritance. No meaning could then
attach to words such as “deciding” or “valuing”; indeed, consciousness
itself would be redundant, and “thinking” would not be distinguishable
from “behaving” or reacting to stimuli. A Laplacean world leaves nothing
to be decided by acts of will. Only a world of uncertainty can be modified
by the values man adopts and the choices he makes, and it is man’s
capacity to change the world, and himself, that is the most essential
quality, and opportunity, of the human condition. Yet, at the same time,
man’s opportunity of acting to advance his civilization must be based
upon reason (his capacity to examine the world objectively and to
establish cause and effect) which is inherently limited by uncertainty
itself. Thus, man is a free and thinking being because of uncertainty, yet
it is uncertainty which imposes limits upon his effective use of reason, a
complexity that is compounded by the fact that we are uncertain also
as to the limits of uncertainty; there is no way to establish the boundaries
among knowledge, opinion, and plain ignorance.

The most important implication of this, for liberalism and for economics,
is that man is a fluid creature and the course of his psychological
evolution is indeterminate. In Knight’s view, the largest defect in liberalism
as a social philosophy is that it takes the individual as a datum, and




defines the objective of social policy as ministering to his given wants;
this turns a blind eye to the phenomenon of acculturation, the fact that
the individual and his wants are largely formed by the social process and
are not exogenous to it. One cannot set up as an ethic that it is good that
people should have as much as possible of what they want, because
the ethical quality of a civilization is largely a matter of what its people
want and the direction in which those wants are evolving under the
impact of choices successively made. Knight was the only economist
of the twentieth century to take up seriously the problem raised by Alfred
Marshall of the complex relation between man’s “wants” and the
“activities” he undertakes to satisfy them. The growth of economic
wealth, in removing man from the pressure of immediate physical necessity,
creates a society in which wants are endogenous to the social process;
neither ends nor means can be taken as given, nor are they separable
from one another. The evolutionary character of human culture as a
consequence of this is what Marshall was referring to in his rather cryptic
remarks about the necessity of economics becoming a “biological”
science. Like Marshall before him, Knight could not provide an analytical
framework that could grapple with this problem but, unlike Marshall, he
was not naive enough to believe that it will ever yield to the advancement
of “science” in any meaningful sense of that term. As a philosopher,
Knight was unwilling to accept any conception of an ideal as a guide to
human evolution, or any teleological belief that society tends toward
some goal. On this issue, as on many others, he arrived at doubt rather
than doctrine, an increased confidence only in uncertainty.

A connected theme, which appears time and again in Knight’s essays,
is the significance of play in human culture. This is a phenomenon of
extraordinary significance for two reasons. First, the means-ends
relationship is reversed in play activity; goals are established as instrumental to
the true objective, which is to provide opportunity for satisfying activity.
Second, a good game requires rules, yet there is an inherent conflict
between the motivation to win and obedience to the rules. What Knight
wished to point out is that many economic (as well as political and other
social) activities have the essential character of games, and the motivations
of the participants are understandable only if they are recognized as
desire to engage in sport or play. Business, in particular, Knight regarded
as an outstanding game, possibly the greatest ever invented. The ethical
issues (not to gainsay the economic ones) that are involved here are as
important as they are puzzling. If games were merely a matter of
recreation, one could simply classify this activity as one of the ways in which
Homo economicus employs his time and the other resources of the economy,
but the play element infuses much that is normally regarded as “work,”
and this raises a host of difficult questions concerning the ethics of
competition, the nature of the nonzero-sum character of games, and the





interpenetration of means and ends. It even suggests that a competitive
economy ought not to be viewed, as economists do, in instrumental
terms, but as a manifestation or exemplification of human characteristics
which are “fundamental” in a cultural sense. In the competitive economy,
man displays himself not only as Homo economicus and as Homo faber, but
also as Homo romanticus and as Homo ludens.

Politics too is a game, inseparable from the existence of social life.
Like other games it is subject to rules, but politics has a special significance
in that it makes the rules by which all games (including itself) are played.
Knight repeatedly emphasized that the constraint of individual activity
by rules does not in any way alter the essential fact that all social activity
is the interaction of men, and all constraints upon individual action are
decided by, and operated by, men, not by the abstraction of “law.”
It is only “romantic nonsense,” he once remarked, to believe that one can
have a “government of laws” that differs in any fundamental way from a
“government of men.” What matters in a polity is not the form of government,
but how public opinion is generated. To the extent that he would
accept “democracy” as a label for a good political system, Knight
adopted Lord Bryce’s definition of it as “government by discussion.”
His frequent castigation of politicians was not intended to be a condemnation
of politics, nor did it reflect any naive reverence for the automaticity
of the market form of economic organization, the perfection of which
he questioned more profoundly than any other economist, not excluding
socialist ones. His vexation with politicians was due to their tendency to
turn “discussion” into “persuasion,” which Knight regarded as coercion
of a particularly insidious and dangerous type. A good polity, a “democracy,”
is a society in which there is free exchange of ideas over the
widest possible scope, unconstrained not only by censorship, but also by
propaganda and guile. In such a society, social policy will reflect a
freely formed and eternally fluid public opinion which results from
continuous discussion, most importantly of values rather than methods
(which at bottom are really technical matters and can be settled by
“science”).

The operation of good political processes is very difficult, not only
because of the tendency for “discussion” to degenerate into “persuasion,”
but because it is inherently difficult for any group of free men to arrive
at collective decisions. Arrow’s “general possibility theorem” is not a
serious difficulty for a democratic polity, unless one mistakenly identifies
the process of voting, rather than discussing, as the essential feature of
democracy. Where discussion is free and extensive, individual preferences
are modified and collapsed into a broad consensus, at least on fundamental
values, which permits the machinery of representative or even plebiscitary
processes to function effectively on specific issues. Nevertheless, such
processes of collective agreement are very difficult, and Knight perceived
that it would grossly overload the machinery of democratic decision
making to charge it with the task of deciding all matters, and all details,
of collective cooperation. A market economy does not require the performance
of this practically impossible task. Even a large society can be
one of cooperation on the basis of exchange, which may take place on any
terms which two parties agree are to their mutual advantage. By means of
the market mechanism a vast array of the practical problems of social
existence are removed from the orbit in which coercion operates. Any
government which undertook to manage a modern economy in detail
would have to be a comprehensive dictatorship, repressing not only
voluntary exchange, but, in order to sustain itself, the free expression of
opinion. The view that a comprehensive government could be, or long
remain, a protector of individual freedom is an error that rests upon
denial of the elementary sociological fact that such a monopolistic center
of power would inevitably attract to itself persons with a taste for coercing
their fellow men. The socialist labors under the delusion that while
politicians and bureaucrats in a capitalistic society are venal, corrupt,
and tyrannous, they will be paragons of honesty, justice, and self-constraint
in a socialist one.

The freedom that Knight regarded as the essence of liberal philosophy
was not viewed by him, in purely negative terms, as the “absence of
constraint.” He differed from other liberals, like Hayek, in interpreting
freedom as necessarily involving power, the ability to do what one wills.
This creates great difficulties in a world where what one may wish to do
conflicts with the wants of others. As a general philosophical principle,
Knight accepted John Stuart Mill’s proposition that one’s legitimate
freedom of action is constrained to the sphere within which there is no
interference with that of others, but he recognized this to be a principle
which leaves the largest problems of social life unsettled. The economy,
in particular, is a system in which conflicts of power are of the essence.
A perfectly competitive economy, operating upon the principle of
voluntary exchange, assures that commutative justice will rule; no one
is coerced to exchange against his interest. But each person’s power (and
therefore his freedom) to engage in economic processes is a matter of the
aggregate means at his disposal, and there is no reason to believe that the
distribution of such means will be just in any commonsense or sophisticated
sense of the term.

This was, in Knight’s view, the most conspicuous and most dangerous
defect of the market economy. He did not accept equality of income as a
distributional ideal, but he viewed existing inequalities as gross and,
moreover, he felt that there were powerful forces at work in a private-property
economy tending to increase this inequality: it is easier for the
rich to save than for the poor; the inheritance of property extends in
time, and potentially without limit, the advantages gained by the successful
accumulation of wealth at any point; the owners of property, unlike
those whose persons are their only productive means, can operate in the
market for assets as well as in the market for the services of assets, and
so are more able, in a dynamic economy, to take advantage of opportunities
for gain and to protect against losses. Knight felt that the power
of democratic government can be, and ought to be, used energetically
against concentration of wealth (and power), but he was doubtful that
it could do more than delay briefly the basic tendency. He felt that the
inevitable growth of inequality will erode the competitive market economy
in itself, and it will also destroy its ethical justification.

Knight was no romantic reactionary, looking back to some time in the
past as the golden age of civilization, but neither was he a romantic
progressive, viewing the evolution of society as tending inevitably toward
the better and better. He saw the history of the West as one of great
human progress since the Renaissance. The defeat of intellectual formalism,
the rise of empirical science, the destruction of religious authoritarianism,
and the effect of the market economy in (initially) dispersing power
and weakening coercion were, in his view, the highlights of man’s struggle
to create a higher civilization. But one cannot read Knight’s essays without
apprehending the deep pessimism of their author. In part this was due
to Knight’s view that the competitive economy tends to economic
inequality and therefore to self-destruction; in part it was due to his
despondency over the qualities of man: his perversity, prejudice, love of
violence, and taste for evil; but more than these, he was dejected by the
realization that the central problems of ethics were insoluble and that,
even if that were not so, any society that devoted itself wholeheartedly
to an ideal, even a good and noble one, would be a mean and servile
place, stunting rather than enlarging man’s capacity for civilization.

Frank Knight disliked labels; his compound eye saw the weaknesses
of any political or philosophical “position” at least as clearly as its
strengths. To classify him simply as a “liberal” would be misleading,
and not only because that term has altered its meaning so greatly over
the past century of its philosophical ascendancy in the West. Even its
older form was gravely flawed, in Knight’s view, by its excessive individualism
and failure to recognize adequately the social and the evolutionary
character of man. But in one great sense, which transcends these issues,
he was indeed a liberal, in the tradition of Milton and Mill, believing
that a good society is one that is uncertain of its truths and dedicates
itself, not to an ideal, but to the eternal search.

The core of liberalism, what most distinguishes it from other
views of life, is a manifold revolution in the conception of
truth. To say that belief is free is to say that truth is inherently
“dynamic,” subject to change and actually growing
and changing. The liberal interest in truth is one of curiosity
and quest, not of mystical contemplation or adoration.
Any truth that is really “established” is no longer interesting,
but a commonplace, even a bore. Truth is the supreme example
that liberal idealism looks at the values of life in terms of
pursuit as well as possession. Truth is an end when it is
unknown or uncertain, and especially if controversial; hence the
truth interest is finally a romantic one. [Freedom and Reform,
p. 394]

Frank Knight was a liberal, but he was also a conservative; an idealist,
and also an empiricist; a romantic, and also a utilitarian; and he was a
great modern critic of all these. His mental constitution was incapable of
that monocular fixation which is the indispensable prerequisite of widespread
fame or the founding of a “school.” He leaves behind at his death
many who have been deeply influenced by his thought, but no disciples,
which is how he would have wanted it.



Frank H. Knight as Economic Theorist 

Arthur H. Leigh 

Reed College


The greatness of Frank Knight as economic theorist rests as much on his
contributions to the progress of economics on a broad front as on his more
conspicuous theoretical breakthroughs, of which the theory of profit and
the theory of capital are the most significant. He defended the fundamentals
of neoclassical economics by refining and rehabilitating them,
not merely by reasserting them. With explicit recognition of the depth
and complexity of human motives, he abstracted from them the behavior
patterns which are pertinent to economic analysis, without implying that
actual behavior could be accurately predicted with those patterns alone.¹
He demonstrated the effectiveness of a perfectly competitive market
structure to achieve optimal allocative solutions but pointed out the
difficulty of realizing or even defining with logical consistency the conditions
of perfect competition.²

Knight defined economics broadly as the study of social organization
whose function is the allocation of given scarce resources to maximize
the achievement of given ends. He emphasized the universality of the
choice problem imposed by scarcity and the generality of its solution in
terms of marginal equalities. As a writer and thinker of great breadth,
he transcended disciplinary boundaries, including those of his own
definition of economics, with profound philosophical discussions of value,
freedom, justice, and their close links to economic theory.


The Theory of Profit 

In Risk, Uncertainty, and Profit (1921) Knight defined profit as a pure
residual,³ the amount of enterprise revenue remaining after the deduction


¹ Cf. Buchanan 1968, p. 426; Bose 1970, pp. 7-10; Breit and Ransom 1967, pp.
198-200; Rima 1972, pp. 289-91.

² A noteworthy example is his famous controversy with Pigou on the effects of increasing
returns on the optimality of the competitive solution and on the relationship between
social and private costs (Knight 1936a, chap. 8, pp. 217-36; cf. Rima 1972, pp. 288-89,
294-95, and Breit and Ransom 1967, pp. 191-200).

³ Knight acknowledges prior suggestion of his theory of profit by F. B. Hawley (Knight
1921, pp. 31, 41-46) and the nineteenth-century German economists von Thünen
and von Mangoldt (Knight 1921, p. 26).

of all enterprise costs. Costs consist of payments at market prices for all
productive services to which a contribution to the product can be imputed,
including those provided by the owner-entrepreneur and his
property. They include “rents” for all exclusive advantages and privileges
possessed by the enterprise which give rise to monopoly power, and payments
for all measurable, and therefore insurable, risk. The existence of
a profit residual is therefore due to “uncertainty,” which, in contrast to
insurable risk, results from changes in market and technological conditions
so unpredictable that their probability cannot be measured for
the enterprise in actuarial terms. Generally anticipated change does
not give rise to profit, for in this case competing entrepreneurs will bid
up the prices of inputs and bid down the prices of product even before
the change actually occurs, maintaining equality between revenue and
cost. The receipt of profit is temporary, lasting only until the innovation
or adaptation which gave rise to it has been generally imitated.* The
realization of positive profit rewards the entrepreneur who anticipates
change sooner and more accurately than do the majority of competing
entrepreneurs. The average magnitude of profit in the aggregate tends
to be low if entrepreneurs are generally optimistic about their prospects
and bid up the prices of productive services. Knight suggests reasons for
believing that aggregate profit may be negative, with average net losses
cutting into other (imputed) income. People are commonly willing to
gamble when the odds are known to be stacked against them. Businessmen
tend to be overly optimistic about their own capacities and chances
for success. People respond to the dynamic stimulus to outbid their
rivals and play the game, and the prestige and independence of being
one’s own boss has wide appeal.

The entrepreneurial function is that of making decisions for the successful
adaptation of the enterprise to change and, in a progressive society,
acting as the economic pioneer initiating useful innovations. The entrepreneur
is a specialist in bearing uncertainty, including the uncertain
consequences of his own decisions. He guarantees fixed contractual
payments to the contributors of productive services, thus relieving them

⁴ For critical commentaries on Knight’s distinction between risk and uncertainty
and its relation to probability theory and profit, see Hicks (1931), pp. 170-71, 173;
Arrow (1951), pp. 408, 415-17, 420; and Fellner (1965), pp. 29-31. Hicks credits Knight
with placing the theory of profit on a firm foundation, the dependence of profit on
uncertainty, but argues that a satisfactory theory of profit can be developed without
Knight’s concept of true “uncertainty” or nonmeasurable risk. Arrow and Fellner agree
that Knight failed to develop a clear theory of uncertainty as distinct from risk, but
Arrow characterizes as an “analytic point of considerable interest” Knight’s assertion
that if all risks were measurable, then risk aversion would not give rise to any profit
(p. 427).

⁵ Schumpeter, like Knight, held a dynamic residual theory of profit, but he emphasized
innovation and development rather than uncertainty and unpredictable change as its
determinants (cf. Schumpeter 1934, pp. 128-56, 172; 1939, pp. 102-9).

of uncertainty. It is the function of profit (or the anticipation of profit)
to motivate and reward this socially desirable behavior.⁶

Before 1921, the profit concept was, with few exceptions, a variety of
confusing mixtures composed of return on capital, wages of management,
premia for risk, monopoly rent, entrepreneurial income, and exploitation
of workers. Knight isolated profit as a distinct distributive share and
identified its essential social function together with that of the entrepreneur.
In the process of doing so, he reexamined and improved the
theories of competition, product imputation, knowledge and uncertainty,
and enterprise organization. He helped to discredit the common belief
that “profit” can be indiscriminately taxed or bargained away with
impunity. The modern technical and textbook literature reflects his
theory of profit significantly, but not completely. Nearly all texts distinguish
profit from interest, risk premia, and managerial wages. Fewer distinguish
clearly between profit and monopoly rents. “Profit” in business,
tax, accounting, and popular usage is still the old confusing mixture.


The Theory of Capital and Interest 

Knight developed his theory of capital and interest in a series of controversial
journal articles and essays, the first of which appeared in 1916
and the remainder between 1931 and 1944.⁷ It is best understood as the
core of his theory of production and growth and as a reaction against
important elements of Austrian,⁸ classical, and neoclassical economics.
He rejected the then widely accepted “Austrian” capital theory, according
to which capital is intermediate goods first produced by “original”
productive agents such as labor and natural resources and then used
up in the production of final goods over a definable period of time. In
Austrian theory, the quantity of capital used varies directly with, and
is measured by, the average length of the time period between the
investment of original factor services and the consumption of the final
product. The productivity of a process increases with greater “roundaboutness”
(capital intensity), but the lengthening of the period of
production is subject to diminishing returns. A general preference for
present over future goods of like kind and quantity, expressed as a rate
of discount, means that savers must be compensated for “waiting” and


⁶ The social function of profit as Knight saw it was most clearly spelled out in an article
which appeared much later than Risk, Uncertainty, and Profit (Knight 1942).

⁷ Knight 1916, 1931, 1933a, 1933b, 1934b, 1935a, 1935b, 1935c, 1936b, 1937, 1938,
1941, and 1944.

⁸ Writers with whom Knight took issue were Böhm-Bawerk, Menger, Wieser, and
Jevons of the earlier generation, and included Mises, Kaldor, and Hayek among his
contemporaries. In many important respects, Knight’s concepts of capital and the
productive process follow those of J. B. Clark (cf. Clark 1914, chaps. 9, 10, 18, and 20).

limits the extension of the period of production. The equilibrium rate
of interest for the economy is established by the relationship between
an upward-sloping supply curve for saving based upon time preference
and a downward-sloping demand curve for capital based upon the
marginal productivity of increased roundaboutness.

In contrast to these views, Knight defined production as the rendering
of want-satisfying services by a large variety of coordinate productive
agents and the creation of net additions to the stock of productive agents.
With production in the first sense, consumption (the enjoyment of
services) is simultaneous. With production in the second sense, no consumption
corresponds unless or until society as a whole uses up its stock
of productive capacity, a “decadent” condition not characteristic of
modern societies.⁹ There is therefore no identifiable, much less measurable,
period of production short of the whole interval between the
historic origins of the economic system and its ultimate termination.
Useful classification of “productive agents” requires distinctive categories
whose component units are sufficiently interchangeable to permit
the correct application of the marginal productivity principle. The
traditional land, laborers, and capital goods trilogy is therefore more
misleading than useful. Knight’s “capital” is not a category of productive
agents classified by origin or physical characteristics, but the present
value of all nonhuman productive agents which yield a flow of valuable
services through future time. For purposes of theory and accounting,
this flow is reduced to a perpetuity by providing for the maintenance or
replacement of concrete productive agents out of their product flow. This
replacement process makes up much of what is commonly called “production.”
The stock of capital is thus the value embodied in a vast variety
of objects of every degree of durability. At the indistinct borderlines of
the concept are intangibles (such as knowledge) which may or may not
be appropriable, and free human beings whose productive capacity is
partly the result of economically motivated investment but for legal
reasons is not subject to capitalization as property.

To measure a quantity of existing capital, the present value of a unit
of productive capacity must be found by capitalizing the yield imputable
to it at some given rate of interest. The appropriate rate is the ratio
between an increment of perpetual annual service yield and the minimum
cost, in terms of foregone consumption, of constructing an increment of
productive capacity capable of yielding it, and is determined at the


⁹ This does not mean, of course, that no additional consumption corresponds with
(results from) the additional net output produced by the net additions to the stock of
productive agents, once they are in operation. But that additional consumption cannot
at the same time be attributed to production of services by the new agents and the production
of those new agents by previously existing agents.

¹⁰ Cf. Schumpeter 1954, pp. 900, 906, and 1018.

economy’s margin of investment.¹¹ Free competitive markets establish
determinate values for both the yield and the cost of an investment
and tend, in spite of uncertainty, to bring rates of yield on all lines of
investment into mutual equality. This is the marginal rate of return on
investment for the economy which is used to capitalize the yield of all
existing nonhuman productive agents and thus to measure the capital
embodied in them.
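
As a hedged illustration of the capitalization step just described, the following uses notation assumed here for convenience (V, Y, and i are not symbols taken from Knight or from this essay). If a productive agent yields a perpetual net annual service flow Y, after provision for its maintenance or replacement, and i is the economy's marginal rate of return on investment, then its capital value is the yield divided by the rate:

```latex
\[
V = \frac{Y}{i},
\qquad\text{for example}\qquad
Y = \$5 \text{ per year},\quad i = 0.05
\;\Longrightarrow\;
V = \frac{5}{0.05} = \$100 .
\]
```

On this reading the aggregate quantity of capital is the sum of such values over all nonhuman productive agents, so the measure depends on the rate used to capitalize as well as on the yields themselves.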

Knight argued that this rate, and the loan rate which tends to conform
to it, are determined independently of any supply function for savings or
loanable funds. The short-run demand function for liquid capital is
infinitely elastic because the marginal productivity of capital is a function
of the total stock of capital, and investment over a period as short as
1 year adds too little to that stock to affect its marginal productivity.
The choice of savers between more current consumption and larger
streams of future income therefore affects not the rate of interest but the
quantity of saving and investment. Knight considered the Austrian
attempt to explain this choice in terms of time preference to be misspecified.
Properly stated, the question is whether, ceteris paribus, people
would generally prefer to receive a stream of income of given duration
and total magnitude in increasing, equal, or decreasing installments,
apart from the opportunity to invest and earn interest on the earlier
installments. Common observation suggests that the first option would
be at least as popular as the third, a result which would run contrary
to the time-preference theory.

Knight assumed that the economy’s marginal productivity of investment
is a decreasing function of the stock of capital and speculated that the
elasticity of the function approximates unity. The rate of return could
never reach zero unless all services which capital goods could render
became free goods. But it was not until late in the development of his
theory of capital (1944) that he explained diminishing returns to investment.
The comprehensiveness of his capital concept posed a problem.
A growth model in which all agents are freely augmentable by investment
appears to afford no basis for diminishing returns to investment
in general, for there are no agents fixed in quantity to which successive
¹¹ Knight expressed his definition of the net rate of return on investment in algebraic
form as follows: $S(A^{C} - 1) = R(A^{L} - 1)/A^{L}$, where S is the number of dollars invested
per year over a construction period of C years, R is the imputed annual yield of the asset
in dollars, L is the life of the asset in years, A equals (1 + i), and i is the rate of return (the
unknown). Note that i is a net (perpetual) rate of return even though the investment on
which the equation is based is assumed to have limited life, and that the rate enters into
the cost side (as well as yield side) of the equation whenever the construction of the asset
requires an interval of time (see Knight 1956, pp. 49-50; 1934b, pp. 265-67; 1936b,
pp. 445-40).
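
The rate defined in this footnote has no closed-form solution in general, so a small numerical sketch may help. It takes the relation in the form S(A^C - 1) = R(1 - A^(-L)), which is equivalent to R(A^L - 1)/A^L and is what results from equating outlays accumulated at rate i over the construction period with the present value of the yield over the asset's life, both with end-of-year payments. That form, the payment timing, the function name `knight_rate`, and the sample figures are all assumptions made for illustration, not Knight's own procedure.

```python
def knight_rate(S, C, R, L, lo=1e-6, hi=1.0, tol=1e-12):
    """Solve S*(A**C - 1) = R*(1 - A**(-L)) for i, where A = 1 + i.

    S: dollars invested per year during construction (illustrative input)
    C: construction period in years
    R: imputed annual yield of the asset in dollars
    L: service life of the asset in years
    Returns the rate i between lo and hi, found by bisection.
    """
    def gap(i):
        A = 1.0 + i
        # Yield side minus cost side; the common factor 1/i has been cancelled.
        return R * (1.0 - A ** (-L)) - S * (A ** C - 1.0)

    # Bisection needs the yield side to exceed cost at lo and fall short at hi.
    if gap(lo) <= 0 or gap(hi) >= 0:
        raise ValueError("no rate of return bracketed between lo and hi")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if gap(mid) > 0:
            lo = mid  # yield still exceeds cost, so the rate is higher
        else:
            hi = mid
    return 0.5 * (lo + hi)


# One year of construction costing $100, then a single year's yield of $110.
rate = knight_rate(S=100.0, C=1, R=110.0, L=1)
print(round(rate, 4))  # 0.1
```

With a very long life the same call approaches the perpetuity case: an outlay of $100 followed by $10 a year for 1,000 years also gives a rate of about 10 percent, which illustrates why the footnote can describe i as a net (perpetual) rate.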

¹² Knight (1944). In an earlier article, Knight (1938) offered an explanation of diminishing
returns to capital in terms of diminishing marginal utility of consumption services,
but later rejected it in favor of the explanation which appeared in the 1944 article.

increments of variable agents can be applied with consequent successively
diminishing increments of product. As solution, he postulated the existence
of some productive agents which could be augmented only at sharply
increasing cost in terms of other agents, with absolute fixity of supply
as the limiting case. There are echoes here of the traditional land versus
capital distinction, but Knight does not identify his fixed agents with
natural resources. He saw no clear historical trend in the rate of interest
and denied the existence of a long-run equilibrium rate. Discovery and
invention, themselves a form of investment, together with population
growth, have tended to offset the downward influence of capital accumulation.
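
A short worked illustration may help connect the unit-elasticity conjecture to the claim that one year's investment barely moves the rate of return. The functional form, the symbols r, K, and c, and the 3 percent figure are assumptions chosen for illustration; none of them is Knight's.

```latex
\[
r(K) = \frac{c}{K}
\quad\Longrightarrow\quad
\frac{d\ln r}{d\ln K} = -1,
\qquad rK = c .
\]
\[
K \;\to\; 1.03\,K:
\qquad
r = 0.05 \;\to\; \frac{0.05}{1.03} \approx 0.0485 .
\]
```

Under these assumptions, a year in which net investment adds 3 percent to the capital stock lowers the rate from 5 percent to roughly 4.85 percent, while total capital income rK is unchanged. A movement of that size is consistent with treating the short-run demand for investment funds as nearly flat.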

Knight succeeded in freeing capital theory from preoccupation with
the characteristics of particular productive agents and an array of
distracting distinctions and classifications. His theory of capital and
interest is a long-term, aggregative model of production and growth
emphasizing the interdependence and continuity of the productive
system, the measurement and productivity of capital, and the cost and
consequences of accumulation. States of transition and disequilibrium
are treated only incidentally, though it is clear that Knight considered
the durability and transferability of concrete productive agents to be
important as influencing the adaptability of the economy to changing
conditions. His subordination of disequilibrium analysis may have been
partly responsible for his overemphasis upon demand at the expense of
supply in interest-rate determination. An infinitely elastic short-run
demand function for investment funds implies that within an interval
as short as 1 year equally profitable investment opportunities are unlimited,
that the relative prices of investment goods do not rise in response
to high investment activity, and that even in a continuously changing
economy, prospective rates of return on all particular lines of investment
remain equal in the short run.¹³ This is not a realistic set of assumptions.¹⁴


The Concept of “Cost” 

In the realm of conceptual refinements, Knight's view of “cost” occupies
a prominent position. The cost of anything is the value of the best alternative
choice foregone to obtain it, not the “pain” or disutility of working
or waiting endured in production or investment. The contribution was
not the opportunity-cost idea in itself, which had been emphasized by


¹³ Knight’s own qualification of his demand theory of interest rates was that since
investment planning requires time, sudden changes in saving could temporarily glut or
starve the market for loanable funds and affect the interest rate (Knight 1934b, pp.
285-86).

¹⁴ For a highly critical commentary on Knight’s theory of capital, see Seligman (1962),
pp. 646-65.

the Austrian economists in the 1870s, but in clearing away the pain-cost
notion, which the Austrians had failed to do. This simplification of the
cost concept facilitated another of Knight’s contributions, a clarification
of the relationships among cost, utility, value, and price within a general
interdependent system. It directs attention to the choice problem central
to efficient resource allocation and away from moralistic justification
of the prices paid for productive services in terms of subjective sacrifice.
In a market economy, income distribution is a matter of the pricing of
productive services and the ownership of productive capacity. Distributive
justice, with which Knight was deeply concerned, was to him a matter
of free market competition and (more important as problems unsolved)
inequality in inherited capacities and wealth, and in the opportunity to
acquire, develop, and market valuable productive services. By denying
the usefulness of a “pain” theory of cost, Knight did not mean that all
choices are among pleasurable alternatives. The foregone alternative
can as well be the avoidance of fatigue, boredom, or waiting as the
enjoyment of leisure or immediate consumption satisfaction. Rational
choice does not depend upon the exact location of the zero point between
utility and disutility.

Conclusion 

I conclude with emphasis upon the breadth of Knight’s contributions to
the improvement of economic theory. As one of the last great general
economic theorists to rely entirely upon nonmathematical exposition,¹⁵
he was keenly aware of the weaknesses of the conceptual categories which
he inherited from his predecessors. Emphasizing the fact that all scientific
analysis is based upon and limited by the use of distinct classes of
relevantly homogeneous phenomena, he rejected or subjected to critical
scrutiny such traditionally accepted categories as land, labor, capital,
profit, interest, production, consumption, cost, value, competition, risk,
the entrepreneur, and economics. His lifelong effort to increase the
usefulness and the logical clarity of the basic concepts of the field led him
frequently to qualify his own concepts, elaborate upon their meaning
and limitations, and digress from the central line of argument. This
tendency made many of his writings discouragingly complex to some
readers, but it contributed substantially to the intensely stimulating
quality of his work. It was part of his effort to get to the fundamentals
of the discipline, to construct firm foundations, usefully related to the
real world as we perceive it, on which to construct our increasingly

¹⁵ Knight having begun his intellectual life as a physicist, his choice of verbal exposition
was a matter of preference, not necessity.

elaborate theoretical superstructures, and to set economic theory in the
direction of discovering and elucidating rather than obscuring the
“truth.”

References 

Arrow, Kenneth J. “Alternative Approaches to the Theory of Choice in
Risk-taking Situations.” Econometrica 19 (October 1951): 404-37.
Bose, Ashalata. The Methodology and Scope of Economics. Calcutta: Bookland
Private, 1970.
Breit, William, and Ransom, Roger. The Academic Scribblers. New York: Holt,
Rinehart & Winston, 1967.
Buchanan, James M. “Knight, Frank H.” In International Encyclopedia of the
Social Sciences, edited by David Sills. Vol. 8. New York: Macmillan, 1968.
Clark, John Bates. The Distribution of Wealth. New York: Macmillan, 1911.
Fellner, William. Probability and Profit. Homewood, Ill.: Irwin, 1965.
Hicks, John R. “The Theory of Uncertainty and Profit.” Economica 11 (May
1931): 170-89.

Knight, Frank H. “Neglected Factors in the Problem of Normal Interest.”
Q.J.E. 30, no. 2 (February 1916): 279-310.
---. Risk, Uncertainty and Profit. Boston: Houghton Mifflin, 1921.
---. “Professor Fisher’s Interest Theory: A Case in Point.” J.P.E. 39 (April
1931): 176-212.
---. “Interest.” Encyclopedia of the Social Sciences. Vol. 8. New York:
Macmillan, 1933. (a)
---. “Capitalistic Production, Time, and the Rate of Return.” In Economic
Essays in Honour of Gustav Cassel. London: Allen & Unwin, 1933. (b)
---. “Profit.” Encyclopedia of the Social Sciences. Vol. 12. New York:
Macmillan, 1934. (a)
---. “Capital, Time, and the Interest Rate.” Economica, n.s. 1 (August 1934):
257-86. (b)
---. “Comment.” J.P.E. 43 (October/December 1935): 625-27. (a)
---. “Professor Hayek and the Theory of Investment.” Econ. J. 45 (March
1935): 77-94. (b)
---. “The Theory of Investment Once More: Mr. Boulding and the
Austrians.” Q.J.E. 50 (1935): 36-67. (c)
---. Ethics of Competition. 2d ed. New York: Harper, 1936. (a)
---. “The Quantity of Capital and the Rate of Interest.” J.P.E. 44 (August
and October 1936): 433-63, 612-42. (b)
---. “A Note on Dr. Lange’s Interest Theory.” Rev. Econ. Studies 4 (June
1937): 223-30.
---. “On the Theory of Capital: In Reply to Mr. Kaldor.” Econometrica 6
(January 1938): 63-82.
---. “Professor Mises and the Theory of Capital.” Economica, n.s. 8 (November
1941): 409-27.
---. “Profit and the Entrepreneurial Function.” J. Econ. Hist. 2 (December
1942): 126-32.
---. “Diminishing Returns from Investment.” J.P.E. 52 (March 1944):
26-47.
---. On the History and Method of Economics. Berkeley: Univ. California Press,
1956.




Rima, I. H. Development of Economic Analysis. Rev. ed. Homewood, Ill.: Irwin, 1972.
Schumpeter, Joseph A. The Theory of Economic Development. Cambridge, Mass.:
Harvard Univ. Press, 1934.
---. Business Cycles. Vol. 1. New York: McGraw-Hill, 1939.
---. History of Economic Analysis. New York: Oxford Univ. Press, 1954.
Seligman, Ben. “F. Knight and Abstractionism.” In Main Currents in Modern
Economics. Glencoe, Ill.: Free Press, 1962.



Communications 


Rational Response to the Money Supply 

Richard Roll 

European Institute for Advanced Study in Management


I. Introduction 

Knowledgeable consumers should be attentive to published measures of
the money supply. They have an incentive to forecast money’s future
quantity as accurately as possible, for they must decide which commodities,
assets, and labor income should be contracted now, in nominal
terms, and which should be left for purchase or sale at future (unknown)
prices.

Information about the money supply has a central role in consumption
forecasting and choosing because it is commonly and correctly believed
to be the most important influence on commodity prices. By forecasting
the money supply into the future and estimating its effect on the sequence
of commodity prices, the consumer is performing a rational decision
calculation, identical in substance with that performed by an investor
estimating the effect of predicted future earnings on the sequence of a
firm’s common stock price.

By the very act of making decisions on the basis of anticipated money
stocks, however, consumers decrease the empirical connection between
future changes in money and the current price of goods. The efficient
commodity market would display a minimal connection between the
two because commodity prices would be bid quickly to a discounted
level related to the predictable portion of the monetary sequence. Again

This research was supported by a Ford Foundation fellowship. However, the
conclusions, opinions, and other statements are mine alone and are not necessarily those of
the foundation. The participants of the NBER conference on Rational Expectations
and Time Series Analysis at the University of Minnesota in March 1973 provided
many useful comments. In particular, I wish to thank William Brainard, William Brock,
Edmund Phelps, and Karl Shell (without suggesting that they are, in any way, responsible
for the results).



the analogy to a perfect stock market can be made: a firm whose earnings
have grown period after period will not find its common stock changing
in price after the next earnings increase unless those earnings deviate
from the anticipated.

Unlike corporate earnings, the supply of fiat money represents virtually
no real resource. It can be manipulated willfully and costlessly, and
rules for the distribution of new nominal money balances can be chosen
arbitrarily by the fiat issuer. The current distribution rule in the United
States involves direct exchange of new money for real goods and services
and financial assets, but this is certainly not the only possible rule or
even the rule which is most frequently discussed by monetary theorists.
They generally suppose that the distribution rule is unimportant and
indeed has no bearing whatsoever on the pattern of commodity prices
observed in the economy.¹

In this paper, I hope to argue well for a contrary hypothesis: The
nominal money distribution rule is of crucial significance provided that
consumers behave by adjusting their current portfolios of money, assets,
and consumption on the basis of the anticipated future monetary sequence.
Although the rules to be discussed here are primitive, they do illustrate
the consequences, for monetary policy, of rational consumer behavior.
Hence, they may claim to have a prominence beyond their reality.

II. Asset, Money, and Commodity Demand

On the most disaggregated level, the individual consumer can be envisioned
as selecting a feasible consumption program over his lifetime.
In the initial period, his choice set contains probability distributions
only and does not include a guaranteed future level of consumption. The
time-honored method of converting this choice set into a set containing
recognizable economic quantities is to suppose he maximizes the expected
value of a utility function defined over current and future consumption
levels and then to relate such levels to an initial wealth endowment and
to quantities of assets selected currently with the purpose of storing wealth
for the future.

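Throughout the paper, all costs attendant to buying and selling assets
and differential tax effects will be neglected. Within these restrictions,
the consumer’s hope is to maximize $E[U(C_1, \tilde{C}_2, \ldots, \tilde{C}_n)]$
while remaining within the constraints imposed by

$$W - M_1 - p_1 A_1 - P_1 C_1 = 0, \tag{1}$$

$$\tilde{P}_j \tilde{C}_j - \tilde{Y}_j - \tilde{p}_j(\tilde{A}_{j-1} - \tilde{A}_j) - (\tilde{M}_{j-1} - \tilde{M}_j) = 0, \quad j = 2, \ldots, n-1, \tag{2}$$

$$\tilde{P}_n \tilde{C}_n - \tilde{Y}_n - \tilde{p}_n \tilde{A}_{n-1} - \tilde{M}_{n-1} = 0, \tag{3}$$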

¹ The quantity of nominal money balances is emphasized, of course. It is only the
distribution rule which is considered unimportant.




where $U$ = utility, $E$ = mathematical expectation, $C_j$ = goods consumed
in period $j$, $P_j$ = consumption goods price in period $j$, $A_j$ =
quantity of bonds (variable price asset) held from $j$ to $j + 1$, $p_j$ = bond
price in period $j$, $M_j$ = quantity of money² held from period $j$ to $j + 1$
(nominal balances), $W$ = initial wealth (human and nonhuman capital),³
$Y_j$ = labor income earned in period $j$ (this is assumed to be unrelated
functionally to asset and money holdings), $n$ = maximum possible life
span, and ~ denotes random variables as of the first (current) period.⁴
A solution to this problem requires the “stochastic calculus” of Kushner
(1965) because the constraints (2)-(3) contain stochastic elements.
Kushner’s technique has been discussed in detail in economic applications
by Long (1971), and the reader should refer to that paper or to
Kushner’s own for more information. Here, only the results (first-order
extremum conditions) for the solution to the above problem are given.
For the asset money, they are⁵

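$$E(\tilde{U}_1) = E[\tilde{U}_2/(1 + \tilde{I}_2)],$$

$$\tilde{U}_j = \tilde{U}_{j+1}/(1 + \tilde{I}_{j+1}), \quad j = 2, \ldots, n-1, \tag{4}$$

where subscripted $U$’s indicate partial derivatives of the utility function
and $I_j$ is the rate of goods price inflation between periods $j - 1$ and $j$,
$I_j = (P_j - P_{j-1})/P_{j-1}$.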

The optimum conditions (4) imply rather simple marginal trade-offs
between time preferences and commodity price changes. When one
realizes that $U_j/U_{j+1}$ has the units of a real interest rate (i.e., a commodity
interest rate), plus unity, the conditions are seen to be the monetary
asset’s equivalent to the Irving Fisher (1930) relation between real
interest rates, nominal interest rates, and the rate of inflation. (For
money, the nominal rate of interest is exactly zero.)
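
The correspondence with the Fisher relation can be written out in a few lines. The following is only an illustrative sketch under certainty (so the expectation operators and tildes in (4) drop out); the symbols $r_{j+1}$ for the real (commodity) interest rate and $i_{j+1}$ for the nominal interest rate are introduced here for illustration and are not part of the paper's notation.

```latex
\begin{align*}
1 + r_{j+1} &\equiv \frac{U_j}{U_{j+1}}
  && \text{(real rate implied by marginal utilities)} \\
1 + i_{j+1} &= (1 + r_{j+1})(1 + I_{j+1})
  && \text{(Fisher relation)} \\
i_{j+1} = 0 \;\Longrightarrow\; \frac{U_j}{U_{j+1}} &= \frac{1}{1 + I_{j+1}}
  && \text{(condition (4), the case of money)}
\end{align*}
```

Read this way, condition (4) is the Fisher relation with the nominal rate set to zero: the real return to holding money is simply the negative of the inflation it is exposed to.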

There should be as many different $U_j$’s and $I_j$’s in each period as
there are different consumption goods. By not denoting the quantities
in (4) like “$U_{i,j}$” and “$I_{i,j}$” for the marginal utility and inflation rate
corresponding to good $i$, I have assumed implicitly a single composite
consumption good. This has been done solely for notational ease. Explicitly
accounting for $k$ different available consumption goods in each
period would place $k$ equations in (4) for every one there now. This
would cause no special difficulty aside from the inevitable problem of
selecting the “correct” commodity price index for empirical work.

² Money is defined as assets whose nominal values are always fixed, e.g., currency,
coin, non-interest-paying demand deposits. Money is assumed to bring no nonpecuniary
utility.

³ For the purpose of this illustration, constraints on the nonmarketability of (human)
capital are ignored.

⁴ Stochastic prices and labor incomes are realized at the beginning of the period
denoted by their subscript. For example, $p_3$ is the actual bond price exactly at the start
of period 3, two full periods after the beginning of the process. Goods purchases and
consumption are assumed to occur during each period after prices and income are observed.
Repurchases of assets and money also are assumed to occur during each period
at the prices observed at its beginning.

⁵ The equations in (4) are obtained by combining several first-order conditions to
eliminate Lagrange multipliers. There are also conditions on asset holding which derive
from the maximization problem, but they will not be discussed in this paper.

For $j \geq 2$, note the relation in (4) of several stochastic quantities
whose equilibration is not under the investor’s current control. The
sense of these equations relates to consumption decisions upcoming in
future periods. As they must depend on previous outcomes, the elements
of these expressions are obviously stochastic now, but as the investor
will choose optimally for any outcomes whatever, the equalities themselves
are certain. They are, in fact, the “stochastic calculus” analog to
the dynamic programming assumption that all future decisions will be
made optimally conditional on the states that have occurred up until the
decision period.

The equations in (4) may seem to imply that uncertainty exists only
in period 1. This is not the case. At the end of period 1, asset and consumption
prices $p_2$ and $P_2$ are known and the consumer makes a second
round of choices which brings him to the following positions:

$$E(\tilde{U}_1) = E(\tilde{U}_2)/(1 + I_2),$$

$$E(\tilde{U}_2) = E[\tilde{U}_3/(1 + \tilde{I}_3)],$$

$$\tilde{U}_j = \tilde{U}_{j+1}/(1 + \tilde{I}_{j+1}), \quad j = 3, \ldots, n-1.$$

These differ from the initial first-order conditions because $I_2$ now is
nonstochastic; marginal utilities $\tilde{U}_1$ and $\tilde{U}_2$ are still stochastic even
though consumption has occurred because the marginal utility of consumption
in an early period may be a function of (stochastic) consumption
in later periods. For additive utility functions, however, the first
equation above would vanish because $U_1$ would not depend upon later
choices.

III. Simple Models of Nominal Money Supply 

Helicopters and Other Mints 

The lucidness of Friedman’s (1969) hypothetical economy recommends its
adoption in the following analysis. Friedman supposes a helicopter
bearing fiat money to circle over the economy dispensing (or collecting)
bills at given rates. No resources are necessary for these flights nor for
producing the added (or decremented) currency. Friedman’s purpose
was to discover the socially optimum frequency of flights, a topic which
depends a great deal on the value of money’s specific nonpecuniary
services.⁶ My purpose, however, is to examine the effect of such flights


⁶ See Meltzer (1970) for a clear enunciation of this point.




on current commodity prices (and thus on anticipated inflation rates),
and for this the nonpecuniary qualities of money simply get in the way,
adding extra terms to the derived equations but no extra substance to the
economics. Consequently, all nonpecuniary qualities of money except
its perfect nominal safety will be ignored here. Of course, Friedman’s
essay also discussed the process of inflation, but in its concentration on
the major issue of optimum money supply, it neglected the rules used by
the helicopter’s bombardier to distribute money among individuals.

Consider the following three mutually exclusive simple money supply
rules. New money is distributed (or collected) (a) proportionately to
money balances held in the first period;⁷ (b) proportionately to money
balances held at the beginning of each period; (c) randomly (and not
proportionately to money balances). These rules are so simple that we
can just append them to constraints in each individual’s demand problem,
to equations (1), (2), or (3) of the last section. With rule a, money is
distributed during all future periods in amounts proportional to $M_1$,
the quantity of money balances held at the beginning of period 1. The
individual still intends to maximize expected utility of consumption,
$E[U(C_1, \tilde{C}_2, \ldots, \tilde{C}_n)]$, and his first-period wealth constraint is unchanged.
But under the assumption of a growth in money at the rate $a$, distributed
in each future period proportionately to first-period holdings, his second-
and later-period constraints become⁸

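$$\tilde{P}_j \tilde{C}_j - \tilde{Y}_j - \tilde{p}_j(\tilde{A}_{j-1} - \tilde{A}_j) - (\tilde{M}_{j-1} - \tilde{M}_j) - M_1[e^{a(j-1)} - e^{a(j-2)}] = 0, \quad j = 2, \ldots, n-1, \tag{2a}$$

$$\tilde{P}_n \tilde{C}_n - \tilde{Y}_n - \tilde{p}_n \tilde{A}_{n-1} - \tilde{M}_{n-1} - M_1[e^{a(n-1)} - e^{a(n-2)}] = 0. \tag{3a}$$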
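
To make the rule a transfer schedule concrete, the sketch below tabulates the helicopter receipts of a holder of first-period balances $M_1$. It is an illustration only: the function name `rule_a_transfers` and the numerical values are hypothetical, and the sketch assumes the per-period increment $M_1[e^{a(j-1)} - e^{a(j-2)}]$ described in the text.

```python
import math


def rule_a_transfers(m1, a, n):
    """Helicopter transfers received in periods 2..n under rule a.

    Assumes (as in the text) that money grows at the continuous rate `a`
    per period and that each period's increment is handed out in
    proportion to first-period balances `m1`.
    """
    return {
        j: m1 * (math.exp(a * (j - 1)) - math.exp(a * (j - 2)))
        for j in range(2, n + 1)
    }


if __name__ == "__main__":
    # Hypothetical numbers chosen only for illustration.
    transfers = rule_a_transfers(m1=100.0, a=0.05, n=5)
    for period, amount in transfers.items():
        print(period, round(amount, 4))
    # The transfers telescope: their sum equals m1 * (exp(a * (n - 1)) - 1).
    print(round(sum(transfers.values()), 4))
```

Because the increments telescope, total receipts depend only on first-period balances and on the horizon, which is why the whole future stream can be capitalized into the value of period 1 money.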

A straightforward application of the stochastic calculus to this problem
will provide a different set of first-order extremum conditions than that

⁷ This is the rule used in Friedman’s simplest model (1969, p. 9).

⁸ Since the growth rate of money is $a$, we have

Beginning     Total Money                    Increment in Supply
of Period     Supply, $M^s_j$                from Last Period

1             $M^s_1 = M^s$
2             $M^s_2 = M^s e^{a}$            $M^s(e^{a} - 1)$
3             $M^s_3 = M^s e^{2a}$           $M^s(e^{2a} - e^{a})$
j             $M^s_j = M^s e^{a(j-1)}$       $M^s(e^{a(j-1)} - e^{a(j-2)})$

In period 2, the investor receives $M_1(e^{a} - 1)$ from the helicopter and he has held $M_1$
from period 1. Thus, his total money balance is $M_1 + M_1(e^{a} - 1) = M_1 e^{a}$ at the
beginning of period 2. He chooses to carry the amount $M_2$ over from period 2 to 3.
Thus, his money balance in period 3 is $M_2$ plus the increment he receives from the
helicopter, $M_1(e^{2a} - e^{a})$, which is, by the assumed rule, based on first-period balances,
etc., for later periods.





formerly obtained with a fixed money supply. Corresponding to equation
(4), which depicted the (fixed-supply) marginal trade-off between
commodity price inflation and time preference, one will now find

$$E(\tilde{U}_1) = e^{a(n-1)} E[\tilde{U}_2/(1 + \tilde{I}_2)],$$

$$\tilde{U}_j = \tilde{U}_{j+1}/(1 + \tilde{I}_{j+1}), \quad j > 1. \tag{4a}$$

For the first-period equation, (4a) indicates a rate of commodity price
inflation $n - 1$ times the rate of monetary expansion.⁹ For later periods,
however, the inflation rate is uninfluenced by money’s growth even
though the two events occur simultaneously. Given neutral time preference
by all consumers, these conditions actually imply a zero rate of
change in commodity prices between adjacent periods beyond the first
two, which implies that money supply rule a, linking all future distributions
to first-period balances, must have its entire effect during the first
period. The helicopter’s promised appearance sets off a clamor for period 1
money balances that only ceases when period 1 prices have been bid
down (i.e., money’s period 1 commodity price has been bid up), to the
point that an inflation rate of $e^{a(n-1)}$ is anticipated between periods 1
and 2. The inflation rate between the first two periods is $n - 1$ times the
rate of money expansion then because new money to be dispensed later
has been discounted in advance, thus raising even further the benefit
of holding money in period 1. Under rule a, money balances in period 2
are not linked in any way to future distributions of new money so there
is no incentive to increase the demand for them similar to the incentive
existing for period 1 balances.
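
The factor $e^{a(n-1)}$ can be motivated by a short calculation. What follows is a sketch under certainty and neutral time preference, offered as an illustration rather than as the stochastic-calculus derivation on which the paper relies. One unit of money held in period 1 entitles its holder to every later helicopter increment, and those increments telescope:

```latex
\begin{align*}
1 + \sum_{j=2}^{n}\left[e^{a(j-1)} - e^{a(j-2)}\right] &= e^{a(n-1)}, \\
U_1 = U_2 \;\Longrightarrow\; 1 + I_2 = \frac{P_2}{P_1} &= e^{a(n-1)},
\qquad I_j = 0 \quad (j > 2).
\end{align*}
```

Taking logarithms gives $\ln(P_2/P_1) = a(n-1)$, that is, $n - 1$ times the continuous rate of monetary expansion. Since prices are constant after period 2 in this case, each later unit of money is worth the same in goods as a period 2 unit, which is what allows the increments simply to be added.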

Since this conclusion is at variance with Friedman’s,¹⁰ it should be
worth considering the assumptive differences which must have been
responsible. I believe they can be reduced to a single item: Friedman’s
individuals are assumed to be either myopic or irrational. His “individual
is not able to affect the amount of additional cash he receives
by altering the amount of cash he holds” (in the first or in any other
period) (1969, p. 9). With this presumption, there is no reason for an


⁹ With this rule and with the others to follow, the individual first-order optimum
relations sometimes are discussed as if they were market equilibrium conditions. This is
strictly valid only if all investors have common subjective probability beliefs, i.e.,
homogeneous expectations. In that case, ratios of marginal utilities between adjacent periods,
$\tilde{U}_j/\tilde{U}_{j+1}$, must be the same for all investors in equilibrium because individual optimum
conditions such as (4a) bind the marginal utility ratios to (commonly held) assessments
of future inflation and future money growth. With nonhomogeneous expectations,
expressions with the same form as (4a) would be in effect as market-clearing relations,
but the $\tilde{U}_j$’s would have to be interpreted as complex weighted averages of individual
marginal utilities.

¹⁰ He concludes that a continual monetary expansion or contraction at the rate $e^{a}$
will bring, after initial short-term adjustments, a continual inflation or deflation at the
same rate.




individual to forecast future helicopter flights or to perform any portfolio
adjustments that would ordinarily bring gains in expected utility. If
the individual were not so restricted, he would find it advantageous to
increase money balances in period 1 as I have assumed he actually does.
It must be admitted, however, that the hypothetical individual of this
paper is on the opposite extreme of the myopia spectrum from Friedman’s
individual; here, he is presumed to realize the helicopter is coming and
to understand its distribution rule perfectly.

Given the monetarily prescient consumer, a most interesting feature of
rule b, which distributes new money proportionately to balances held
at the beginning of each period, is an equivalent market phenomenon
to that Friedman has found using rule a and a myopic or irrational
individual. To prove this, notice that constraints (2) and (3) of the basic
maximization problem have different forms under rule b. Money is
now distributed at a growth rate $a$ to holders of balances at the beginning
of each period so that notationally, the constraints for future periods are

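$$\tilde{P}_j \tilde{C}_j - \tilde{Y}_j - \tilde{p}_j(\tilde{A}_{j-1} - \tilde{A}_j) - (\tilde{M}_{j-1} e^{a} - \tilde{M}_j) = 0, \quad j = 2, \ldots, n-1, \tag{2b}$$

which is the same as under rule a only for period $j = 2$, and

$$\tilde{P}_n \tilde{C}_n - \tilde{Y}_n - \tilde{p}_n \tilde{A}_{n-1} - \tilde{M}_{n-1} e^{a} = 0. \tag{3b}$$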

After performing a few stochastic calculus operations, one now will find
the process of inflation described by

$$E(\tilde{U}_1) = e^{a} E[\tilde{U}_2/(1 + \tilde{I}_2)],$$

$$\tilde{U}_j = e^{a} \tilde{U}_{j+1}/(1 + \tilde{I}_{j+1}), \quad j = 2, \ldots, n-1, \tag{4b}$$

which, under certainty and with neutral time preference, would imply an
inflation rate equal to the money growth rate in every period.¹¹
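
The contrast between rules a and b can be tabulated numerically. The sketch below is illustrative only: it assumes certainty and neutral time preference, takes the first-period price level as given, and uses hypothetical function names and parameter values that do not appear in the paper.

```python
import math


def price_path_rule_a(p1, a, n):
    """Price levels P_1..P_n under rule a (certainty, neutral time preference).

    All anticipated money growth is discounted into the first step, so
    P_2 / P_1 = exp(a * (n - 1)) and prices are flat afterwards.
    """
    p2 = p1 * math.exp(a * (n - 1))
    return [p1] + [p2] * (n - 1)


def price_path_rule_b(p1, a, n):
    """Price levels P_1..P_n under rule b: inflation equals money growth."""
    return [p1 * math.exp(a * (j - 1)) for j in range(1, n + 1)]


def inflation_rates(prices):
    """Period-to-period inflation I_j = P_j / P_{j-1} - 1."""
    return [later / earlier - 1.0 for earlier, later in zip(prices, prices[1:])]


if __name__ == "__main__":
    a, n = 0.05, 5  # hypothetical growth rate and horizon
    print(inflation_rates(price_path_rule_a(1.0, a, n)))
    print(inflation_rates(price_path_rule_b(1.0, a, n)))
```

In this sketch the two paths differ only in timing: rule a concentrates the whole adjustment between periods 1 and 2, while rule b spreads it evenly at the money growth rate.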

Now consider the third rule, c, whereby new fiat money is distributed
without regard to initial money balances. Suppose $\tilde{M}'_j$ is the (random)
number of bills received by an individual during period $j$. If these are
truly distributed at random without hope of being affected through
resource expenditures in earlier periods, future constraints on the


¹¹ Under rule b it is also easy to confirm the Fisherian relation between nominal
interest rates, real (commodity) interest rates, and the anticipated rate of inflation. The
cause of an inflation premium in the nominal interest rate is quite removed from Fisher’s
(1930) explanation, however. The helicopter distributes bills to money holders only
and the inflation premium is a penalty demanded by bondholders to compensate for
their not benefiting by the helicopter’s munificence. Since the holder of a quantity $M_j$
of money at the beginning of period $j$ receives $M_j(e^{a} - 1)$ from the helicopter during $j$,
he could only be induced to hold bonds if they were expected to appreciate in nominal
value by at least the same relative amount.




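individual’s maximization problem are altered to become

$$\tilde{P}_j \tilde{C}_j - \tilde{Y}_j - \tilde{p}_j(\tilde{A}_{j-1} - \tilde{A}_j) - (\tilde{M}_{j-1} + \tilde{M}'_j - \tilde{M}_j) = 0, \quad j = 2, \ldots, n-1, \tag{2c}$$

$$\tilde{P}_n \tilde{C}_n - \tilde{Y}_n - \tilde{p}_n \tilde{A}_{n-1} - (\tilde{M}_{n-1} + \tilde{M}'_n) = 0. \tag{3c}$$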

The first-order extremum conditions are exactly the same as those of
Section II where the money supply was fixed. This is because $\tilde{M}'_j$ is
not a decision variable. It is not under the individual’s control and thus
cannot be chosen to achieve an optimum asset portfolio or an optimum
consumption sequence. Of course, this does not imply a total lack of
effect by $\tilde{M}'_j$ on sequences of asset and commodity prices. The basic
marginal condition relating to inflation rates will still be equations (4),
but these relations arise by equating certain random Lagrange multipliers
that will be altered by the additions of more money in the future.¹²
In fact, one may definitely infer a change in marginal time preference
between the equilibrium conditions under absolutely fixed money supply
and under rule c variation in supply. If investors anticipate future random
money increases, they will attempt to decrease current money balances
by exchanging them for assets or commodities. This will have at least two
effects: It will increase current commodity prices, $P_1$, and it will decrease
the first marginal utility, $U_1$, to the extent that risk averters use some
newly purchased commodities for period 1 consumption.¹³ Starting
from a position of neutral time preference ($U_1 = U_2$), which implies
zero inflation,¹⁴ equilibrium first-order conditions will move to a position


¹² Specifically, the first-order conditions responsible for equations (4), with, for
example, $n = 2$, are

First-Order Condition                        For Decision Variable

$E(\tilde{U}_1 - \tilde{J}_1 P_1) = 0$       $C_1$
$E(\tilde{J}_1 + \tilde{J}_2) = 0$           $M_1$
$\tilde{U}_2 + \tilde{J}_2 \tilde{P}_2 = 0$  $C_2$

where the $J$’s are random Lagrange multipliers. $J_1$ corresponds to the wealth constraint
for period 1, $W = P_1 C_1 + M_1 + p_1 A_1$, and $J_2$ to the constraint for period 2,
$P_2 C_2 = M_1 + p_2 A_1 + M'_2 + Y_2$. The constraints are, of course, first-order conditions with
respect to $J_1$ and $J_2$. Since period 2’s constraint is altered by the addition of $M'_2$, its
associated Lagrangian, $J_2$, will also be altered, and since $(U_2/P_2) = -J_2$, the first-order
condition for $C_1$, and all other first-order conditions involving $J_2$, will also be
changed.

¹³ This will be mitigated somewhat by consumers carrying over more commodity
inventory to future periods.

¹⁴ It should be emphasized again that the implication of zero inflation (brought about
by an assumption of neutral time preference) occurs only if money’s nonpecuniary benefits
are ignored.








of positive inflation, $P_2 > P_1$, and negative marginal time preference,
$U_2 > U_1$, such that equations (4) still hold. Note again the effect of
advanced discounting: future money issues change current prices,
$P_1$, as well as future prices, $P_2$, even though, in this case, the extent of
future price changes will be larger than the current price change.

Under a random distribution rule, therefore, the inflation sequence
could possibly match the sequence under rule b: a continual price change
at the rate of monetary expansion. This is not necessary, however,
because consumers have the freedom to purchase and hold assets and
commodity inventories rather than money. By purchasing inventories
initially, for example, just when the helicopter flights and random
mintings are first announced, consumers would cause a much
larger price change initially than the monetary expansion in period 1.
To offset this, later inflation would have to evolve at a rate lower
than the rate of money growth. If asset markets arose to offer
current claims on future receipts from helicopter flights, individuals
would be able to incorporate titles to random future money issues directly
into their initial portfolios. Such money “futures” contracts would have
the same effect on consumers’ decisions as would period 1 money balances
under distribution rule a; both assets bring future money mintings under
the consumer’s ownership.


Consideration of Realism 

The helicopter mint is a useful pedagogic device, but it bears little
resemblance to the actual method of money issuance currently in vogue.
In fact, the present issuer uses a sizable fraction of new fiat money to
buy helicopters and pilots (and other real goods and services) with the
purpose of dispensing objects more lethal than paper bills. The acquisition
of real commodities through fiat money issuance complicates life for the
economist as well as for the citizen because it necessitates the consideration
of commodity production. A realistic money supply rule just cannot
be appended to the individual’s first-order demand conditions. Rather,
it must encompass the rules used by government to select commodities
for purchase with newly minted money and the rules used to determine
how those commodities will be reallocated among individuals.

In the simple world where fiat money was distributed directly to
individuals, a dynamic equilibrium between the rate of monetary expansion
and the rate of inflation was found when individuals received
money or were relieved of money in proportion to their just previous
nominal money balances. This result was brought about by rational and
accurate forecasters who discounted in advance the future production of
new money and caused current balances to fluctuate in value with



changes in their explicit claims to future money issues. The fact of government
dispensing new money indirectly, by buying and distributing
commodities, does not logically require a change in the presumption of
rationality and in the forecasting ability of individuals.¹⁵ Thus, if
government-purchased commodities are redistributed to individuals in proportion
to their former nominal money balances, and if those commodities
are marketable (transferable), advanced discounting of future receipts
of goods would increase the current demand for nominal money balances.
For example, a situation wherein commodities purchased by the government
in each period are distributed proportionately to money balances
held at the beginning of the first period is effectively the same as helicopter
rule a and will bring the same inflation sequence, namely, a very
large inflation in the first period and zero inflation thereafter.

Again speaking of things as they actually are, the government neither
distributes new money nor taxes old balances proportionately. It may
indeed distribute goods and services roughly in relation to wealth, but
wealth is composed of many other assets besides money, and for an
exact simultaneous relation to exist between money and inflation the
link of new money issues to old balances must be exactly proportional.

Little additional time need be devoted to the possibility of private
asset supply. Since private supplies are equivalent in effect to short
sales, they have been implicitly accommodated on the demand side and
thus will alter no conclusions by being explicitly considered. If legal
restrictions on privately validated, non-interest-bearing currency were
to be removed, their issuers would assume the status of this paper’s
government, and the time sequences of that currency’s commodity
value would be determined by the identical mechanism. Interest rates
contracted in that currency would be governed similarly.

Finally, we must hedge the present analysis against possible real
effects of money. If a position of underemployment (short-run) equilibrium
can be alleviated by a monetary expansion, the link between nominal
money supply and commodity price level becomes much more tenuous.
To put it simply, in a condition of unemployment, newly minted money
could be used to coax production from idle capital and labor; at least
this seems to be the prevailing opinion among macroeconomists. If
this is indeed true, then no inflation tax would inure to money holders,
and the resulting rates of inflation would be lower than the rates implied
earlier in this paper regardless of the money distribution rule.


¹⁵ However, it most definitely changes the achievable level of utility of each individual
since it alters the productive mix of commodities by favoring those the government
happens to choose for its reallocation endeavors. Naturally, I have neglected many
important matters here such as the production of some goods, say nuclear weapons,
which because of difficulty in charging benefits, external effects, nontransferability, etc.,
would not have existed without the government’s demand for them.




References 

Fisher, Irving. The Theory of Interest. New York: Macmillan, 1930.
Friedman, Milton. “The Optimum Quantity of Money.” In The Optimum
Quantity of Money and Other Essays. Chicago: Aldine, 1969.
Kushner, Harold J. “On Stochastic Extremum Problems: Calculus.” J. Math.
Analysis and Appl. 10 (April 1965): 354-67.
Long, John B., Jr. “Consumption-Investment Decisions and Equilibrium in the
Securities Market.” In Studies in the Theory of Capital Markets, edited by Michael
C. Jensen. New York: Praeger, 1971.
Meltzer, Allan H. “Discussion.” J. Finance 25 (May 1970): 450-53.



The Honest Government's Guide to the 
Revenue from the Creation of Money 


Leonardo Auernheimer

Texas A & M University


The analysis of the revenue from the creation of fiat money is usually
carried out in terms of comparative statics. Given the value of relevant
parameters, the rate of inflation which maximizes revenue is characterized
as the one yielding the highest stream of revenue once all adjustments
have been completed: either some rate of monetary expansion is
set, and the results corresponding to the long-run values of the variables
are analyzed (Friedman 1971), or the inflation rate is fixed at the outset,
adjustments taken to be instantaneous, but assuming away any
effect of the once-and-for-all changes in real cash balances on the variable
to be maximized (revenues), that is, assuming that those changes are
met by once-and-for-all changes in the price level (Bailey 1956). Of
course, both procedures amount to the same thing.

This is, indeed, the correct analysis in processes in which the transition
period can be assumed away because the long-run value of the variable
to be maximized is reached asymptotically from the very beginning, so
that the highest stream at any point is also the stream with the highest
present value at the initial point. This is not so in the case of the revenue
from the creation of money.

The purpose of this paper is to show, first, how explicit consideration
of revenues accruing during the adjustment period (or “at” the adjustment,
when it occurs instantaneously) is essential for the understanding
of the essence of money as a capital good, and second, how for a particular
adjustment behavior (instantaneous adjustment) taking due account
of once-and-for-all changes produces results very different from those
reported in the literature. In general, lower “revenue-maximizing”
rates of inflation are called for, and in particular, those rates are independent
of the rate of growth of the economy. For expository purposes, we
first proceed to demonstrate the second point.

This paper was written at the University of Chicago, and it is based on a chapter of
my dissertation. I am thankful to S. Fischer, R. Gordon, C. A. Rodriguez, L. Sjaastad,
and R. Zecher, and in particular to M. Friedman, who asked the right question. A pitiless
comment by H. G. Johnson and a set of careful suggestions by an anonymous referee
forced me to restate the argument in what I hope is a clearer fashion. After this paper was
submitted for publication, an unpublished paper by Charles D. Cathcart was brought
to my attention, in which some of the points made here are independently developed by
Cathcart.


I. The Assumptions

The explicit consideration of transitional effects makes it necessary to specify the assumptions very precisely. For the particular case we examine, they are: (a) When some rate of inflation ($\pi$) is announced, it is believed by the public and “implemented” immediately by the government. (b) The adjustment of actual to desired cash balances is so quick that it is proper to treat it as being instantaneous. (c) No once-and-for-all changes in the price level are allowed, that is, when a “lower than the preexisting” rate of inflation is announced, the government steps into the market and “sells” the additional real balances desired by the public. Likewise, when the rate of inflation is increased, it is prepared to “repurchase” the excess of existing over desired real balances. This means that no once-and-for-all “subsidies” or “capital levies” are imposed upon the public.

Assumptions (a) and (b) are the same as the ones used, for example, by Bailey (1956). Assumption (c) is critical, and requires some elaboration.

Assumption (c) does not present any difficulty for the case where some lower (than the preexisting) rate of inflation is announced, and the government reaps the once-and-for-all increase in total wealth; there is no reason why it should not do so, interested as it is in maximizing revenue. The question is, Why should government proceed in a symmetric fashion when a higher rate of inflation is announced, that is, intervene in the market and sell goods so that no once-and-for-all change in the price level occurs?

There are at least two reasons for assuming so. In the first place, a “once-and-for-all” change in the price level is a shorthand expression for a very short period of high inflation; strictly, there is some “cheating” on the part of the government in allowing that to happen. Here is where the adjective “honest” applies. The same assumption is used by Foley and Sidrauski (1971) in analyzing the effect of changes in the rate of inflation, in the context of their model.¹ The second reason is that, if the symmetry is not imposed, the model becomes highly unstable when the other two assumptions are maintained. The reasons for this are explained in Section IV below.

¹ “The change in the target rate of inflation is reflected by a change in the slope of the (semilogarithmic) $p_m$ line at time $T$, but there is no jump in $p_m$ at time $T$. If, when the government announces a change in the rate of inflation, it does not want to disappoint private expectations about the course that $p_m$ will follow, then no jump in $p_m$ can be allowed at $t = T$. Otherwise, the rate of change of $p_m$ at $t = T$ would be infinity” (Foley and Sidrauski 1971, p. 190).





Finally, it is worthwhile to mention that at least assumption (c) acquires full empirical relevance in cases where the preexisting rate of inflation is higher than the “optimum” rate, as described below.


II. The Analysis 


In what follows, for simplicity, I will use a Cagan-type function of the demand for money and assume constant income elasticity (not necessarily equal to unity). I also assume, as is usually done, that government is the only issuer of money.

Total real revenues per (infinitely small) period are defined as

$$R_t = \frac{dM}{dt}\,\frac{1}{P} = \mu\,\frac{M}{P} = \mu m_t, \tag{1}$$

where $\mu$ is the (proportional) rate of monetary expansion, and $m$ is real cash balances.

By differentiating the identity $m = M/P$,

$$R_t = \mu m_t = m_t \pi_t + \dot{m}_t,$$

where $\pi_t$ is the rate of inflation and a dot over the variable denotes the time derivative.

If the desired stock of money (identically equal to the actual stock) is

$$m_t = A e^{\gamma t - b(r + \pi_t)},$$

where $\gamma$ = a “growth factor” (equal to $n + \varepsilon\eta$, for $n$ = rate of population growth, $\varepsilon$ = rate of growth of per capita income, $\eta$ = income elasticity of demand), $r$ = “the” real interest rate, and $A, b > 0$ are parameters, then $\dot{m}_t = \gamma m_t - b m_t \dot{\pi}_t$. If adjustments are instantaneous, $\dot{\pi} = 0$ after $t = 0$, so that the flow of revenue after $t = 0$ becomes $R_t = m_t \pi + \dot{m}_t = m_t(\pi + \gamma)$. Assuming that the future is discounted by government at the prevailing rate of interest $r$, at any time $t = 0$ the present value of the revenue from the creation of money, for some constant rate of inflation, is given by

$$V_{t=0} = \int_{t=0}^{\infty} m_t(\pi + \gamma)e^{-rt}\,dt + (m_{t=0^+} - m_{t=0}), \tag{3}$$

where $m_{t=0}$ are cash balances at the preexisting rate of inflation, $\pi_0$, and $m_{t=0^+}$, cash balances corresponding to the new rate of inflation, $\pi$, that is, just after the change.

The first term in expression (3) is the present value of the flow of revenue obtained at each period after $t = 0$; this is the term usually considered in the literature. The second part is the once-and-for-all “sale” (positive or negative, according to whether $m_{t=0^+} \gtrless m_{t=0}$, that is, $\pi \lessgtr \pi_0$) of real cash balances which takes place at $t = 0$.



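Given the assumed form of the demand for money, equation (3) becomes

$$V_{t=0} = A e^{-b(r+\pi)}(\pi + \gamma)\int_{t=0}^{\infty} e^{(\gamma - r)t}\,dt + A e^{-br}\left(e^{-b\pi} - e^{-b\pi_0}\right). \tag{4}$$

Assuming $r > \gamma$, so that the integral converges to some finite value,²

$$V_{t=0} = \frac{A e^{-b(r+\pi)}(\pi + \gamma)}{r - \gamma} + A e^{-br}\left(e^{-b\pi} - e^{-b\pi_0}\right). \tag{5}$$

Differentiating with respect to the rate of inflation,

$$\frac{dV}{d\pi} = A e^{-b(r+\pi)}\left(\frac{b\pi}{\gamma - r} + \frac{b\gamma}{\gamma - r} - \frac{1}{\gamma - r} - b\right),$$

and equating to zero,

$$1 - b\gamma - b\pi + b(\gamma - r) = 0,$$

$$\pi = \frac{1}{b} - r. \tag{6}$$

That is, the revenue-maximizing rate is independent of the rate of growth of the economy.³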

From expression (6) it follows that, to the extent that the basic assumptions are met, the usual analysis overestimates the “revenue-maximizing” rate (and, consequently, errs also in the calculation of magnitudes such as maximum revenues, ratio of maximum revenues to total income, etc.). The interested reader can easily calculate the extent of the overestimation for particular values of the parameters.
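The calculation left to the reader can be sketched numerically. The snippet below is only an illustration under assumed parameter values (A, b, r, the preexisting rate and the search grid are arbitrary choices, not taken from the paper). It evaluates the closed-form present value on a grid, locates its maximum, and compares it with the rate that maximizes the discounted flow term alone, which on differentiating $e^{-b\pi}(\pi + \gamma)$ is $1/b - \gamma$.

```python
import math

def flow_only(pi, A, b, r, gamma):
    """Discounted flow term alone (the term usually considered in the literature)."""
    return A * math.exp(-b * (r + pi)) * (pi + gamma) / (r - gamma)

def present_value(pi, pi0, A, b, r, gamma):
    """Discounted flow plus the once-and-for-all sale of real balances at t = 0."""
    sale = A * math.exp(-b * r) * (math.exp(-b * pi) - math.exp(-b * pi0))
    return flow_only(pi, A, b, r, gamma) + sale

def argmax_on_grid(f, lo, hi, steps=50_000):
    """Crude grid search; adequate because both objectives are single-peaked."""
    best_x, best_v = lo, f(lo)
    for k in range(1, steps + 1):
        x = lo + (hi - lo) * k / steps
        v = f(x)
        if v > best_v:
            best_x, best_v = x, v
    return best_x

# Assumed, purely illustrative parameter values.
A, b, r, pi0 = 1.0, 4.0, 0.05, 0.10

def rates(gamma):
    """Return (rate maximizing the full present value, rate maximizing the flow only)."""
    full = argmax_on_grid(lambda p: present_value(p, pi0, A, b, r, gamma), -0.04, 1.0)
    usual = argmax_on_grid(lambda p: flow_only(p, A, b, r, gamma), -0.04, 1.0)
    return full, usual

for gamma in (0.0, 0.02, 0.04):
    full, usual = rates(gamma)
    print(f"gamma={gamma:.2f} full={full:.4f} usual={usual:.4f} gap={usual - full:.4f}")
```

Under these assumptions the gap between the two rates should come out close to $r - \gamma$, which is one way to read the statement that the overestimation is larger the lower the growth factor.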

The overestimation is larger the lower the value of the growth factor, $\gamma$. This result is consistent with a particular interpretation of the approach taken in this paper. In the usual analysis, the concern is with the revenue derived from keeping the new desired real balances constant, and from providing future “generations” with the same balances,⁴ but not from taking charge of the difference in the real stock desired by existing people at $t = 0$. In other words, we consider existing people just like future “generations.”⁵ The larger the growth factor, $\gamma$, the smaller the weight of instantaneous “sales” in “total new sales” as a source of revenue (as opposed to the revenue derived from a tax on existing balances).

² The case when $r < \gamma$ confronts us with the same situation where an individual owns a stream of income which grows at a constant rate, larger than the rate of interest of the economy. Such an individual is infinitely wealthy, provided he can borrow ad infinitum. If he tries to consume ad infinitum, eventually the rate of interest will rise. The relationship between the rate of growth and the rate of interest in the steady state has been ignored here.

³ An alternative proof, along the lines of the calculus of variations, is immediate.

⁴ Unless income per capita does not grow, so that the “growth factor” $\gamma$ reflects only population growth, the term “generation” has a special meaning in this particular interpretation. It refers to some combination of “new bodies” and “new purchasing power.”
From expression (5), a geometric interpretation is straightforward. At any instant $t = 0$, the difference between the present value of future total revenue (at the preexisting rate $\pi_0$) and the present value of the revenue from some alternative rate, $\pi$, can be expressed as

$$\Delta V_{t=0} = (m_{t=0^+} - m_{t=0}) + \frac{m_{t=0^+}(\gamma + \pi)}{r - \gamma} - \frac{m_{t=0}(\gamma + \pi_0)}{r - \gamma}. \tag{7}$$

The first term in expression (7) is the once-and-for-all gain (loss) from changing to a lower (higher) rate of inflation; the second and third, the present values of the two alternative streams. Collecting terms over $r - \gamma$ and writing $\Delta m = m_{t=0^+} - m_{t=0}$ and $\Delta\pi = \pi - \pi_0$, this can be reduced to

$$\Delta V_{t=0} = \frac{1}{r - \gamma}\left[m_{t=0^+}(r + \pi) - m_{t=0}(r + \pi_0)\right] = \frac{1}{r - \gamma}\left[(r + \pi)(\Delta m) + (\Delta\pi)(m_{t=0})\right], \tag{8}$$

where $(r + \pi)(\Delta m)$ = (area a + area b) in figure 1, and $(m_{t=0})(\Delta\pi)$ = −(area c). The increment in the total present value will be zero when area (a + b) and area c are equal, independently of the value of $\gamma$. The analysis can be carried out at any point in time, that is, for any position of the demand curve L-L.

Notice that this can also be expressed as a flow, by multiplying the total marginal present value (8) by the rate of interest; in particular, when $\gamma = 0$ it becomes

$$r(\Delta V_{t=0}) = (r + \pi)(\Delta m) + m_{t=0}(\Delta\pi).$$

Observe also that these results are the same as what would be obtained when revenue per period is taken to be equal to $m(r + \pi)$; the meaning of this will become clear in Section III.
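The reduction from the three-term difference of present values to the compact bracketed form can be checked numerically. The sketch below assumes the semilogarithmic demand used in Section II, evaluated at $t = 0$, and arbitrary illustrative parameter values; the function names are hypothetical.

```python
import math

def money_demand(pi, A, b, r):
    """Real balances at t = 0 for a constant inflation rate: m = A * exp(-b * (r + pi))."""
    return A * math.exp(-b * (r + pi))

def delta_v_three_terms(pi0, pi, A, b, r, gamma):
    """Once-and-for-all sale plus the difference between the two discounted streams."""
    m0, m1 = money_demand(pi0, A, b, r), money_demand(pi, A, b, r)
    return (m1 - m0) + m1 * (gamma + pi) / (r - gamma) - m0 * (gamma + pi0) / (r - gamma)

def delta_v_compact(pi0, pi, A, b, r, gamma):
    """Compact form: [(r + pi) * dm + dpi * m0] / (r - gamma)."""
    m0, m1 = money_demand(pi0, A, b, r), money_demand(pi, A, b, r)
    dm, dpi = m1 - m0, pi - pi0
    return ((r + pi) * dm + dpi * m0) / (r - gamma)

# Assumed illustrative values; with b = 4 and r = 0.05 the optimum 1/b - r is 0.20.
A, b, r, gamma = 1.0, 4.0, 0.05, 0.02
cases = [(0.30, 0.20), (0.10, 0.20), (0.20, 0.25)]
for pi0, pi in cases:
    print(pi0, pi,
          delta_v_three_terms(pi0, pi, A, b, r, gamma),
          delta_v_compact(pi0, pi, A, b, r, gamma))
```

With these values, moving toward 0.20 from either side should show a positive increment and moving away from it a negative one, whatever value is chosen for the growth factor (as long as it stays below r).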


⁵ I am indebted to Stanley Fischer for this interpretation.

III. A Digression on the Economics of the Supply of Capital Goods

In a recent paper (Friedman 1971), Friedman asks the question:



It has always seemed natural to assimilate a government monopoly of fiat money issue with a private monopoly of a product produced at zero cost: Cournot’s mineral spring. Yet we have just demonstrated that this is not correct. The owner of the spring will maximize his proceeds by charging a price at which the price elasticity of demand is unity. And this will be true whether the economy is stationary or growing. Why is this not also true for the government which can issue money at zero marginal cost? Why will it not also maximize its revenue by pricing at the unit-elasticity point on its demand curve, regardless of whether there is growth or not?

The answer is because there are two different prices that are relevant to money issue: one, the goods and services that are given up to acquire a dollar; the other, the number of cents per dollar per year that the money holder must spend to keep his real balances constant. Only the latter is analogous to the price of the mineral water. [Friedman 1971, pp. 852-53]

Then, he suggests an analogy with the “monopolistic producer and servicer of durable equipment who sells the equipment but charges an annual fee for servicing it.”

Our analysis points toward a somewhat different interpretation. The correct analogy seems to be with Friedman’s producer and servicer of durable equipment, provided that (1) the equipment has a fixed sale price (equal to unity, in our money case), (2) the producer is committed to “repurchase” the equipment at that fixed price. An alternative example, amounting to the same thing, is the case of a monopolist renting a capital good (say, telephones), at an explicit rental per unit per year ($\pi$), and charging a fixed and refundable deposit. At any period, total revenues are made up of the explicit rental on the existing stock ($m\pi$ in our money case), and of the net receipts of deposit funds ($\dot{m}$ in our case). The analogy in this case is exact, and we need only to introduce assumption (c) (Section I above) when adjustments are instantaneous. In both cases (telephones and money), what is bought by the public is services, and the total price paid for those services is made up of the “explicit rental,” $\pi$, plus the “implicit rental,” the real interest rate.⁶ The correct maximization procedure, then, is with respect to the total paid for the services sold; this yields a result identical to that obtained from the production and sale of any other product, that is, independent of the rate of growth of the economy.
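The rental interpretation can be illustrated with a small sketch. It assumes the semilogarithmic demand of Section II written in terms of the total rental $p = r + \pi$, sets growth aside, and uses arbitrary parameter values; the function names are hypothetical. Maximizing revenue per period $p \cdot m(p)$ should land on the unit-elasticity point $p = 1/b$, which corresponds to an explicit rental of $1/b - r$.

```python
import math

def services_demand(total_price, A, b):
    """Stock demanded as a function of the total rental p = r + pi."""
    return A * math.exp(-b * total_price)

def elasticity(total_price, b):
    """Price elasticity of demand, -d ln m / d ln p, equal to b * p for this demand."""
    return b * total_price

def best_total_price(A, b, lo=1e-4, hi=2.0, steps=100_000):
    """Grid search for the total rental that maximizes p * m(p)."""
    best_p, best_rev = lo, lo * services_demand(lo, A, b)
    for k in range(1, steps + 1):
        p = lo + (hi - lo) * k / steps
        rev = p * services_demand(p, A, b)
        if rev > best_rev:
            best_p, best_rev = p, rev
    return best_p

# Assumed illustrative values.
A, b, r = 1.0, 4.0, 0.05
p_star = best_total_price(A, b)
pi_star = p_star - r          # explicit rental implied by the optimal total rental
print(p_star, elasticity(p_star, b), pi_star)
```

The growth factor does not enter the calculation at all, which is the sense in which the result is the same as for any other product.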

IV. The Consequences of Relaxing Some Assumptions

The analysis of Section II is limited by the extreme restrictiveness of assumptions (a)-(c), discussed in Section I. We proceed now to show the extent to which the relaxation of each of them would modify the results.

1. If assumptions (a) and (b) are maintained, and assumption (c) is not imposed, the model is unstable. Consider a situation where, for each discrete change in the rate of inflation, the government is allowed to bear (reap) or not the once-and-for-all losses (gains). If so, the optimal behavior will be to reap the gains when $\pi$ is lowered, and let the public bear the losses when $\pi$ is increased. Take, as an initial position, $\pi = (1/b) - r$ (assuming no growth, for simplicity); now, it will be profitable to switch to the higher rate of inflation $\pi = 1/b$, to which a higher stream is attached, since there will be no once-and-for-all loss to (more than) compensate for the gain in the flow (i.e., only the first term in expression [3] is relevant). If, instead, $\pi = 1/b$ is taken as the initial position, then it is profitable to change to a lower rate, $\pi = (1/b) - r$, as the once-and-for-all gain more than compensates for the loss in the present value of the future stream (i.e., now, both terms in expression [3] are relevant). The result is that there will always be incentives for “jumping” from any of the two rates of inflation to the other. Obviously, we cannot expect the other two assumptions to survive under these circumstances.

⁶ The theory of the optimum quantity of money makes it clear that this last component is a cost for money holders; the literature on the revenue from the creation of money does not, in general, see it as a “revenue” for the money issuer. For an exception, see Johnson (1969a, 1969b).

⁷ The question is somewhat similar to the one arising in the case of a monopolist selling some capital good, where the (exponential) rate of depreciation can be controlled by the producer. In a recent paper (1970), Swan has shown that some erroneous conclusions in the literature were derived precisely because of ignoring once-and-for-all changes in the existing stock, changes which, as a purely mathematical matter, cannot be captured by static maximization procedures when capital goods and their price (instead of services and rentals) are considered.
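The incentive to “jump” in both directions can be made concrete with a small sketch. It assumes no growth, the semilogarithmic demand of Section II, and arbitrary parameter values; the function names are hypothetical. When the rate is raised the government is taken to bear no capital loss, so only the discounted flow changes; when the rate is lowered it also collects the once-and-for-all sale of real balances.

```python
import math

def m(pi, A, b, r):
    """Real balances demanded at a constant inflation rate pi (no growth)."""
    return A * math.exp(-b * (r + pi))

def flow_pv(pi, A, b, r):
    """Present value of the flow m * pi, discounted at r, with no growth."""
    return m(pi, A, b, r) * pi / r

def gain_from_raising(pi_from, pi_to, A, b, r):
    """The public bears the capital loss, so only the discounted flow changes."""
    return flow_pv(pi_to, A, b, r) - flow_pv(pi_from, A, b, r)

def gain_from_lowering(pi_from, pi_to, A, b, r):
    """The government reaps the once-and-for-all sale in addition to the flow change."""
    sale = m(pi_to, A, b, r) - m(pi_from, A, b, r)
    return sale + flow_pv(pi_to, A, b, r) - flow_pv(pi_from, A, b, r)

# Assumed illustrative values.
A, b, r = 1.0, 4.0, 0.05
pi_low, pi_high = 1.0 / b - r, 1.0 / b
up = gain_from_raising(pi_low, pi_high, A, b, r)
down = gain_from_lowering(pi_high, pi_low, A, b, r)
print(up, down)
```

Both gains come out positive under these assumptions, so neither rate is a resting point when the symmetry of assumption (c) is dropped.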

2. Assumptions (b) and (c) are twin brothers: if a lag in the adjustment of actual to desired cash balances is allowed for (removal of assumption [b]), then assumption (c) is no longer necessary, and the analogy with the private monopolist as described in the previous section is exact. It can be easily shown, indeed, that the “instantaneous-adjustment” case is but a particular limiting instance of the more general case.

3. Removal of assumption (a) presents different difficulties. The analogy with the private monopolist as described above is preserved in its formal presentation, but no longer can the “control variable” be taken as being the same: while our monopolist can set the explicit rental at every period, the same is not true of the fiat money issuer. But, in any case, statements about the optimum (constant) value of the explicit rental which are valid for any of the two remain valid for the other.

4. If no restrictive assumptions are imposed, then no rule on optimum behavior can be deduced a priori (even if, for example, the rate of monetary expansion is forced to be constant from the very beginning) without knowing the exact form of the adjustment and the expectations equations. Of course, it will be true that for the whole adjustment period total revenues are

$$\int_{t_0}^{t_1} m_t \pi_t\,dt + \int_{t_0}^{t_1} \dot{m}_t\,dt,$$

where $t_1 - t_0$ is the duration of the adjustment period. For any change in the equilibrium rate of inflation, from the demand for money function we know precisely the sign and magnitude of the second integral. The problem, though, is that nothing can be said regarding the magnitude of the first integral. Furthermore, the problem is complicated when earlier flows carry a larger weight than later ones (in particular, if the problem is analyzed in the context of a model allowing for a lag in the rate of inflation after a stepwise change in the rate of monetary expansion).

V. Conclusions, and Two Implications 

We have shown how explicit consideration of changes occurring at the transition allows, in general, for a better understanding of some characteristics common to both money and other capital goods. For a particular adjustment process (instantaneous adjustment), we have deduced a “revenue-maximizing” rule different from the one derived in the literature.






Even though the main point in this note is theoretical, at least two empirical considerations follow. First, there may be substantial once-and-for-all gains (for both growing and stationary economies) from a reduction in the rate of inflation, gains which may accrue almost immediately, provided that adjustments are fast and government can effectively persuade the public about its intentions; these once-and-for-all gains may be large enough to compensate for a fall in the subsequent permanent stream. This is still more relevant in cases where some time is needed either for decreasing the level of public expenditure or for raising revenues from conventional taxes.

Second, the argument shows how, in principle, the provision of the optimum quantity of money is “self-financed” (in the sense that there is no need to impose conventional taxes for the payment of interest on cash balances) if at every instant (1) the first component of total revenues ($\dot{m}$) is used to purchase real capital and integrate an “investment fund” (presumably in the form of equity) and the second component ($m\pi$) is used to pay interest on real balances, (2) the interest received on the “investment fund” ($mr$) is used to complete interest payments on real cash balances. Of course, if in the past government has consumed part or all of the “investment fund,” and at some point it decides to start paying interest on money, then it would need “outside financing.” But if this is the case and the economy is growing, the proportion of interest paid on money which needs to be “outside financed” decreases over time, and eventually becomes negligible after a few years.⁸
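The self-financing argument can be traced through a stylized sketch. The assumptions here are illustrative and not the author's own calculations: real balances grow as $m_0 e^{\gamma t}$ at a constant inflation rate, interest on money is paid at the nominal rate $r + \pi$ (taken to be positive), and every increment of real balances since $t = 0$ is added to a fund earning the real rate $r$. The function name and parameter values are hypothetical.

```python
import math

def outside_share(t, m0, gamma, r, pi, initial_fund):
    """Share of interest on money at time t not covered by m*pi plus the fund's return."""
    m_t = m0 * math.exp(gamma * t)
    fund = initial_fund + (m_t - m0)      # accumulated increments of real balances
    interest_due = m_t * (r + pi)         # interest owed on real balances
    covered = m_t * pi + r * fund         # explicit rental plus return on the fund
    return max(0.0, interest_due - covered) / interest_due

# Assumed illustrative values.
m0, gamma, r, pi = 1.0, 0.03, 0.05, 0.02
for t in (0, 10, 25, 50):
    intact = outside_share(t, m0, gamma, r, pi, initial_fund=m0)    # fund never consumed
    consumed = outside_share(t, m0, gamma, r, pi, initial_fund=0.0) # fund consumed before t = 0
    print(t, intact, consumed)
```

When the fund is intact the share is zero up to rounding; when it was consumed before $t = 0$ the share starts at $r/(r + \pi)$ and shrinks as the economy grows, in line with the last sentence of the paragraph above.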


References 

Auernheimer, Leonardo. “Essays in the Theory of Inflation.” Ph.D. dissertation, Univ. Chicago, 1973.

Bailey, Martin J. “The Welfare Cost of Inflationary Finance.” J.P.E. 64 (April 1956): 93-110.

Foley, D. K., and Sidrauski, M. Monetary and Fiscal Policy in a Growing Economy. New York: Macmillan, 1971.

Friedman, Milton. “Government Revenue from Inflation.” J.P.E. 79 (July/August 1971): 846-56.

Johnson, Harry G. “Inside Money, Outside Money, Income, Wealth, and Welfare in Monetary Theory.” J. Money, Credit, and Banking 1, no. 1 (February 1969): 30-45. (a)

------. “A Note on Seigniorage and the Social Saving from Substituting Credit for Commodity Money.” In Monetary Problems of the International Economy, edited by R. A. Mundell and A. Swoboda. Chicago: Univ. Chicago Press, 1969. (b)

Swan, Peter L. “Durability of Consumption Goods.” A.E.R. 60 (December 1970): 884-94.


⁸ For further elaboration of this point, and some hypothetical calculations, see Auernheimer (1973, chap. 1).



Cardinal Welfare, Individualistic Ethics, and 
Interpersonal Comparison of Utilities: A Note 

Antonio Camacho 

University of Illinois, Chicago Circle 

Jon Sonstelie

Northwestern University


In 1955 John Harsanyi proposed three appealing postulates for social choice under uncertainty and tried to show (Theorem V, p. 314) that these postulates lead to a social welfare function which is a weighted sum of the utility functions of the individuals.

The importance of this result is that it shows that a rather weak and plausible ethical postulate (Postulate C below) suffices to establish a cardinal social welfare function which is a weighted sum of the individuals’ cardinal utility functions.

However, as presented by Harsanyi, his conclusion can be applied only to a restricted class of cases which we do not consider broad enough to incorporate many important situations of social choice that may appear in the real world.

In this paper we will attempt to improve Harsanyi’s result in two directions. First, instead of his Postulate C we will use a new Postulate C' which is even weaker than Postulate C. Second, our proof of the existence of a cardinal social welfare function that is a weighted sum of the individual utility functions will not depend on certain restrictive assumptions utilized in Harsanyi’s proof, and consequently our result can be applied to a broader spectrum of situations. Finally, we want to note that the mathematics employed in our proof is very simple indeed.

We now present Harsanyi’s postulates and Theorem V. Then, Postulate C' will be introduced and our stronger theorem (Theorem V') proved.

Harsanyi’s three postulates are:

Postulate A. Social preferences $R$ satisfy Marschak’s (1950) Postulates I, II, III', and IV.

Postulate B. Individual preferences $R_i$ ($i = 1, \ldots, n$) satisfy the same four postulates.

Postulate C. If two prospects $P$ and $Q$ are indifferent from the standpoint of every individual, they are also indifferent from a social standpoint.




Let $X_1, \ldots, X_j, \ldots, X_m$ represent the $m$ sure prospects available to society. From Postulate A it follows that there exists a social welfare function $W$, representing the social preferences, that satisfies the following property ($\alpha$): If the value of $W$ corresponding to the sure prospect $X_j$ ($j = 1, \ldots, m$) is $w_j$, then the value of $W$ for any prospect $P = (p_1, \ldots, p_j, \ldots, p_m)$, $p_j \geq 0$,

$$\sum_{j=1}^{m} p_j = 1,$$

is

$$W(p_1, \ldots, p_j, \ldots, p_m) = \sum_{j=1}^{m} p_j w_j.$$

This social welfare function is unique up to a positive linear transformation. Similarly, from Postulate B it follows that for each individual $i$ ($i = 1, \ldots, n$) there exists a utility function $U_i$, representing his preferences $R_i$, and satisfying property ($\alpha$):

$$U_i(p_1, \ldots, p_j, \ldots, p_m) = \sum_{j=1}^{m} p_j u_{ij},$$

where $u_{ij}$ is the value of $U_i$ that corresponds to the sure prospect $X_j$. This utility function is unique up to a positive linear transformation.
Harsanyi’s Theorem V can now be stated as follows: If the individual preferences $R_i$ ($i = 1, \ldots, n$) satisfy Postulate B, and the social preferences $R$ satisfy Postulates A and C, and the corresponding utility functions $U_i$ and the social welfare function $W$ are chosen in such a way that for a given prospect, all take the value zero, then the social welfare function $W$ can be represented as a weighted sum of the individual utilities of the form

$$W = \sum_{i=1}^{n} a_i U_i,$$

where the $a_i$’s are constant real numbers, that is, they do not depend on the prospects $P$.

A close look at Harsanyi’s paper suffices to show that his proof of Theorem V relies on the existence of prospects such that, after choosing a common origin for the individual utility functions, the utility of each of these prospects is one for a given individual and zero for the rest of the individuals in the society. For those situations where there are prospects $P^k$ ($k = 1, \ldots, n$), such that

$$U_i(P^k) = \begin{cases} 1, & \text{if } i = k, \\ 0, & \text{if } i \neq k, \end{cases} \tag{$\beta$}$$

Harsanyi’s proof and his interpretation of the coefficients $a_i$ are both correct. But in those cases where, after choosing a common origin for all the individual utility functions, there is no prospect whose utility is one for a given individual and zero for the others, neither his proof nor his interpretation of the coefficients $a_i$ applies.

We believe that in those situations where the possible social states that the individuals face are probability distributions of alternative income distributions, condition ($\beta$) is satisfied. But for other important social choice situations that are likely to appear in the real world, such as public education and other public projects, we may very well have the case where the preferences of the individuals are such that condition ($\beta$) does not hold. The simplest of these cases is, of course, that in which the preferences of all individuals with regard to the alternative social prospects are the same. But other less trivial situations may also appear.

The proof that we will present below is not based on the existence of prospects with utility equal to one for an individual and zero for the others and makes use only of a Postulate C' that is weaker than Harsanyi’s Postulate C. Let $\bar{P}$ be any given prospect with all its components strictly positive: $\bar{P} = (\bar{p}_1, \ldots, \bar{p}_j, \ldots, \bar{p}_m)$, $\bar{p}_j > 0$, $j = 1, \ldots, m$,

$$\sum_{j=1}^{m} \bar{p}_j = 1.$$

Postulate C'. If a prospect $P$ is indifferent to $\bar{P}$ from the standpoint of every individual, then it is also indifferent to $\bar{P}$ from a social standpoint.

We will now prove the following Theorem V': If the individual preferences $R_i$ ($i = 1, \ldots, n$) satisfy Postulate B, and the social preferences $R$ satisfy Postulates A and C', and the corresponding utility functions $U_i$ and the social welfare function $W$ are chosen in such a way that for a given prospect all take the value zero, then the social welfare function $W$ can be represented as a weighted sum of the individual utilities of the form

$$W = \sum_{i=1}^{n} a_i U_i,$$

where the $a_i$’s are constant real numbers, that is, they do not depend on the prospects $P$.

Proof. Observe first that in view of property ($\alpha$), to prove that

$$W = \sum_{i=1}^{n} a_i U_i$$

it suffices to show that

$$w_j = \sum_{i=1}^{n} a_i u_{ij}$$

for $j = 1, \ldots, m$.

Observe also that if the theorem holds when a given prospect is chosen as the common origin of the utility functions and the social welfare function, then it also holds if any other prospect is chosen as the common origin. Thus, without loss of generality, let the prospect $\bar{P}$ be chosen as the common origin. Consider now the two following systems of equations:

$$\begin{aligned}
u_{11}p_1 + \cdots + u_{1m}p_m &= 0 \\
&\;\;\vdots \\
u_{n1}p_1 + \cdots + u_{nm}p_m &= 0,
\end{aligned} \tag{1}$$

$$\begin{aligned}
w_1 p_1 + \cdots + w_m p_m &= 0 \\
u_{11}p_1 + \cdots + u_{1m}p_m &= 0 \\
&\;\;\vdots \\
u_{n1}p_1 + \cdots + u_{nm}p_m &= 0.
\end{aligned} \tag{2}$$


Clearly we have the following:

(i) Prospect $\bar{P} = (\bar{p}_1, \ldots, \bar{p}_j, \ldots, \bar{p}_m)$ is a solution of both system (1) and system (2), and $\bar{p}_j > 0$, for $j = 1, \ldots, m$. Thus both systems have a common solution with all its components strictly greater than zero. The set of all solutions of system (1) which are prospects is precisely the set of all prospects which are indifferent to prospect $\bar{P}$ from the standpoint of every individual. The set of all solutions of system (2) which are prospects is precisely the set of all prospects which are indifferent to $\bar{P}$ from the standpoint of every individual and from the standpoint of society.

(ii) Any solution of system (2) is also a solution of system (1), since system (2) is obtained from (1) by adding the equation $w_1 p_1 + \cdots + w_m p_m = 0$.

(iii) If $(p_1, \ldots, p_j, \ldots, p_m)$, not necessarily a prospect, is a solution of (1) and $p_j > 0$ for $j = 1, \ldots, m$, then

$$\left(\frac{p_1}{\sum_j p_j}, \ldots, \frac{p_m}{\sum_j p_j}\right),$$

which is a prospect, is a solution of (1), and by Postulate C', it must also be a solution of (2), which implies that $(p_1, \ldots, p_m)$ is a solution of (2).

From (i), (ii), and (iii) it follows that (a) $(\bar{p}_1, \ldots, \bar{p}_m)$, where $\bar{p}_j > 0$ for all $j$, is a solution of both system (1) and system (2), and (b) any $(p_1, \ldots, p_m)$, with $p_j > 0$ for all $j$, which is a solution of (1), is also a solution of (2), and vice versa. But (a) and (b) imply that systems (1) and (2) are equivalent, that is, any solution of (1) is a solution of (2) and vice versa.¹ Since system (2) is derived from system (1) by adding the equation $w_1 p_1 + \cdots + w_m p_m = 0$, it follows that

$$w_j = \sum_{i=1}^{n} a_i u_{ij},$$

as desired.

¹ Let $(q_1, \ldots, q_j, \ldots, q_m)$ be any solution of (1). We can write $q_j = \bar{p}_j + k\varepsilon_j$, where $\bar{p}_j + \varepsilon_j > 0$ for $j = 1, \ldots, m$. Then, since $(\bar{p}_1, \ldots, \bar{p}_m)$ is a solution of (1), $(k\varepsilon_1, \ldots, k\varepsilon_m)$, $(\varepsilon_1, \ldots, \varepsilon_m)$, and $(\bar{p}_1 + \varepsilon_1, \ldots, \bar{p}_m + \varepsilon_m)$ are also solutions of (1). Since $\bar{p}_j + \varepsilon_j > 0$ for all $j$, it follows that $(\bar{p}_1 + \varepsilon_1, \ldots, \bar{p}_m + \varepsilon_m)$ is a solution of (2). And since $(\bar{p}_1, \ldots, \bar{p}_m)$ is a solution of (2), it follows that $(\varepsilon_1, \ldots, \varepsilon_m)$, $(k\varepsilon_1, \ldots, k\varepsilon_m)$, and $(\bar{p}_1 + k\varepsilon_1, \ldots, \bar{p}_m + k\varepsilon_m) = (q_1, \ldots, q_m)$ are also solutions of (2). In the same way one can show that any solution of (2) is a solution of (1).
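The last step of the proof rests on a standard fact of linear algebra: adding an equation to a homogeneous system leaves its solution set unchanged exactly when the new row is a linear combination of the existing rows, that is, when the rank does not increase. The sketch below illustrates this with exact rational arithmetic. The utility values are made up for illustration (four sure prospects, two individuals, each row summing to zero so that the uniform prospect is the common origin), and the function names are hypothetical.

```python
from fractions import Fraction

def rank(rows):
    """Rank of a matrix (list of rows) by Gaussian elimination over the rationals."""
    mat = [[Fraction(x) for x in row] for row in rows]
    rk, ncols = 0, len(mat[0])
    for col in range(ncols):
        pivot = next((i for i in range(rk, len(mat)) if mat[i][col] != 0), None)
        if pivot is None:
            continue
        mat[rk], mat[pivot] = mat[pivot], mat[rk]
        for i in range(len(mat)):
            if i != rk and mat[i][col] != 0:
                factor = mat[i][col] / mat[rk][col]
                mat[i] = [a - factor * c for a, c in zip(mat[i], mat[rk])]
        rk += 1
    return rk

def systems_equivalent(u, w):
    """True when adding the row w to the system with rows u does not change the solution set."""
    return rank(u) == rank([w] + u)

# Hypothetical utilities u_ij of two individuals over four sure prospects.
u = [[1, -1, 0, 0],
     [0, 0, 1, -1]]
w_weighted = [2, -2, 3, -3]   # equals 2*u[0] + 3*u[1]
w_other = [1, 0, -1, 0]       # also sums to zero, but is not a weighted sum of the rows of u

print(systems_equivalent(u, w_weighted), systems_equivalent(u, w_other))
```

In the second case there are prospects that every individual finds indifferent to the origin but that the hypothetical social ranking does not, which is what Postulate C' rules out.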


References 

Harsanyi, John. “Cardinal Welfare, Individualistic Ethics, and Interpersonal Comparisons of Utility.” J.P.E. 63 (August 1955): 309-21.

Marschak, Jacob. “Rational Behavior, Uncertain Prospects, and Measurable Utility.” Econometrica 18 (April 1950): 111-41.



On Killing off the Market for Used Textbooks 
and the Relationship between Markets for 
New and Secondhand Goods 

H. Laurence Miller, Jr.

University of Hawaii


Are secondhand copies of textbooks a drain on publishers’ and authors’ profits? Publishers’ representatives have no doubts on the matter. I have talked to several about the “problem” and about some companies’ efforts to reduce secondhand sales. One is said to have aimed for “instant destruction” by including answer pages in books to be torn out and handed in to the instructor, decreasing the value of used copies. That the publishers’ representatives have some support from professional economists is clear from this statement in the preface to a widely used textbook in pure theory: “Since everyone knows the basic reason for a revised edition is to kill off the existing used book market, it would be idle to suggest otherwise.”¹

It is sometimes said, on the other hand, that the initial price captures the present value of all subsequent transactions so that used items do not in fact compete with new ones. The buyer of a newly produced diamond pays a price consistent with what the diamond can be sold for to others including members of later generations.

Textbooks and diamonds do differ. The cost of producing additional textbooks after setup costs falls more dramatically than the cost of producing additional diamonds. It costs more to produce a hardcover book than a paperback, whereas the cost of producing a diamond is independent of its life and number of uses. And a used textbook, unlike a used diamond, is seldom a perfect substitute for a new one. But are these differences important?

Would publishers and authors be better off if they could kill off the secondhand market for their products? The problem is more complex than some appear to suppose as well as analytically similar to problems arising in other contexts in economic theory.²

I am indebted to Armen Alchian, Milton Friedman, and Akira Takayama for discussions of an earlier version of this paper. They bear no responsibility for the result.

¹ Ferguson 1969, p. vii.

² See especially the problem “Alcoa: The Secondhand Market and Market Position,” in Friedman (1962, pp. 278-79). I am indebted to Friedman for pointing out the similarity. Some matters discussed in this paper parallel those in the peak pricing literature (cf. also Hirshleifer 1971).




I 

Consider an idealized situation: (1) Transaction costs are zero. (2) A good has a technically given service life equal to two possible uses or time periods. (3) Costs of production per unit are constant. (4) All goods produced new, $X_1$, survive into the second period to provide potential second period supply, $X_2$, so that under certain conditions secondhand sales in any period would equal the previous period’s sales of new units. If optimal sales in the second period are less than first-period production, storage costs are prohibitive, disposal costs negligible. (5) New goods are preferred to used goods at identical prices. Though some of these assumptions conflict with the facts relating to production of books and other items, they are initially harmless and can be modified.

Let some buyers purchase new units, keep them for one period, then
trade them in on new ones; others buy secondhand units, keep them for
the second time period, discard them, and buy another set of traded-in
units. Alternatively, one group rents the good for the first part of its life,
the other rents it for the second. One can think of an automobile that
lasts exactly 2 years and consumers in the market year after year for the
services rendered by automobiles. One cannot think realistically of
individuals in the market year after year for the new edition of a textbook.
Although a new set of individuals enters the market for textbooks every
quarter, semester, or year (and some students do fail), the previous
period's demanders leave the market. But this difference between the
market for textbooks and some other goods is not crucial.

Assume, finally, that the demand for new items is $P_1 = P_1(X_1)$ and
demand for used items is $P_2 = P_2(X_2)$, so that the demands are
independent. The cross-partial elasticities of demand $(\partial X_i/\partial P_j) = 0$, $i \neq j$.
These demands appear as $D_1$ and $D_2$ in figure 1. This simple version
of our problem is illuminating though too restricted to fully answer
the questions asked above.

Figure 1 portrays long-run equilibrium in a competitive industry.³
(We can ignore discounting and the problems constant costs raise
regarding competitive supply.) The vertical summation of $D_1$ and $D_2$ is
$D_T$. The equilibrium rate of production and new sales, $X_1^*$, equals
secondhand sales, $X_2^*$; $P_1^*$ and $P_2^*$ would be the effective annual prices paid by
owner-consumers or renters; $P_1^* + P_2^* = C$. First owners pay a gross
price $C$ and receive a trade-in allowance of $P_2^*$ one period later; $C - P_2^* = P_1^*$,
$P_2^* - 0 = P_2^*$. Transaction costs and scrap value are zero. Vertical
summation is a familiar step in analyzing problems in joint production:
here the services the book renders in turn to first and second users.
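
As a rough numerical illustration of the vertical summation just described, the sketch below assumes linear demands $P_i = a_i - b_i X$. The intercepts, slopes, and the function name are hypothetical choices for illustration, not values from the paper. It solves $P_1(X) + P_2(X) = C$ for the common quantity and covers only the case in which both prices come out nonnegative.

```python
def competitive_equilibrium(a1, b1, a2, b2, c):
    """Solve P1(X) + P2(X) = c for hypothetical linear demands P_i = a_i - b_i * X.

    Returns (X*, P1*, P2*). Assumes an interior solution with P2* >= 0.
    """
    x = (a1 + a2 - c) / (b1 + b2)   # quantity where the summed demand price equals unit cost
    p1 = a1 - b1 * x                # effective price paid by first users
    p2 = a2 - b2 * x                # effective price paid by second users (trade-in value)
    if x < 0 or p2 < 0:
        raise ValueError("no interior equilibrium for these parameters")
    return x, p1, p2


# Illustrative numbers only: D1 is P = 10 - X, D2 is P = 6 - X, unit cost C = 8.
x_star, p1_star, p2_star = competitive_equilibrium(10, 1, 6, 1, 8)
print(x_star, p1_star, p2_star)     # the two prices sum to the unit cost
```

With these numbers the first owner pays the gross price $C = 8$ and recovers $P_2^*$ as a trade-in allowance, so the net prices are $P_1^*$ and $P_2^*$.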

Figure 2 applies this argument to a monopolist who maximizes profits
by equating marginal cost $C'(X_1)$, which equals average cost, $C(X_1)/X_1$,

³ Figure 1 is identical with one in Miller (1960, p. 231). That analysis is correct so far
as it goes. I simply adopted Fox's assumption of independent markets.






Fig. 1

and the vertical sum of the marginal revenue functions at output and
sales $X_1^* = X_2^*$. First owners pay a gross price equal to $P_1^* + P_2^*$, a
net price of $P_1^*$. Second owners pay a net price $P_2^*$. Alternatively, $P_1^*$
and $P_2^*$ are the full prices to renter-users. Then $P_1^* + P_2^* > C$.

With the cost-revenue situation in figure 2, $X_1^*$ can equal $X_2^*$ with
marginal revenue in both markets nonnegative. Equal sales in both
markets could lead to a negative marginal revenue in the secondhand
market. No monopolist would choose the output $X$ in figure 3 where
$MR_2 < 0$ unless forced to do so by a regulatory agency. Used units
would be undesired by-products rather than desirable joint products.
The rule is that marginal cost must equal total marginal revenue,
$MR_1 > 0$, $MR_2 \geq 0$, with $MR_2 = 0$ replacing $MR_2 < 0$ for
secondhand sales exceeding $X_2^*$. Optimum sales will be $X_1^*$ and $X_2^*$, $X_1^* > X_2^*$.
The quantity $X_1^* - X_2^*$ will be stored or disposed of depending upon
relevant costs and revenues.
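
The rule that marginal cost equals total marginal revenue, with secondhand marginal revenue never allowed to fall below zero, can be sketched numerically. The example assumes independent linear demands $P_i = a_i - b_i X_i$; the parameter values and the function name are hypothetical and serve only to reproduce the two qualitative cases (equal sales, and new sales exceeding secondhand sales).

```python
def monopoly_sales(a1, b1, a2, b2, c):
    """Return (X1, X2) for a monopolist facing independent linear demands.

    Assumed (hypothetical) demands: P_i = a_i - b_i * X_i, so MR_i = a_i - 2 * b_i * X_i.
    Rule: marginal cost c equals MR_1 + MR_2, with MR_2 held at zero
    whenever equal sales would make it negative.
    """
    # Candidate with equal sales in both markets: MR_1(X) + MR_2(X) = c.
    x = (a1 + a2 - c) / (2 * (b1 + b2))
    if a2 - 2 * b2 * x >= 0:
        return x, x
    # Otherwise restrict secondhand sales to where MR_2 = 0 and set MR_1 = c.
    return (a1 - c) / (2 * b1), a2 / (2 * b2)


equal_case = monopoly_sales(10, 1, 6, 1, 8)     # MR_2 stays nonnegative: X1 = X2
unequal_case = monopoly_sales(20, 1, 4, 1, 2)   # MR_2 would be negative: X1 > X2
print(equal_case, unequal_case)
```

In the first case the prices are $P_1 = 8$ and $P_2 = 4$, so $P_1 + P_2 = 12$ exceeds the unit cost of 8; in the second, the difference $X_1 - X_2$ is the quantity that would be stored, disposed of, or bought back.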

These remarks apply to a monopolist who rents his product. He need
not rent unprofitable secondhand units. If he sells units to
owner-consumers who will have $X_1^*$ units to sell in secondhand markets, it will
pay him to buy back $X_1^* - X_2^*$ secondhand units at a price $P_2^*$. First
owners will anticipate this policy and pay an initial price that counts
on a trade-in value of $P_2^*$ for all units.





Fig. 2

Suppose the government passed a law limiting goods to one use or
someone developed a good that instantly destroyed itself after the first
use with no change in costs. It is easy to see that with our assumptions,
which include independent demand schedules, such a law would
decrease, and could result in a complete cessation of, sales of new units.
New sales would cease if $C$ exceeded the intersection of $D_1$ and the price
axis in the figures. And profits would be less. Neither the monopolist
in figure 2 with $X_1^* = X_2^*$ nor the monopolist in figure 3 with $X_1^* > X_2^*$
would want to kill off the secondhand market. If we assume that cost of
production does vary with the life of the good, the fall in costs attenuates
the decrease in revenues. Transaction costs operate, apart from other
considerations, to decrease the scope of secondhand markets.

Now $D_1$ and $D_2$ may be independent if they represent demand for
diamonds by present and future generations or demands based on quite
different uses of a good.⁴ But in analyzing new and secondhand demands

⁴ Demand for many items is approximately independent of demand for new items due
to (relatively) noncompeting uses, e.g., demand for furnishings for summer cottages.






Fig. 3

in general, we cannot assume that $D_1$ and $D_2$ will be independent.⁵
New and secondhand goods will seldom be perfect substitutes at identical
prices. But they will usually be good substitutes, where $P_1 = P_1(X_1, X_2)$
and $P_2 = P_2(X_1, X_2)$ with $(\partial X_i/\partial P_j) < 0$, $i = j$, and
$0 < (\partial X_i/\partial P_j) < \infty$, $i \neq j$.


II 

It is not difficult to determine the effect of interdependent demands so
long as one assumes that it pays to sell the same number of units in each
market, that is, that $X_1 = X_2$. It can be shown that a monopolist
maximizes his profits in this case by equating marginal cost, $C'(X_1)$, and
total marginal revenue, $MR_T = (\varepsilon_1 + \alpha)P_1 + (\varepsilon_2 + \beta)P_2$, where $\varepsilon_1$,


⁵ Much of the interesting work on the peak pricing problem assumes independent
demands (see, e.g., Littlechild 1970). A brief summary of that paper appears in the
discussion of the Kuhn-Tucker theorem in Baumol (1972, pp. 167-69).





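$\varepsilon_2$, $\alpha$, and $\beta$ are functions of the various elasticities of demand:

$$\varepsilon_1 = 1 + \frac{X_1}{P_1}\frac{\partial P_1}{\partial X_1}, \qquad \alpha = \frac{X_2}{P_1}\frac{\partial P_1}{\partial X_2},$$

$$\varepsilon_2 = 1 + \frac{X_2}{P_2}\frac{\partial P_2}{\partial X_2}, \qquad \beta = \frac{X_1}{P_2}\frac{\partial P_2}{\partial X_1}.$$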

An approximate graphical representation would be obtained by
appropriate substitution of these expressions for the marginal revenue functions
in figure 2. These elasticities are functions of $X_i$ and $P_i$ such that
$(X_j/P_i)(\partial P_i/\partial X_j)$ need not equal nor have the same sign as
$1/[(P_i/X_j)(\partial X_j/\partial P_i)]$.⁶ One can establish the conditions for them to have
the same sign but can say nothing about any actual situation without
quantitative values. The interdependent case reduces to the independent
case if $\alpha = \beta = 0$.

Again, it might occur that $MR_2 < 0$ with $X_1 = X_2$, and the rule is
that marginal cost must equal total marginal revenue, $MR_1 > 0$,
$MR_2 \geq 0$. And a monopolist who sells his product will buy back some
units to achieve the result otherwise attained by renting.

What about a law limiting a product to one use or development of a
product that destroys itself after one use with no change in costs? These
would be like an infinite increase in the price of used books and would
shift demand for new books upward. Whether profits would increase
or decrease (the limiting case would again be that production would
cease entirely) would depend upon the substitutability of new and used
books and the effect on costs of decreased durability. Although this
argument is formally correct, it does not lend itself to clear-cut answers.
A slightly different way of viewing the problem may shed added light
on the possible outcomes.

Suppose that new and used books are perfect substitutes, that no
resale is permitted, and that demand for new books is given by the
linear demand function, $D$, in figure 4. If resale is now permitted,
one-half of the customers will by assumption shift to used books, and demand
for new books will change so that the price will be doubled for halved
quantities.⁷ New-book buyers will pay twice what they paid before, $D'$,
in anticipation of receiving one-half that price as a trade-in value 1
year later. The net price of new and used books would be equal. If costs
of production were zero, output of new books would be one-half of what

⁶ $(\partial P_i/\partial X_j)$ and $(\partial X_j/\partial P_i)$ have the same sign in $P_i = P_i(X_i, X_j)$, but
$(\partial P_i/\partial X_j)$ in $P_i = P_i(X_i, X_j)$ and $(\partial X_j/\partial P_i)$ in $X_j = X_j(P_i, P_j)$ are
derivatives of different functions and may not have the same sign.

⁷ Jack Hirshleifer suggested in correspondence that this might be a fruitful way to
approach the problem.





Fig. 4

it was before, the price of a new book would be $P_1^* = 2P_2^* = 2P^*$,
and total revenue and profits would be unchanged by introduction of a
secondhand market.

A product that destroys itself after one use would involve the reverse
movement from $D'$ to $D$. If costs were zero, output would double, and
total revenues and profits would be unchanged by killing off the used
market.

With positive costs of production, output would less than double, and
profits would be lower without a secondhand market. When costs fall
significantly with durability, these effects are attenuated. Although costs
"could" fall by more than one-half when the life of a good is halved, a
rational publisher not subject to transaction costs would never choose to
produce the more durable book in the first place.⁸
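
The claims in the last three paragraphs can be checked with a small calculation. The sketch below takes the linear demand $P = 3 - X$ with no resale, so that doubling the price at halved quantities gives $P' = 6 - 4X$ with resale; the helper name `max_profit` is a hypothetical label. Exact fractions are used so the comparison is not blurred by rounding.

```python
from fractions import Fraction


def max_profit(intercept, slope, mc):
    """Monopoly optimum for inverse demand P = intercept - slope * X and constant unit cost mc.

    Returns (quantity, price, profit) from the condition MR = intercept - 2 * slope * X = mc.
    """
    x = Fraction(intercept - mc, 2 * slope)
    price = intercept - slope * x
    return x, price, (price - mc) * x


# Zero cost: revenue and profit are the same with or without resale.
no_resale_zero = max_profit(3, 1, 0)    # demand D:  P  = 3 - X
resale_zero = max_profit(6, 4, 0)       # demand D': P' = 6 - 4X

# Unit cost of 1: profit is higher when the secondhand market exists.
no_resale_one = max_profit(3, 1, 1)
resale_one = max_profit(6, 4, 1)

# If the two-use book costs 2 per unit while the one-use book costs 1,
# the two profits coincide: this is the break-even cost difference.
resale_two = max_profit(6, 4, 2)

print(no_resale_zero[2], resale_zero[2], no_resale_one[2], resale_one[2], resale_two[2])
```

Under these demand curves the profit with resale at a unit cost of 1 works out to $25/16 \approx 1.56$, against 1.00 without resale, and output without resale (1) is less than double output with resale ($5/8$).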

Used books are not in fact perfect substitutes for new ones, so $D'$
in the more general case will be below $D'$ in figure 4. Introduction of a

⁸ A simple example will support the statements in these three paragraphs. Suppose
that new and used books are perfect substitutes and that demand with no resale
permitted, $D$, is given by $P = 3 - X$, so demand with resale permitted, $D'$, is given by
$P' = 6 - 4X$. If $AC = MC = 0$, revenue and profit will be 2.25 in both cases. If now
$AC = MC = 1$, profit with no resale and profit with resale will be 1.00 and 1.56,
respectively. If costs increase as durability increases, $AC = MC = 2$ for a book with twice the
life expectancy marks the point where profit could be increased by killing off the
secondhand market.




secondhand market would not double the demand price at halved
quantities. Killing off the market would decrease demand price by less
than one-half at doubled quantities. It is conceivable, though hardly
necessary, that some combination of revenues and costs would make it
pay to eliminate the secondhand market in accord with the publishers' view
of the situation. We have no information on relevant revenues and costs,
however, that makes this probable. The probability decreases with
increases in the marginal rate of substitution between new and used books.
There is no reason to suppose that we would obtain different results
if we had assumed three or more uses. Storage and disposal costs and
transaction costs would add realism to the picture but are not essential
in an examination of the chief issue.

With regard to the quotation appearing in the first paragraph, a
monopolist may or may not increase profits by changing his product.
A monopolistic competitor must, of course, keep the ideas and facts
in his book up-to-date or find his demand curve shifting to the left.
It is not as obvious as the quotation suggests that the reason for bringing
out a new edition is to kill off the existing used-book market.


References 

Baumol, William J. Economic Theory and Operations Analysis. 3d ed. Englewood
Cliffs, N.J.: Prentice-Hall, 1972.

Ferguson, Charles E. Microeconomic Theory. Rev. ed. Homewood, Ill.: Irwin,
1969.

Friedman, Milton. Price Theory. Chicago: Aldine, 1962.

Hirshleifer, Jack. "Suppression of Inventions." J.P.E. 79 (March/April 1971):
382-83.

Littlechild, Stephen C. "Pricing Telephone Services." Bell J. Econ. and Management
Sci. 1 (Autumn 1970): 191-210.

Miller, H. Laurence, Jr. "A Note on Fox's Theory of Second-Hand Markets."
Economica, n s 27 (August 1960) 249-Vi 



A Note on Income (Labor) Inequality: Income 
Tax Systems and Human Capital Theory 

Claude Montmarquette 
Université de Montréal


Human capital theory has recently provided us with a theory of income
distribution and an explanation of the income inequality existing in a
given country at a specific point in time. However, works in this area
have generally neglected to consider the income tax structure with
which a potential investor will be confronted when returns from his
investment are to be received.¹ On the other hand, measures of income
inequality (specifically defined from a Lorenz curve) have been
discussed and related to lump-sum and proportional taxes but outside the
context of human capital theory (Levine and Singer 1970).

The purpose of this note is to investigate, with the help of human
capital theory, the before- and after-tax income distributions associated
with different income tax systems. The discussion formalizes the
widespread belief in the literature of income distribution (Aitchison and
Brown 1957; Goode 1964) that the income tax structure influences the
pattern of before-tax income. I then analyze the problem of income
inequality for the different income tax systems using the variance of the
logarithm of income as a measure of this inequality.

Let us follow Mincer (1958, 1970), Chiswick (1971), and others and
assume that all schooling and training costs are viewed by the individual
as time costs. In equilibrium, individuals with larger amounts of training
must be compensated for their costs. In other words, neglecting ability
and quality issues, we must equalize the expected present value of
lifetime earnings at the time the training is undertaken. Then, by
assuming expectations to be realized,² we can explain observed income by
past investments in human capital.³

My thanks to Edi Karni for helpful comments on an earlier version of this note.
Several of my colleagues, a referee, and Robert J. Gordon also made useful comments.
Any errors are mine.

¹ To mention a few exceptions: Hansen (1963) and Becker (1964, 1967).

² Note the distinction between anticipations on the return from schooling and the
realized return of this investment. The distinction respecifies the identification problem
which relates schooling to income, and vice versa. The distinction is also conceptually
helpful in stressing the difference between the allocative effects of a tax and its distributive
effects.

³ For simplicity of discussion, I am considering the schooling model, a simplified
version of the human capital model.




Mathematically we have 

$$V_s^e = Y_s^e \int_s^{n+s} e^{-rt}\,dt = Y_s^e e^{-rs}(1 - e^{-rn})/r, \qquad (1)$$

where $V_s^e$ = expected present value of lifetime earnings of an individual
with $s$ years of schooling, $Y_s^e$ = expected annual earnings of an individual
with $s$ years of schooling, $r$ = the interest rate of the expected return on
schooling, $n$ = adult life to retirement, $t$ = time, and $e$ = base of natural
logarithms.

Similarly, for an individual with $s - d$ years of schooling we obtain
(assuming the number of years spent working equal to $n$ for all schooling
groups)

$$V_{s-d}^e = Y_{s-d}^e \int_{s-d}^{n+s-d} e^{-rt}\,dt = Y_{s-d}^e e^{-r(s-d)}(1 - e^{-rn})/r. \qquad (2)$$

In equilibrium, $V_s^e = V_{s-d}^e$, and we obtain the ratio of expected
annual earnings of persons differing by $d$ years of schooling. Now assume
the expectations to be realized; then $V_s = V_{s-d}$ will yield

$$k_{s,s-d} = \frac{Y_s}{Y_{s-d}} = \frac{e^{-r(s-d)}}{e^{-rs}} = e^{rd}, \qquad (3)$$

the ratio of annual earnings of persons differing by $d$ years of schooling.
Let us specify $Y_s$ and $Y_{s-d}$ as after-tax income. By doing so we assume
that individuals are concerned with the expected values of these variables
in their choice of a specific income stream.⁴ Again, by focusing on the
inverse relationship, realized after-tax income will be explained by
investments in human capital.⁵ Before-tax incomes ($Y^*$) are related to
after-tax incomes by the usual expressions


$$Y_s = Y_s^*(1 - T_s), \qquad (4)$$

$$Y_{s-d} = Y_{s-d}^*(1 - T_{s-d}), \qquad (5)$$

with $T_s$ and $T_{s-d}$ as the relevant average income tax rates.⁶


⁴ In a complete model, the individual will adjust to the distribution pattern (if one
exists) of tax proceeds. Note that the distribution of the proceeds need not be related to
a specific income tax system.

⁵ Mincer (1970) did not distinguish between before- and after-tax income distribution.
The distinction has some relevance in testing the ability of the human capital theory to
explain income distribution.

⁶ Tax rates are indexed with the schooling variable, as they depend indirectly on the
level of schooling. Note that formally

t,{y. - y.- i) 

y. 

where $t_s$ is the marginal tax rate of a given income bracket.






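Forming the ratio $k^*_{s,s-d}$ of before-tax annual incomes, we obtain⁷

$$k^*_{s,s-d} = \frac{Y_s^*}{Y_{s-d}^*} = \frac{Y_s(1 - T_{s-d})}{Y_{s-d}(1 - T_s)} = k_{s,s-d}\,\frac{(1 - T_{s-d})}{(1 - T_s)} = e^{rd}\,\frac{(1 - T_{s-d})}{(1 - T_s)}. \qquad (6)$$

The last two equalities are derived from (3). With $s - d = 0$ for
simplicity,⁸ equation (6) reduces to

$$k^*_{s,0} = \frac{Y_s^*}{Y_0^*} = \frac{Y_s(1 - T_0)}{Y_0(1 - T_s)} = k_{s,0}\,\frac{(1 - T_0)}{(1 - T_s)} = e^{rs}\,\frac{(1 - T_0)}{(1 - T_s)}. \qquad (7)$$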


With $T_s = T_0$ (proportional income tax), equation (7) gives Mincer's
schooling model (1970). As he points out: "The formulation highlights
the principle that it is not the shorter pay-off period of the more educated
(recall same span n of life time earnings is assumed) but the
postponement of earnings that is the basic cause of differential in earnings"
(Mincer 1970, p. 7). With $T_s \neq T_0$, another cause of differential in
gross earnings is added.
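
A quick numerical reading of equation (7) may help. The sketch below evaluates the before-tax ratio $e^{rs}(1 - T_0)/(1 - T_s)$ and the after-tax ratio $e^{rs}$; the rate of return, years of schooling, and tax rates are hypothetical values chosen for illustration, and the function names are not from the note.

```python
import math


def before_tax_ratio(r, s, t_s, t_0):
    """Before-tax earnings ratio k*_{s,0} = e^{rs} (1 - T_0) / (1 - T_s), as in equation (7)."""
    return math.exp(r * s) * (1 - t_0) / (1 - t_s)


def after_tax_ratio(r, s):
    """After-tax earnings ratio k_{s,0} = e^{rs}, which does not depend on the tax rates."""
    return math.exp(r * s)


r, s = 0.10, 4                                     # assumed 10% return, 4 extra years of schooling
proportional = before_tax_ratio(r, s, 0.20, 0.20)  # T_s = T_0
progressive = before_tax_ratio(r, s, 0.20, 0.00)   # T_0 = 0, T_s > 0

print(round(after_tax_ratio(r, s), 4), round(proportional, 4), round(progressive, 4))
```

With a proportional tax the before-tax and after-tax ratios coincide; with $T_0 = 0$ and $T_s > 0$ the before-tax ratio is larger, which is the additional cause of differential in gross earnings.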

From (7), we can then easily derive for different income tax systems
the before- and after-tax income distributions as explained by the
schooling model. Table 1 summarizes the results; it also gives the variance of
the logarithm of income, a commonly used measure (hereafter called
F-measure) of income inequality.⁹

From the table, it is easily seen that, in the case of a proportional
income tax system, the same measure F of income inequality holds for
both before- and after-tax income distribution.¹⁰ Furthermore, if we
assume for the three income tax systems the same rate of return on
schooling and the same variance of schooling, the F-measure of the
after-tax income inequality will be the same under any of the three
income tax systems. Finally, under the same assumptions, the progressive
income tax system will present, relative to the case of a proportional
income tax system, a greater inequality in the before-tax income
distribution. This is also true for the regressive income tax system, provided,


⁷ The same result could be obtained by forming the after-tax present value function

$$V_s = Y_s^*(1 - T_s)\int_s^{n+s} e^{-rt}\,dt$$

and by repeating the analysis. The presentation in the text explores the analogy with the
treatment in investment literature of the physical capital affected by a corporate income
tax (see Harberger 1962, 1968). Note also Friedman's (1962) similar discussion in terms
of wage rate equalization rather than capital value equalization.

⁸ The constant $Y^*_{s-d} = Y^*_0$ represents the income derived from initial endowment.

⁹ For an appraisal of this measure of income inequality, see Weisskoff (1970).

¹⁰ This result was also derived by Levine and Singer (1970) for measures of income
inequality defined from the Lorenz curve.





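TABLE 1

Income Tax Systems and Income Distributions

Proportional Income Tax System ($T_s = T_0$)
  Before-tax income: $Y_s^* = Y_0^* e^{rs}$; $\mathrm{var}(\log Y_s^*) = r^2\,\mathrm{var}(s)$*
  After-tax income: $Y_s = Y_0 e^{rs}$; $\mathrm{var}(\log Y_s) = r^2\,\mathrm{var}(s)$

Progressive Income Tax System ($T_0 = 0$, $T_s > 0$)†
  Before-tax income: $Y_s^* = Y_0^* e^{rs}/(1 - T_s)$;
    $\mathrm{var}(\log Y_s^*) = r^2\,\mathrm{var}(s) + \mathrm{var}[\log(1 - T_s)] - 2r\,\mathrm{cov}[s, \log(1 - T_s)]$
  After-tax income: $Y_s = Y_0 e^{rs}$; $\mathrm{var}(\log Y_s) = r^2\,\mathrm{var}(s)$

Regressive Income Tax System ($T_0 > 0$, $T_0 > T_s$, $\forall\, T_s$)
  Before-tax income: $Y_s^* = Y_0^* e^{rs}(1 - T_0)/(1 - T_s)$;
    $\mathrm{var}(\log Y_s^*) = r^2\,\mathrm{var}(s) + \mathrm{var}[\log(1 - T_s)] - 2r\,\mathrm{cov}[s, \log(1 - T_s)]$
  After-tax income: $Y_s = Y_0 e^{rs}$; $\mathrm{var}(\log Y_s) = r^2\,\mathrm{var}(s)$

* For simplicity, $r$ is taken as given.

† Note that $k_{s,0} > 1$, so $T_0 = 0$ and $T_s > 0$ imply a progressive income tax structure,
since the marginal tax rate increases with income (see n. 6).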

however, that $2r\,\mathrm{cov}[s, \log(1 - T_s)] < \mathrm{var}[\log(1 - T_s)]$ is satisfied.
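
The variance expression for before-tax income in table 1 can be verified on made-up data. The sketch below uses the progressive case ($T_0 = 0$) with a hypothetical schooling distribution and hypothetical average tax rates that rise with schooling; population variances and covariances are computed with the standard library, and `pcov` is a small helper defined here.

```python
import math
from statistics import fmean, pvariance


def pcov(x, y):
    """Population covariance of two equally long sequences."""
    mx, my = fmean(x), fmean(y)
    return fmean([(a - mx) * (b - my) for a, b in zip(x, y)])


r = 0.08                                      # assumed rate of return on schooling
schooling = [8, 10, 12, 14, 16, 18]           # hypothetical years of schooling
tax = [0.10, 0.14, 0.18, 0.22, 0.26, 0.30]    # hypothetical average tax rates T_s
log_y0 = 0.0                                  # log of Y_0 (= Y_0* because T_0 = 0)

log_net = [math.log(1 - t) for t in tax]
# log Y_s* = log Y_0* + r s - log(1 - T_s); log Y_s = log Y_s* + log(1 - T_s)
log_before = [log_y0 + r * s - ln for s, ln in zip(schooling, log_net)]
log_after = [lb + ln for lb, ln in zip(log_before, log_net)]

var_before = pvariance(log_before)
formula = r**2 * pvariance(schooling) + pvariance(log_net) - 2 * r * pcov(schooling, log_net)
var_after = pvariance(log_after)

print(var_before, formula, var_after)
```

Because the tax rate rises with schooling here, the covariance term is negative and before-tax inequality exceeds after-tax inequality, which equals $r^2\,\mathrm{var}(s)$.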

The presentation clearly suggests that a progressive income tax
system might create inequalities in the before-tax income distribution
that would counteract the mathematical effectiveness of this progressive
taxation to reduce after-tax income inequality.¹¹ More specifically, to
the extent that investment in human capital alone explains income
differences, and to the extent that returns to investment in education
are equalized at the margin, income taxes are completely shifted forward
in the long run. This result is not original, since incidence theory
maintains that income taxes are completely shifted forward, as long-run
supply in any occupation can be considered perfectly elastic.¹² The merit
of this note lies in the derivation, in terms of a specific measure of income
inequality, of sufficient conditions for this last result to hold.
Furthermore, these conditions appear to be sufficient conditions for the Pareto
law to hold, as this law states that income is distributed in the same way

¹¹ The concept of progressivity in taxation is not trivial, since it must be defined for
the tax system in general and not only for the income tax. However, what I discuss in
the text are the effects on income distribution of an effective progressive income tax.

¹² A referee of this Journal drew this point to my attention.






regardless of social and political institutions and regardless of taxation.
The law implies that allocative and distributive effects of an income
tax system need not be conflicting.¹³

To discuss what effectively happens to those conditions, one must
first investigate the allocative effects of the different forms of income
tax on the investments in human capital. Then it remains to evaluate the
effects of a change (if any) in investments in human capital on the rate
of return and on the variance of these investments. The schooling model
and the restrictive assumptions retained in this note might help answer
these questions. However, in the context of a general equilibrium
framework of incidence theory (Harberger 1962; Mieszkowski 1967) and with
a model of income inequality which introduces ability and quality
considerations and refers to a general version of the human capital model,
those questions appear quite complex and no attempt will be made here
to discuss them.

In this note, I have shown that the human capital model of income
distribution enables us to incorporate the income tax system of a society
into an analytical framework which explores the relationship between the
before- and after-tax income distributions. The importance of the results
presented in the text is strictly bound to the importance of the human
capital model to explain the difference in earnings, which, in the final
analysis, proves to be an empirical question.


References 

Aitchison, J., and Brown, J. A. C. The Lognormal Distribution. Cambridge:
Cambridge Univ. Press, 1957.

Becker, G. S. Human Capital. New York: Columbia Univ. Press, 1964.

———. Human Capital and the Personal Distribution of Income: An Analytical Approach.
Ann Arbor: Univ. Michigan Press, 1967.

Chiswick, B. R. "Earnings Inequality and Economic Development." Q.J.E. 85,
no. 1 (February 1971): 21-40.

Friedman, Milton. Price Theory: A Provisional Text. Chicago: Aldine, 1962.

Goode, R. The Individual Income Tax. Washington: Brookings Inst., 1964.

Hansen, W. L. "Total and Private Rates of Return to Investment in Schooling."
J.P.E. 71, no. 2 (April 1963): 128-40.

Harberger, A. C. "The Incidence of the Corporation Income Tax." J.P.E. 70,
no. 3 (June 1962): 215-40.

———. "Corporation Income Taxes." In International Encyclopedia of the Social
Sciences. Vol. 2. New York: Macmillan, 1968.


¹³ Note that as human capital models attempt to explain labor income inequality,
equity issues are not a difficulty. Furthermore, to extend the results of the text to the
general human capital model to explain income distribution and income inequality (see
Chiswick 1971), conditions similar to those affecting the return and variance of schooling
must prevail on the return, variance, and covariance of all other forms of investment in
human capital.




Levine, D. B., and Singer, N. M. "The Mathematical Relation between the
Income Density Function and the Measurement of Income Inequality."
Econometrica 38, no. 2 (March 1970): 312-24.

Mieszkowski, P. "On the Theory of Tax Incidence." J.P.E. 75, no. 3 (June
1967): 215-40.

Mincer, J. "Investment in Human Capital." J.P.E. 66, no. 4 (August 1958):
261-302.

———. "The Distribution of Labor Incomes: A Survey with Special Reference
to the Human Capital Approach." J. Econ. Literature 8, no. 1 (March 1970):
1-26.

Weisskoff, R. "Income Distribution and Economic Growth in Puerto Rico,
Argentina and Mexico." Rev. Income and Wealth 16, no. 4 (December 1970):
303-32.



A Note on Equilibrium in Monopolistic 
Competition 

Yoram Peles 
The Hebrew University 


Two major results obtained in Chamberlinian monopolistic competition
are (a) in the long-run equilibrium the firm earns only normal profits,
and (b) it has excess capacity, that is, the firm produces in the range of
declining average costs. This paper emphasizes the importance of
short-run fixed costs of product differentiating, such as store location, advertising,
R & D, etc., for the determination of both the location of the demand
curve and short-run equilibrium. Moreover, in the extreme case of
completely immovable assets it will be shown that under certain
conditions, the short-run equilibrium solution in which there exists both
excess profits and rising average costs obtains in long-run equilibrium
as well.

The excess-capacity theorem is based on the existence of a U-shaped
cost curve. However, the declining portion of the long-run average cost
curve may serve as a barrier for new firms. This barrier would arise if
there is not enough room to support new firms at the prevailing price,
though existing firms enjoy excess profits and not necessarily excess
capacity.

An example sometimes given for monopolistic competition is
spatial competition: the location of firms along a line. For convenience
let us analyze a specific example; however, the conclusions are generally
valid for this type of competition. Let consumers be spread evenly along
a road going from $L$ to $M$. At every point along the road the demand for
a given product is given by $P = B - q - aT$, where $B$ is a constant, $a$
denotes the consumer's cost of transporting one unit of good one unit of
distance, $T$ is the number of units of distance from point of selling to
point of destination, and $q$ is quantity demanded at a particular point
on the road. Put $a = 1$. Information is perfect and available to all,
hence consumers do not shop around but buy from the seller whose
price plus cost of transportation is the lowest. All producers have the
same cost function, which includes an initial investment in fixed assets

I acknowledge the helpful comments of Lester Telser and a referee.



(or better with no alternative use in the short run) denoted $F$, and zero
marginal costs. The annual cost of capital is $\alpha$. Let us start by assuming
that these fixed assets are not amortized. Firms are free to enter the
market and they will do so as long as it pays them, namely, if they expect
annual revenue $> \alpha F$. Firms are free to change the price of their product
and prices may differ between producers, but it is not possible to have
price discrimination by the same producer.

Now, firms start to enter this market. If there is only one producer
along the line he will charge a monopoly price $P$ which gives the
maximum revenue. In this example, the total demand confronting the
producer, to the left and right together, is given by

$$Q = 2\int_{T=0}^{B-P} (B - P - T)\,dT.$$

The price charged is therefore $P = \tfrac{1}{3}B$, selling to customers up to a distance of
$\tfrac{2}{3}B$ to his left and $\tfrac{2}{3}B$ to his right (fig. 1). The producer might locate
himself at point $\tfrac{2}{3}B$, thereby securing his left wing from any new entry,
as will be shown. At this point he sells a quantity $q = B - \tfrac{1}{3}B = \tfrac{2}{3}B$;
to his left, he sells up to the beginning of the road at point $L$, at which
point $q = 0$; to his right he sells up to point $1\tfrac{1}{3}B$, where again $q = 0$.
Total quantity sold is $\tfrac{4}{9}B^2$ units and $PQ = \tfrac{4}{27}B^3$. A second producer
will sell under the same conditions, so let him be located at point $2B$,
once again securing his left wing from new entry, with a market range from
point $1\tfrac{1}{3}B$ to $2\tfrac{2}{3}B$. So, as long as there is room along the road toward $M$,
new producers will enter on these terms.
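
The single-producer solution can be confirmed by brute force. The sketch below normalizes $B = 1$ (an assumption made only for convenience), uses the closed form $Q = (B - P)^2$ of the demand integral, and searches a price grid for the revenue maximum; the function name and grid size are arbitrary illustrative choices.

```python
from fractions import Fraction


def total_demand(price, b):
    """Q = 2 * integral from 0 to b - price of (b - price - T) dT = (b - price)**2."""
    reach = b - price          # distance at which quantity demanded falls to zero
    return reach * reach


B = Fraction(1)                                   # normalization, not from the text
grid = [B * k / 300 for k in range(301)]          # candidate prices 0, 1/300, ..., 1
best_price = max(grid, key=lambda p: p * total_demand(p, B))
best_quantity = total_demand(best_price, B)
best_revenue = best_price * best_quantity

print(best_price, best_quantity, best_revenue)
```

The grid search returns a price of one-third of $B$, a quantity of $\tfrac{4}{9}B^2$, and revenue of $\tfrac{4}{27}B^3$, with customers served up to a distance of $\tfrac{2}{3}B$ on each side.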

Fig. 1. The road from $L$ to $M$, with points $1\tfrac{1}{3}B$, $2B$, and $2\tfrac{2}{3}B$ marked


Suppose the road has already been colonized up to $M$ and a new
potential producer wants to enter the market, knowing that firms are
already located at points $\tfrac{2}{3}B$, $2B$, etc., and they cannot move away
(short-run analysis). The best place for locating his firm is at the
midpoint between two producers, such as at point $1\tfrac{1}{3}B$. The intruder enters
the market at this point. Now if prices are still kept at $P = \tfrac{1}{3}B$, the
intruder will take away customers from points $B$ to $1\tfrac{2}{3}B$, having
$PQ = \tfrac{1}{9}B^3$. However, once the intruder is in the market, $P = \tfrac{1}{3}B$ is not the best
price for either the neighboring old producers or the new one. With a
new producer located at midpoint between two old ones, the best policy
for the existing firms and the new ones together is to charge still the same





price.¹ Hence each producer sells a distance of $\tfrac{1}{3}B$ on either side, therefore
having a demand of

$$Q = 2\int_{T=0}^{B/3} (B - P - T)\,dT.$$

The new price set is $P = 1.25(\tfrac{1}{3}B)$, at which the entrant sells a quantity
of $Q = \tfrac{5}{18}B^2$, getting a revenue of $\tfrac{25}{216}B^3$.

So, as long as the old producers do not change their location, the
potential producer cannot expect an annual profit of more than $\tfrac{25}{216}B^3$.
With cost of capital $\alpha$ he will not enter the market as long as $F > \tfrac{25}{216}B^3/\alpha$,
although those already in the market might benefit from monopoly
profits (if $F < \tfrac{4}{27}B^3/\alpha$).
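
The entrant's best price when his market reaches a fixed distance of one-third of $B$ on each side can be checked the same way. The sketch again normalizes $B = 1$ (a convenience assumption) and evaluates revenue $P \cdot 2\int_0^{B/3}(B - P - T)\,dT$ on a price grid; names and grid size are illustrative.

```python
from fractions import Fraction


def entrant_revenue(price, b, reach):
    """Revenue when selling a fixed distance `reach` on each side of the firm.

    Quantity is 2 * integral from 0 to reach of (b - price - T) dT,
    valid while b - price >= reach (demand positive out to the market boundary).
    """
    quantity = 2 * ((b - price) * reach - reach * reach / 2)
    return price * quantity


B = Fraction(1)                    # normalization, not from the text
reach = B / 3                      # half the gap to each neighboring producer
grid = [B * k / 120 for k in range(81)]   # prices from 0 to 2/3, so b - price >= reach
best_price = max(grid, key=lambda p: entrant_revenue(p, B, reach))
best_revenue = entrant_revenue(best_price, B, reach)
old_price_revenue = entrant_revenue(B / 3, B, reach)

print(best_price, best_revenue, old_price_revenue)
```

The best price is $\tfrac{5}{12}B$, that is, 1.25 times the original monopoly price, with revenue $\tfrac{25}{216}B^3$; keeping the old price would yield only $\tfrac{1}{9}B^3$, and both fall short of the $\tfrac{4}{27}B^3$ earned by an unchallenged producer.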

If the assets are amortized over a finite number of years, the time will
come when they have to be replaced (long-run analysis). Then it will
be to the benefit of the old producers to move away from a new intruder,
if he enters the market, thus increasing his market size and allowing him
more profit than the maximum given above. In general the
longer is this long run the lower is the probability of a new entry and
therefore the stronger the monopoly conditions for those already in the
market.

Let F < the old producers can adopt a policy of preventing 

new entry and still getting a monopoly profit. In the previous case each
producer kept the best market interval giving him the maximum
monopoly profit. If producers expect a new entry, they might locate themselves
closer together from the beginning, reducing the market area and the
maximum profit an intruder can get. The cost of this strategy for the old
producers is a lower monopoly profit.²

In the previous case each firm was producing under conditions of
constant short-run average costs but declining long-run ones. However,
having a long-run analysis, in which long-run costs include both
short-run variable costs and short-run fixed costs, the monopolistic competition
solution might exist at the decreasing, minimum, or increasing part of
the long-run average cost curve (and, as before, with excess profits).
To show this let us take the extreme demand condition of zero price
elasticity below a given price at every point along the road but positive
elasticity of demand with respect to distance. Let the demand at every
point along the road be $q = 1$ unit of product at $P \leq B$ and $q = 0$
at $P > B$. The first producer entering this market can sell up to a
distance of $T = B - P$ to the right and the same distance to the left. The

¹ The following analysis assumes that a new producer enters between every
two old producers. If only one new producer enters the market, the producers
already in the market prefer a lower price than that of the intruder because
they face competition only on one side of the road. However, even with the
higher price the conclusions will not be changed.
² See Pashigian (1968).




quantity demanded to his right and left together is the quantity demanded
at each point, q, times the distance, that is, Q = q × 2T = q × 2(B − P),
and since q = 1, therefore Q = 2B − 2P or P = B − Q/2. In order to
simplify the mathematical expressions (though not affecting the
conclusions) let B = 100, such that P = 100 − Q/2. Let short-run mc = Q²;
then the best price for the first firm to enter the market is P = 95.25
with Q = 9.5 and T = 4.75. The total revenue PQ = 904.9, and total short-run
variable costs are 285.8. This solution is at the minimum long-run
average cost if this minimum cost amounts to 90.5, so that the annual
outlay on the short-run fixed cost is 574. Now, after the road is colonized
with firms located at intervals of 9.5 starting at point 4.75, a new firm
wanting to enter this market can expect the best at the midpoint between
two old producers, with a market range of 4.75. With such range the
highest profit is obtained charging P = 97.625, selling Q = 4.75, with
revenue PQ = 463.7, and after subtracting 35.7 for short-run variable
cost, the potential firm can expect an income of no more than 428,
which falls short of the annual cost for short-run fixed cost of 574.
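The arithmetic of this example can be retraced with a short script. It is a sketch under one stated assumption: short-run marginal cost is taken to be mc = Q², the form that reproduces the variable-cost figures of 285.8 and 35.7 quoted above. Variable names are illustrative.

```python
B = 100.0


def price(quantity):
    """Inverse demand facing a firm: P = B - Q/2."""
    return B - quantity / 2.0


def variable_cost(quantity):
    """Short-run variable cost, assuming mc = Q^2 (so cost = Q^3 / 3)."""
    return quantity ** 3 / 3.0


# Incumbent: marginal revenue (B - Q) equals marginal cost (Q^2).
q_exact = (-1.0 + (1.0 + 4.0 * B) ** 0.5) / 2.0
q_old = 9.5                          # the text's rounded quantity
p_old = price(q_old)
revenue_old = p_old * q_old
vc_old = variable_cost(q_old)
fixed_cost = 90.5 * q_old - vc_old   # implied by a minimum average cost of 90.5

# Entrant at the midpoint: the market range caps sales at 4.75 units.
q_new = 4.75
p_new = price(q_new)
revenue_new = p_new * q_new
income_new = revenue_new - variable_cost(q_new)

print(round(q_exact, 2), p_old, round(revenue_old, 1), round(vc_old, 1))
print(round(fixed_cost), p_new, round(revenue_new, 1), round(income_new))
```

The entrant's quantity is a corner solution: at Q = 4.75 marginal revenue (95.25) still exceeds the assumed marginal cost, so the entrant sells as much as its market range allows and still cannot cover the fixed cost.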

If no new entry occurred by the date of replacement, each firm,
whether already in the market or a potential intruder, will try to be the
first to build the new plant. Since everyone knows what is going to happen
on the day of replacement, the potential intruder (and therefore old
firms already in business) may anticipate the day of replacement by
building the new plant earlier. It will pay them to do so as long as the
expected future monopoly profits exceed expected losses until the old
plant is scrapped. In other words, firms will obtain normal profits only
from the new plants. The new plant can be sited anywhere along the
road from a point immediately next to an old plant to a point midway
between two old plants. Once the old plant is scrapped,³ the best location
for the new plant is at the site of the old, with the new plant serving the
road interval previously served by the old plant and keeping the old
prices and quantities. The excess profit now pays for the premature
investment.⁴

However, while the old plant is still in use, short-run profits are at a
maximum when the new plant is located at the midpoint between two
old plants. Hence, a new plant built before an old one is scrapped will
be somewhere between the site of the old plant and the midpoint,
closer to the midpoint the further ahead is the day of scrapping. It is
better to stop at this point before the picture of the very long-run solution
becomes too complicated.

³ If the new plant is owned by an old producer, early investment can be
accompanied by premature scrapping.

⁴ Instead of building a new plant early, the entrepreneur may declare his
fixed intention of building the new plant at the due time and so save losses.
However, let us not go too far into strategies of this kind.




In short, monopolistic competition can operate under conditions
where each firm faces competition only from neighboring firms, whether
in terms of distance or quality. The existence of untransferable fixed
assets allows individual firms to adopt a policy leaving them monopoly
conditions with excess profits, although free and fast entry is allowed.
The quantity these firms produce might be at a minimum point on the
average cost curve (or along the increasing part of the curve) and not
necessarily at decreasing average cost, as usually claimed. Hence firms
can produce “efficiently.”⁵


References 

Pashigian, B. Peter. “Limit Price and the Market Share of the Leading Firm.”
J. Indus. Econ. 16 (July 1968): 165-77.

Telser, Lester G. “On the Regulation of Industry: A Note.” J.P.E. 77
(November/December 1969): 939-52.

⁵ On the problem of optimum social production under the above conditions of
spatial competition, see Telser (1969).





Optimal Replacement of Capital Goods in 
Early New England and British Textile 
Firms: A Comment 

David Denslow, Jr , and David Schulze 

University of Florida 


Professor Jeffrey G. Williamson (1971) states in a recent article that if
all technical progress is labor saving and capital embodied, then “the
level of unskilled wages prevailing is irrelevant” (p. 1327) to the
optimal frequency of replacement of capital equipment, which is contrary
to what Habakkuk (1962) has written. “It appears,” says Williamson,
“that Habakkuk’s model is not internally consistent” (p. 1327).

In fact, however, Williamson’s model is inconsistent.¹ His is a vintage
model in which a physical unit of capital, $K(v)$, is operated by a
homogeneous labor input, $L(v)$, to produce textile output, $X(v)$, from
machinery of vintage $v$. The capital-output ratio, $\alpha^{-1} = K(v)/X(v)$,
is assumed constant over time, while the capital-labor ratio,
$b(v) = K(v)/L(v)$, is assumed to increase exponentially:

$$b(t) = e^{\mu(t-v)}\,b(v). \tag{1}$$

Because of the labor-saving technical progress expressed by (1), firms
expect commodity prices to fall at the same rate as $b(t)$ increases:

$$P(t) = e^{-\mu(t-v)}\,P(v). \tag{2}$$

On these assumptions it may be shown that the present value at time $v$
of net revenue over the entire useful life, $u$, of a machine of vintage $v$ is

$$\int_{v}^{v+u} \left[\alpha P(t) - w\,b(v)^{-1}\right] e^{-i(t-v)}\,dt,$$

where $w$ is the annual wage rate for unskilled labor and $i$ is the money
rate of interest. The firm assumes both $i$ and $w$ to be constant. The
net worth of the new acquisition can be written as the difference between


The authors are indebted to Rafael Lusky for helpful discussions.
¹ We have borrowed freely from Williamson’s phrasing in the following summary
of his model.



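this net income stream and the price, $p$, of a new capital good of vintage $v$:

$$\pi(v) = \frac{\alpha P(v)}{\mu + i}\left(1 - e^{-(\mu+i)u}\right) - \frac{w}{i\,b(v)}\left(1 - e^{-iu}\right) - p. \tag{3}$$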
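The step from the present-value integral to the closed form in equation (3) can be checked numerically. In this sketch the net revenue flow per unit of capital is taken to be αP(v)e^{-μτ} − w/b(v) at age τ, discounted at rate i. The parameter values are arbitrary assumptions, not taken from the article, and the function names are hypothetical.

```python
import math


def pi_closed(alpha, P_v, w, b_v, p, mu, i, u):
    """Closed form for the net worth of a machine of vintage v, as in equation (3)."""
    s = mu + i
    revenue = alpha * P_v / s * (1.0 - math.exp(-s * u))
    wages = w / (i * b_v) * (1.0 - math.exp(-i * u))
    return revenue - wages - p


def pi_numeric(alpha, P_v, w, b_v, p, mu, i, u, n=20000):
    """Midpoint-rule integral of discounted net revenue over the life u, less p."""
    h = u / n
    total = 0.0
    for k in range(n):
        tau = (k + 0.5) * h                      # time since installation
        flow = alpha * P_v * math.exp(-mu * tau) - w / b_v
        total += flow * math.exp(-i * tau) * h
    return total - p


# Illustrative parameter values (assumptions, not taken from the article).
params = dict(alpha=0.5, P_v=1.0, w=0.2, b_v=1.0, p=1.0, mu=0.03, i=0.08, u=10.0)
closed = pi_closed(**params)
numeric = pi_numeric(**params)
print(closed, numeric)
```

The two values should agree to within the integration error, which supports reading equation (3) as the integral of the discounted flow less the machine price.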

The price, $p$, of capital goods is assumed to remain constant over time.
Williamson says that “by maintaining constant capacity and replacing
the capital stock every $u$ years, we can get an expression, $A(0)$, for the
net worth of future physical capital stocks required for replacement”
(p. 1325).² He derives an equation for $A(0)$ as follows: “The physical
stock of capital required for the $j$th replacement is
$K(ju) = X(v)/\alpha = X(0)/\alpha$, and its net worth at $t = 0$ is
$\alpha^{-1}X(0)\pi(0)(1 + e^{-iu} + e^{-2iu} + e^{-3iu} + \cdots)$. Thus
$A(0) = [X(0)\pi(0)]/[\alpha(1 - e^{-iu})]$” (p. 1325, n. 5).

Williamson rewrites the above equation as

$$A(0) = \alpha^{-1}X(0)\left[\frac{\alpha P(0)}{\mu+i}\,\frac{1 - e^{-(\mu+i)u}}{1 - e^{-iu}} - \frac{w}{i\,b(0)} - \frac{p}{1 - e^{-iu}}\right]. \tag{4}$$

To obtain the first-order condition for maximization of net worth, he
sets $dA(0)/du = 0$ and rearranges:

$$\frac{ip}{\alpha P(0)} = \frac{1}{\mu+i}\left[i - (\mu+i)e^{-\mu u} + \mu e^{-(\mu+i)u}\right]. \tag{5}$$

Equation (5) presents a couple of puzzles. First, one would not expect
the initial price of output to affect optimal replacement, since Williamson
writes that “our interest is in the replacement decision under constant
capacity conditions” (p. 1325). But from implicit differentiation of (5),

$$\frac{du}{dP(0)} = \frac{ip\,e^{\mu u}}{\alpha\mu P(0)^{2}\left(e^{-iu} - 1\right)}, \tag{6}$$

which is negative. The frequency of replacement would increase with
a rise in the initial price level, even though at constant capacity the
frequency of replacement can have no effect on total sales revenue. To
put it another way, under conditions of arbitrarily selected constant

² Italics in the original. Williamson says his model is an extension of one
presented by Brems (1968). Brems’s model is one of neutral technical progress.
None of our criticisms of Williamson’s model applies to Brems’s. Williamson’s
appendix follows Brems’s presentation closely and contains minor errors only
when Williamson deviates from Brems. The equilibrium condition, for example,
should be



the equations are invalid when $\mu = 0$, and it is unreasonable to expect



to assume negative values if that expression is a good approximation to u 




capacity, one would expect maximization of net worth to have a cost
minimization dual in which the price of output has no direct effect.
Second, since newer machines use less labor than older ones to produce
the same amount of output, one would expect that the higher the wage
level, the more frequently the entrepreneur would replace machines.
But in Williamson’s model, as he points out, “unskilled wages do not
appear to influence the replacement decision at all” (p. 1327).

Such implications lead one to suspect that equation (4') is incorrect,
and indeed it is. To see why, let us spell out the implicit steps in
Williamson’s derivation of that equation. His crucial sentence is, once
more: “The physical stock of capital required for the $j$th replacement is
$K(ju) = X(v)/\alpha = X(0)/\alpha$, and its net worth at $t = 0$ is
$\alpha^{-1}X(0)\pi(0)e^{-iju}$” (p. 1325, n. 5). Now the net worth at time
$ju$ of the capital required for the $j$th replacement is
$\alpha^{-1}X(0)\pi(ju)$. To find the value of this net worth at $t = 0$, one
must discount $\alpha^{-1}X(0)\pi(ju)$ by a factor of $e^{-iju}$, obtaining
$\alpha^{-1}X(0)\pi(ju)e^{-iju}$. An implicit step in Williamson’s derivation
of equation (4), then, is

$$\alpha^{-1}X(0)\pi(ju)e^{-iju} = \alpha^{-1}X(0)\pi(0)e^{-iju} \tag{7}$$

or

$$\pi(ju) = \pi(0). \tag{7'}$$

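Equation (7') is incorrect, however, except for special cases irrelevant
to Williamson’s presentation. To show this, we modify equation (1) to
express $b(ju)$ in terms of $b(0)$ and equation (2) to express $P(ju)$ in
terms of $P(0)$:

$$b(ju) = e^{\mu ju}\,b(0), \tag{1'}$$

$$P(ju) = e^{-\mu ju}\,P(0). \tag{2'}$$

Substituting from equations (1') and (2') into equation (3), we obtain

$$\pi(ju) = \frac{\alpha P(0)e^{-\mu ju}}{\mu+i}\left(1 - e^{-(\mu+i)u}\right) - \frac{w\,e^{-\mu ju}}{i\,b(0)}\left(1 - e^{-iu}\right) - p. \tag{3'}$$

Also, $\pi(0)$ may be expressed as

$$\pi(0) = \frac{\alpha P(0)}{\mu+i}\left(1 - e^{-(\mu+i)u}\right) - \frac{w}{i\,b(0)}\left(1 - e^{-iu}\right) - p. \tag{3''}$$

Given equations (3') and (3''), equation (7') implies that

$$\left[\frac{\alpha P(0)}{\mu+i}\left(1 - e^{-(\mu+i)u}\right) - \frac{w}{i\,b(0)}\left(1 - e^{-iu}\right)\right]e^{-\mu ju} = \frac{\alpha P(0)}{\mu+i}\left(1 - e^{-(\mu+i)u}\right) - \frac{w}{i\,b(0)}\left(1 - e^{-iu}\right) \tag{8}$$

or

$$[\pi(0) + p]\,e^{-\mu ju} = \pi(0) + p. \tag{8'}$$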




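From equation (8'), either $\pi(0) = -p$ or $\mu = 0$ when $ju$ is positive.
In the first case, the entrepreneur would not install the first machine.
The second case is not useful for analyzing the effects of technical progress.
Equation (4), which forms the basis of all of Williamson’s article but the
appendix, is thus seen to be wrong. Given the assumption that an
entrepreneur would be willing to replace his machinery every $u$ years
over an infinite horizon, the correct form of equation (4) is

$$A(0) = \alpha^{-1}X(0)\sum_{j=0}^{\infty}\pi(ju)\,e^{-iju} \tag{9}$$

or

$$A(0) = \alpha^{-1}X(0)\left[\frac{\alpha P(0)}{\mu+i} - \frac{w\left(1 - e^{-iu}\right)}{i\,b(0)\left(1 - e^{-(\mu+i)u}\right)} - \frac{p}{1 - e^{-iu}}\right]. \tag{9'}$$

The first-order condition for maximization of net worth is

$$\frac{p}{w} = \frac{\left(1 - e^{-iu}\right)^{2}}{i^{2}\,b(0)\left(1 - e^{-(\mu+i)u}\right)^{2}} \times \left[i\left(1 - e^{-(\mu+i)u}\right) - (\mu+i)\left(1 - e^{-iu}\right)e^{-\mu u}\right]. \tag{10}$$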
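The difference between discounting π(ju) and discounting π(0) can be seen numerically. The sketch below works per unit of capital (it omits the common factor α⁻¹X(0)), takes π(ju) to decay with e^{-μju} as in equation (3'), and uses arbitrary illustrative parameters. Function names are hypothetical.

```python
import math


def pi_at(j, alpha, P0, w, b0, p, mu, i, u):
    """Net worth, at installation time ju, of the machine installed at ju (eq. 3')."""
    s = mu + i
    gross = (alpha * P0 / s * (1.0 - math.exp(-s * u))
             - w / (i * b0) * (1.0 - math.exp(-i * u)))
    return gross * math.exp(-mu * j * u) - p


def net_worth_by_sum(terms=400, **kw):
    """Discounted sum of pi(ju) over replacements, per unit of capital."""
    i, u = kw["i"], kw["u"]
    return sum(pi_at(j, **kw) * math.exp(-i * j * u) for j in range(terms))


def net_worth_closed(alpha, P0, w, b0, p, mu, i, u):
    """Closed form of the same sum (the bracket in equation 9')."""
    s = mu + i
    return (alpha * P0 / s
            - w * (1.0 - math.exp(-i * u)) / (i * b0 * (1.0 - math.exp(-s * u)))
            - p / (1.0 - math.exp(-i * u)))


def net_worth_constant_pi(alpha, P0, w, b0, p, mu, i, u):
    """What results if pi(ju) is replaced by pi(0) in every term (equation 7')."""
    return pi_at(0, alpha, P0, w, b0, p, mu, i, u) / (1.0 - math.exp(-i * u))


# Illustrative values only (assumptions, not from the article).
base = dict(alpha=0.5, P0=1.0, w=0.2, b0=1.0, p=1.0, mu=0.03, i=0.08, u=10.0)
no_progress = dict(base, mu=0.0)

summed = net_worth_by_sum(**base)
closed = net_worth_closed(**base)
shortcut = net_worth_constant_pi(**base)
print(summed, closed, shortcut)
```

With μ > 0 the shortcut based on π(0) overstates net worth relative to the discounted sum; with μ = 0 the two coincide, which is the special case noted in the text.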

At this point in our analysis assume we were interrupted by a textile
manufacturer who happened to be in our office seeking advice about
when to replace his machinery. He would not obey equation (10),
because his rule of thumb is never to invest in a machine costing $p$, even
if he plans to use it forever (to say nothing of only $u$ years), unless the
net annual return from it will be at least $ip$. Equation (10) requires him
to buy an infinite number of machines which could never repay their
cost. He quickly drew a sketch for us showing why (fig. 1).

With dollars plotted against time, the upper line shows the flow of
revenue, $\alpha P(t)$, from a single machine at time $t$, ignoring the elasticity
of demand for simplicity,³ and the lower line shows the flow of wages
paid the labor that operates a machine of vintage $t$. The vertical distance
between the lines represents the flow of net revenue, $r(t, t)$, from a
machine of vintage $t$ at the instant it is installed. The equation for that
distance is

$$r(t, t) = \alpha P(t) - w\,b(t)^{-1} \tag{11}$$

or

$$r(t, t) = \left[\alpha P(0) - w\,b(0)^{-1}\right]e^{-\mu t}. \tag{11'}$$

Clearly, $r(t, t)$ approaches zero as $t$ approaches infinity, and there is
some $t^*$ after which $r(t, t)$ is always less than $ip$.
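The manufacturer's cutoff date can be computed directly. The sketch below assumes the net revenue flow of a newly installed machine is r(t, t) = [αP(0) − w/b(0)]e^{-μt}, so that t* solves r(t*, t*) = ip. The parameter values and function names are illustrative assumptions.

```python
import math


def net_revenue_flow(t, alpha, P0, w, b0, mu):
    """r(t, t) = [alpha*P(0) - w/b(0)] * exp(-mu*t), as in equation (11')."""
    return (alpha * P0 - w / b0) * math.exp(-mu * t)


def cutoff_date(alpha, P0, w, b0, mu, i, p):
    """Date t* at which r(t, t) falls to the required return i*p."""
    initial = alpha * P0 - w / b0
    if initial <= i * p:
        return 0.0               # the very first machine already fails the test
    return math.log(initial / (i * p)) / mu


# Illustrative values only (assumptions, not from the article).
values = dict(alpha=0.5, P0=1.0, w=0.2, b0=1.0, mu=0.03)
i, p = 0.08, 1.0
t_star = cutoff_date(i=i, p=p, **values)
print(t_star)
```

Any machine installed after t* earns less than ip at the moment of installation and less still afterward, which is why the rule of thumb rejects replacement over an infinite horizon.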

Chastened by the manufacturer’s accusation that he would be the

³ Brems included a parameter for the elasticity of demand because he was
analyzing price determination as well as optimal replacement. We do not see
why Williamson included demand parameters.






victim of an unreasonable requirement, we allowed him to change his
price expectations so that $P(t)$ remains high enough to permit him to
pay his wage bill and buy new machines each $u$ years. With that out
of the way, we pointed out that product prices are irrelevant when the
entrepreneur is constrained to constant output. Equation (10) can be
derived through minimization of the present value, $C(0)$, of costs per
$\alpha^{-1}$ units of output per year:

$$C(0) = \frac{w\left(1 - e^{-iu}\right)}{i\,b(0)\left(1 - e^{-(\mu+i)u}\right)} + \frac{p}{1 - e^{-iu}}. \tag{12}$$
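The comparative statics of cost minimization can be explored with a crude grid search. This is a sketch, not the authors' procedure: it takes the cost expression to be C(0) = w(1 − e^{-iu}) / [i b(0)(1 − e^{-(μ+i)u})] + p / (1 − e^{-iu}), and the parameter values are arbitrary, chosen so that w/(i b(0)) exceeds p and an interior minimum exists. Function names are hypothetical.

```python
import math


def cost(u, w, b0, p, mu, i):
    """Present value of costs per unit of capacity, as in equation (12)."""
    s = mu + i
    wage_part = w * (1.0 - math.exp(-i * u)) / (i * b0 * (1.0 - math.exp(-s * u)))
    machine_part = p / (1.0 - math.exp(-i * u))
    return wage_part + machine_part


def optimal_life(w, b0, p, mu, i, lo=0.1, hi=200.0, step=0.01):
    """Crude grid search for the replacement interval u that minimizes cost."""
    best_u, best_c = lo, cost(lo, w, b0, p, mu, i)
    n = int(round((hi - lo) / step))
    for k in range(1, n + 1):
        u = lo + k * step
        c = cost(u, w, b0, p, mu, i)
        if c < best_c:
            best_u, best_c = u, c
    return best_u


# Illustrative values chosen so that w / (i * b0) > p; they are assumptions.
base = dict(w=1.0, b0=1.0, p=5.0, mu=0.05, i=0.10)
u_base = optimal_life(**base)
u_high_wage = optimal_life(**dict(base, w=2.0))
u_dear_machine = optimal_life(**dict(base, p=7.5))
print(u_base, u_high_wage, u_dear_machine)
```

In this illustration a higher wage shortens the optimal life of a machine and a higher machine price lengthens it, in line with the signs the authors attribute to implicit differentiation of (10).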


As we were about to write out demonstrations that, assuming second-order
conditions hold, implicit differentiation of (10) yields $du/dw$
negative, $du/dp$ positive, and so on, the manufacturer broke in again.
“The gain from replacing a machine,” he told us, “is the saving in the
wage bill from using less labor. But since the wage bill approaches
zero as a result of labor-saving technical progress and constant wage
rates, there will come a time when I shall refuse to buy a new machine
in order to reduce the wage bill.” Since this argument was analogous
to his earlier one, he did not have to draw us a picture. “I shall replace
a machine,” he continued, “whenever $ip$, the marginal cost of replacing
it this instant instead of next, equals the reduction in the wage bill from
installing a new machine.”

Seeing how crestfallen we were at his forsaking an infinite horizon,
however, he suggested a way out. “My experience has been,” he said,
“that wages rise just as fast as new machines save labor.” We realized
that this was the right assumption to introduce stationarity into the
problem, and that Williamson’s mistake in adapting from Brems (1968)




was to introduce nonstationarity when he assumed labor-saving instead
of neutral technical change.⁴

Now let the entrepreneur’s expectations about wages be

$$w(t) = w(0)e^{\mu t}. \tag{13}$$

Equation (12) must be rewritten to accord with this new assumption:


C(0) 


- 1 ) ^_ t- 

b{i))n{l - «■■“) 1 - 


(14) 


The first-order condition for cost minimization is


bfitpw{0)-' = - in + !)^'‘“ + * (15) 


Letting $C_{uu}$ represent the second derivative of $C(0)$ with respect to $u$,
we assume the second-order condition for cost minimization holds,
$C_{uu} > 0$. From implicit differentiation of the first-order condition,


du 

dw{Q) 


-(C„„i(0)p)- 


e-’“)-^[(l - s-'")//?"" - («"“ - l)t<t"'“J, 


(16) 


which is negative. The higher the initial wage level, the greater the
frequency of replacement.
Similarly,


dp ~ C„„(l - 


(17) 


which is positive. The higher the price of machines, the lower the
frequency of replacement. The effect of $\mu$ on $u$ is given by


^ lu - 

du C„„i(0)(l - e-'")L (l-r-'“)V p" / 


(18) 


The higher the rate of technical progress, the lower the frequency of
replacement, since $du/d\mu$ is negative for plausible values of $\mu$ and $i$,
specifically for

I + -j > 2 (19) 



It must be remembered, however, that an increase in $\mu$ implies not only
more rapid technical progress but also more rapidly rising wages.


⁴ A model exhibits stationarity when historical time is irrelevant to
inferences drawn from it. Only relative time matters. Our assumption that the
rate of increase of wages equals the rate of technical progress is not
satisfying but may be no more restrictive than Williamson’s assumption (with
which he was not content) that wages are constant. Also, our revision of the
model requires less restrictive assumptions about expectations regarding
product prices than does Williamson’s version.




Equation (20) gives the effect of changes in interest rates on equipment
longevity:

- =«-'“(! - 
di 

X + (1 - «■'")(! - I*) + 2i^s-''‘] (20) 

The value of $du/di$ is positive provided

1 ^ + «■'“ < 1 ( 21 ) 

From the preceding analysis it can be seen that the major conclusions
of Williamson’s article are unfounded. He has not shown that with
labor-saving capital-embodied technical progress “the level of unskilled wages
prevailing in the textile sector is irrelevant” (p. 1327). He has not shown
that Habakkuk’s model is internally inconsistent. He has not shown that
“a tariff structure which distorts relative prices by raising the relative
price of consumption goods should increase the optimal replacement
rate” (p. 1330).

Why were there differences between American and British behavior
in the replacement of textile machinery? More data about wages and
machinery prices are needed before Habakkuk’s hypothesis about the
effect of relative wages on replacement can be tested. Our analysis
suggests that relative wages might matter. Aside from that, Williamson’s
explanations at the beginning of his second section (pp. 1321-22) appear
plausible.


References 

Brems, Hans. Quantitative Economic Theory. New York: Wiley, 1968.
Habakkuk, H. J. American and British Technology in the Nineteenth Century.
New York: Cambridge Univ. Press, 1962.

Williamson, J. G. “Optimal Replacement of Capital Goods: The Early New
England and British Textile Firm.” J.P.E. 79, no. 6 (November/December
1971): 1320-34.

Note Added in Proof. The sentence following equation (18) should read: “The
higher the rate of technical progress, the higher the frequency of replacement,”

D. D. and D. S.



Optimal Replacement of Capital Goods in
Early New England and British Textile
Firms: Reply

Jeffrey G Williamson 

University of Wisconsin 


While attempting to make sense out of an important difference in early
nineteenth-century Anglo-American replacement behavior, my 1971
article (Williamson 1971) developed a model which produced an apparent
paradox. The unskilled wage influenced replacement only under
conditions of neutrality in technical progress. In the historically more
relevant case of labor-saving technical progress, it appeared that the
unskilled wage played no role in determining the replacement decision
in textiles. Professors Denslow and Schulze correctly point out that the
paradox was more apparent than real. The first-order condition for the
maximization of net worth derived from my original paper (p. 1326)
was



— [i 


(/t + 


( 1 ) 


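where $\delta = (\mu + i)$. In the Denslow-Schulze version we get

$$\frac{p}{w} = \frac{\left(1 - e^{-iu}\right)^{2}}{i^{2}\,b(0)\left(1 - e^{-(\mu+i)u}\right)^{2}} \times \left[i\left(1 - e^{-(\mu+i)u}\right) - (\mu+i)\left(1 - e^{-iu}\right)e^{-\mu u}\right], \tag{2}$$

which looks very different indeed. The differences rapidly diminish
when the Denslow-Schulze expression (2) is rewritten: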



A(0) = 1 [i -p + 

lA 


ifi + O*'”"], 


where i4 = (1 — <~'“)“^[i(l — Recalling that 6(0) is the 

initial capital-labor ratio and multiplying numerator and denominator 
of the left-hand side by P(0) A(0), we get a new corrected expression (3) 


ac(0)'' + + O*"”"]. (3) 



where $\alpha_L(0)$ is the initial labor share in textiles. Expression (3)
begins to look more like my original expression (1). Furthermore, it is far
more useful for historical analysis, since income shares and capital-output
variables are directly observable for the antebellum period, while input
price ratios are far more difficult to document.

After having made this correction, Denslow and Schulze tend toward
the overzealous. They may be forgiven, I suppose, since they apparently
have no interest in the historical issues for which the analysis was used
as a tool. They spend the remainder of their comment exploring the
implications of their corrected model under the assumption that entrepreneurs
expect wages to rise at the same rate at which labor-saving technical change
takes place. One can, of course, assume anything one wishes, but in historical
analysis one is compelled to be more discreet. My primary focus in the
earlier paper was the 1820s and 1830s, when Anglo-American differences
in replacement policy were most apparent. Scraps of real wage data
from this period suggest very moderate, if any, rises in $w$. Lebergott
(1960, p. 493) estimates that real wages of nonfarm employees fell by
as much as 10 percent from 1818 to 1830 and rose by only 10-20 percent
in the 1830s. Average real daily earnings of cotton textile workers
actually fell by 10 percent between 1832 and 1840.¹ The evidence is
no different regarding the ratio $p/w$. The price of machine-shop products
declines from an index of 100 to 97 between 1834 and 1844. Money
wages paid female operatives at Lowell after 1835 declined to 94 by 1842
but rose to 102 by 1846.² All of this evidence indicates that the
Denslow-Schulze assumption makes their subsequent analysis a sterile exercise.
Indeed, their inevitable result that higher rates of obsolescence (higher
$\mu$) implies lower rates of replacement is downright silly.

But let me return to the key issues in the debate. Denslow and Schulze
conclude that when the correct expression (3) is employed in the historical
analysis “the major conclusions of Williamson’s article are unfounded.”
This is a curious statement, since it seems to me remarkable
how little the conclusions of the earlier argument require revision.

Numerical analysis of equation (3) for $(K/X) = 30$ and $\alpha_L = 0.403$,
a capital-output ratio and labor share prevailing in American textiles
in the 1830s, yields results very similar to those reported in figure 1
(Williamson 1971, p. 1329) of my original article. Thus, my earlier
conclusion remains unchanged: “The evidence of far higher replacement
rates in the United States textile sector in the 1820s and fairly
similar replacement rates in the 1850s can only be explained by a
convergence of expected rates of technical change between the two
economies. . . . Interest rate differentials apparently had the opposite impact”
(p. 1328). Note further that $(K/X)\,\alpha_L(0)^{-1} = (p/w)\,b(0)$. Given $b$,
the capital-labor ratio, constant, numerical analysis shows that a decline
in $p/w$ would induce lower replacement rates or a higher average age of
equipment. Since this key input price ratio does in fact decline during
the 1830s and 1840s, we have an explanation for the convergence in
Anglo-American replacement rates over the antebellum period. In
addition, my original conclusion that “the low relative price of investment
goods created by the United States tariff structure in the 1820s
should have further widened differences in replacement policy” (p. 1331)
still stands if we are willing to accept the conventional view that the
tariff structure acted to raise the relative price of (industrial) wage
goods, as it surely did.³

¹ Money wages for U.S. cotton textiles from Lebergott (1960, p. 476, table 4),
deflated by the Warren-Pearson price index.

² The prices of capital goods are from Brady (1966, p. 110, table 2a). The
Lowell daily wage rates are taken from Lebergott (1960, p. 476, table 4).


References 

Brady, D. S. “Price Deflators for Final Product Estimates.” In Output,
Employment, and Productivity in the United States after 1800. New York:
Nat. Bur. Econ. Res., 1966.

Brito, D. L., and Williamson, J. G. “Skilled Labor and Nineteenth Century
Anglo-American Managerial Behavior.” Explorations Econ. History 10, no. 3
(Spring 1973): 235-53.

Lebergott, S. “Wage Trends, 1800-1900.” In Trends in the American Economy in
the Nineteenth Century. Princeton, N.J.: Nat. Bur. Econ. Res., 1960.
Williamson, Jeffrey G. “Optimal Replacement of Capital Goods: The Early
New England and British Textile Firm.” J.P.E. 79, no. 6 (November/
December 1971): 1320-34.

³ A far more comprehensive analysis of contrasting Anglo-American
nineteenth-century industrial managerial behavior can be found in Brito and
Williamson (1973).



A Note on the Stability of Metzler's 
Phase Diagram 

Don Patinkin
The Hebrew University


Lloyd Metzler’s classic article on “Wealth, Saving, and the Rate of
Interest,” which appeared in this Journal over 20 years ago, is noted
for its clear and searching analysis of the implications of the Pigou or
real-balance effect, and for the crucial distinction it made in this context
(from the viewpoint of optimum portfolio composition, and hence
neutrality) between a monetary change generated by a current deficit
and one generated by an open-market operation.

But in addition to this valuable substantive contribution, Metzler’s
article also made an important contribution to technique by providing
a graphical analysis of the stability of the system in terms of what is now
termed a “phase diagram.” And though such a diagram is essentially
to be found much earlier in the literature,¹ Metzler’s article is, to the
best of my knowledge, the first instance of its being applied to the
Keynesian system.² The purpose of this note is first to make a minor
refinement of Metzler’s diagrammatic analysis, and then to point out that
its geometric proof of stability in the small is actually inadequate, as
is also the case for the corresponding proofs that accompany other phase
diagrams in the literature.

For convenience, let me carry out the discussion in terms of the version
of the Metzlerian diagram that I have used elsewhere (Patinkin 1965,
pp. 232-33 et passim). The curve CC in figure 1 represents the locus of
combinations of the rate of interest, r, and price level, p, which satisfy
the equilibrium condition in the commodity market



I am indebted to William Baumol, Stanley Fischer, David Levhari, and Jerome
Stein for comments on an earlier draft of this note. Needless to say, none of
them is to be held responsible for the views here expressed. I wish also to
thank the Central Research Fund of The Hebrew University and the Israel
Academy of Sciences and Humanities for research grants to cover the costs of
technical assistance.
¹ See, e.g., Marshall 1923, pp. 341 and 353-55.

² Though, of course, the first analytical treatment of the stability of the
Keynesian system (and hence of the forces that lie behind the corresponding
phase diagram) is to be found in Samuelson’s famous 1941 article on the
stability of equilibrium (pp. 113-20).




Fig. 1


where Y₀ is the fixed full-employment level of output, and M₀ is the
fixed quantity of money. Similarly, BB represents the equilibrium
condition in the bond market



As usual, let the respective directions of the arrows in the four regions
of the accompanying diagram represent the dynamic assumptions that
whenever there is an excess demand (supply) in the commodity market,
the price level will be driven upward (downward), and that whenever
there is an excess demand (supply) in the bond market, the rate of
interest will be driven downward (upward). Then, concludes Metzler,
“When the economic system is out of balance at least one force is always
operating to bring the variables of the system closer to the equilibrium
point” (1951, p. 105).

First, my refinement of this argument. Immediately after the intersection
of the dynamic adjustment path with one of the equilibrium
curves, both forces must push in the direction of equilibrium. For consider
the dashed dynamic path that begins (say) at point P. By assumption,




as the path approaches CC, the extent of excess supply in the commodity
market becomes less; indeed, when the path reaches the CC curve, this
market is in equilibrium, so that the path must be vertical at the
intersection point, Q. Thus immediately after point Q, both p and r must
move in the direction of the equilibrium point E. A corresponding
statement holds, mutatis mutandis, for the intersection point R, where the
path must be horizontal.

Furthermore, we can in a similar way see that the slope of the adjustment
path throughout (say) region III must become increasingly flatter.
For as the path moves from intersection point Q, the extent of the excess
demand in the commodity market increases, thus increasing the horizontal
component of the dynamic vector; at the same time, the fact that the
path is moving in the direction of BB means that the extent of the excess
demand in the bond market is decreasing, thus decreasing the vertical
component of the vector. The consequent decrease (absolutely) in the
slope of the adjustment path in region III will continue until the path
becomes horizontal at its intersection with BB at point R, as already
noted.

The foregoing argument, even without this refinement, has generally
been considered as constituting a proof of the stability of the system in
the small, and I have certainly used it in this way throughout my own
work. But I have recently realized that this proof is not really valid,
as is demonstrated (in fig. 1) by the counterexample provided by
the dotted dynamic path PQ'R'S' that meets all of the preceding
restrictions on its slope yet diverges from the equilibrium point E. (This
path has been drawn to take full advantage of the relative slopes of the
CC and BB curves, which, as the reader may have guessed, have been
chosen so as to correspond to the case of a divergent cobweb.) Such a
divergence is impossible in the analytical solution which, under the
aforementioned dynamic assumptions, yields an adjustment path that
must be contained within circles of ever decreasing radii, centered on the
equilibrium point E. The verticality of the adjustment path when it
crosses CC, and its horizontality when it crosses BB, does imply that
this must be the case in the respective neighborhoods of the intersection
points Q', R', S', T', . . . . But there does not seem to be any way of
demonstrating graphically that this must continue to be the case as the
path moves out of these neighborhoods. In other words, the geometrical
properties of the dynamic adjustment path as described above do not
seem to be sufficient to prove that intersection point R' must lie within
the circle centered on E and passing through Q'.³
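The analytical side of this argument can be illustrated with a linearized version of the two adjustment equations. The sketch below is an illustration under assumptions, not a reconstruction of the model in the text. It assumes that excess demand for commodities falls with both p and r, that excess demand for bonds rises with r and falls with p, and that adjustment speeds are unity; the slope values are hypothetical. It checks the standard trace and determinant conditions for local stability of a two-variable linear system and traces one path by forward Euler steps.

```python
import math


def jacobian(e_cp, e_cr, e_bp, e_br, k1=1.0, k2=1.0):
    """Linearized dynamics around E, in deviations from equilibrium:
        dp/dt =  k1 * (excess demand for commodities)
        dr/dt = -k2 * (excess demand for bonds)
    """
    return ((k1 * e_cp, k1 * e_cr), (-k2 * e_bp, -k2 * e_br))


def is_locally_stable(J):
    """Two-variable linear system: stable iff trace < 0 and determinant > 0."""
    trace = J[0][0] + J[1][1]
    det = J[0][0] * J[1][1] - J[0][1] * J[1][0]
    return trace < 0 and det > 0


def simulate(J, p0, r0, dt=0.001, steps=20000):
    """Forward-Euler path of deviations (p, r) from the equilibrium point."""
    p, r = p0, r0
    for _ in range(steps):
        dp = J[0][0] * p + J[0][1] * r
        dr = J[1][0] * p + J[1][1] * r
        p, r = p + dt * dp, r + dt * dr
    return p, r


# Hypothetical slopes with the signs assumed in the lead-in.
J = jacobian(e_cp=-1.0, e_cr=-0.5, e_bp=-0.8, e_br=1.0)
start = (1.0, 1.0)
end = simulate(J, *start)
start_distance = math.hypot(*start)
end_distance = math.hypot(*end)
print(is_locally_stable(J), start_distance, end_distance)
```

With these sign assumptions the trace is negative and the determinant positive whatever the magnitudes, so the linear system converges. That convergence comes from the algebra of the linear system and does not depend on the slope restrictions read off the diagram.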

What can, however, be graphically established is the divergence of the

³ My colleague, Eytan Sheshinski, has suggested that this inability to provide
a graphical proof of stability in the small may be due to the fact that the
graphical argument does not make use of the assumption of linearity (in the
neighborhood of the equilibrium point) on which the analytical proof of
stability is based.






system in the case in which both dynamic forces push the variables away
from their equilibrium values. For example, if the dynamic path in
sector III of figure 2 is subject to the forces represented by the arrows
there, then the path must remain in this sector and continuously diverge
from the equilibrium position. In particular, it is impossible for the
path to cross the BB curve and thus come under the convergent force of
sector IV. For, as we have seen above, in order to cross BB, the curve
must, just before leaving sector III, move horizontally eastward (as on
the path TZ), whereas the arrows show that the adjustment path in this
sector (represented by TW) can never move in such a direction.

References 

Marshall, Alfred. Money, Credit, and Commerce. London: Macmillan, 1923.
Metzler, Lloyd A. “Wealth, Saving, and the Rate of Interest.” J.P.E. 59 (April
1951): 93-116.

Patinkin, Don. Money, Interest and Prices. 2d ed. New York: Harper & Row, 1965.
Samuelson, Paul A. “The Stability of Equilibrium: Comparative Statics and
Dynamics.” Econometrica 9 (April 1941): 97-120. Reprinted in The Collected
Scientific Papers of Paul A. Samuelson. Vol. 1. Cambridge, Mass.: MIT Press,
1966.



Managerial Tenure under Private and 
Government Ownership in the 
Electric Power Industry 

Louis De Alessi 
George Washington University


Recent interest in the economic consequences of government regulation
and government ownership of business enterprises has led to some fruitful
applications of economic theory. The effects of an active profit constraint
on the behavior of a wealth-maximizing firm (Averch and Johnson 1962)
have been subjected to increasingly rigorous theoretical scrutiny (e.g.,
Takayama 1969). Though the discussion continues (e.g., Zajac 1972),
there is growing consensus about the general implications obtained (cf.
Baumol and Klevorick 1970). Concurrently, other researchers have
investigated the choices of utility-maximizing decision makers within
government-regulated, privately owned firms (Alchian and Kessel 1962),
within government-owned business firms (Alchian 1965), and within
regulatory agencies themselves (Eckert 1968). Empirical results to date
(e.g., Davies 1971; Peltzman 1971) support the conjecture that differences
in the structure of property rights associated with these institutions
provide government and private decision makers with different cost-
reward structures and yield systematic differences in the outcome of the
decision process. In particular, the approach has been shown (De Alessi
1972) to be useful in explaining the published evidence relating to
regulated and unregulated, private- and government-owned firms in the
electric power industry (e.g., Stigler and Friedland 1962; Moore 1970;
Peltzman 1971).

One of the alleged implications of the utility-maximization hypothesis
is that government decision makers enjoy longer tenure than their
counterparts within privately owned firms (e.g., De Alessi 1969, 1972).
This paper presents a test of the hypothesis, using data drawn from the
electric power industry. Section I contains a brief derivation of the
hypothesis;¹ Section II, a description of the data; and Section III,
the test results. Section IV contains the summary and conclusions.


The research for this paper was supported by a grant from the National Science
Foundation. J. L. Gastwirth and an anonymous referee offered helpful suggestions.
¹ This section is abstracted from a broader study of the economics of government
control (De Alessi 1972).




I. The Hypothesis 

An individual decision maker is hypothesized to maximize a single-valued,
convex, twice-differentiable utility function. The variables in the
function reflect a variety of goals, such as health, security, prestige, and
the welfare of others (De Alessi 1967). At the margin, all goods (sources
of streams of utility to the chooser) are substitutable and, for each
individual, at least some are scarce. Fulfillment of the usual intra- and
intertemporal equilibrium conditions implies that, among other things,
the lower the cost of any good, the more will be acquired.

Accordingly, a utility-maximizing manager has the incentive to increase
the size and the duration of all job-related pecuniary and nonpecuniary
streams of income. For example, to increase his wealth, he
has the incentive to increase the value of his marginal product by
employing more staff and capital (Mayer 1960) and by increasing the size
of the budget that he administers (Niskanen 1971). This strategy, in
turn, is facilitated by reallocating activities from the future to the present
(De Alessi 1969) and by increasing such variables as the output, the sales,
and the size of the firm. Similarly, a manager has the incentive to
increase his nonpecuniary sources of income by hiring co-workers and
employees with more desirable characteristics (as he sees it), by arranging
more pleasant working conditions (as he sees it), and so on. Up to a
point, of course, such managerial behavior yields an increase in the
firm’s profits and is implied by the standard owners’ wealth-maximization
hypothesis. The manager’s ability to extend his activities beyond this
point, thereby increasing his welfare at the expense of his employers’
wealth, depends primarily upon the extent to which his contractual
obligations (explicit and implicit) are enforced and upon the effectiveness
of internal control forces.

For present purposes, the crucial difference between private and
government-owned (i.e., political) firms lies in the relatively higher cost
of transferring ownership shares in the latter (Alchian 1965). A taxpayer
who wishes to change his portfolio of ownership in political firms
normally must change the political jurisdictions in which he works and
resides, or change the activities of the firms, or engage in similar
high-cost activities. As a result, property rights in political firms effectively
are nontransferable. This rules out specialization in their ownership and
inhibits the capitalization of future consequences into current transfer
prices, thereby reducing the incentive to detect and police managerial
behavior which is inconsistent with employers’ welfare. Among other
things, this implies that managers of political firms have greater
opportunity to increase their own welfare at the expense of their employers’
wealth, and that they find policies designed to increase noncapitalizable
returns to be particularly attractive.² It might also be noted that control
devices (e.g., operating rules, line appropriations) internal to government
firms become less market oriented as political considerations become
more important in evaluating the outcome of the decision process.

In the case of privately owned firms subject to government regulation,
an active profit constraint implies that the gain to the owners from
enforcing contracts falls. Indeed, it is zero over some range. Accordingly,
the manager’s transformation surface between pecuniary and nonpecuniary
sources of utility (the boundary of the attainable region in
his field of choice) shifts to the right, and he is able to increase all
activities which contribute to his welfare. Regulated firms, however,
are more exposed than political firms to the discipline of the market,
and their managers are more inhibited from making those choices which
lower the wealth of the employers. Among other things, the market for
corporate control does act to constrain managerial behavior (e.g.,
Hindley 1970) and to encourage market-oriented internal control
devices (e.g., De Alessi 1973). Moreover, the profit constraint need not
be binding and, to the extent that it is, its effect is weakened by regulatory
lag. On the other hand, the managers of government firms, particularly
those endowed with a politically influential clientele, can survive and
even prosper in the presence of persistent deficits and significant economic
losses. Accordingly, they have greater opportunity for utility-maximizing
behavior than the managers of private, regulated firms.³

The salary and other contractual pecuniary rewards of government
employees in general are subject to greater constraints than the pecuniary
rewards of executives of private, regulated firms. Among other things,
the pecuniary rewards of government employees normally are subject
to statutory ceilings, whereas those of the employees of private, regulated
firms normally are not even reviewed by the regulatory agencies.⁴ One
obvious consequence is that the opportunity cost of nonpecuniary sources
of utility is lower within political firms. More important, the manager
of a political firm will find that the opportunity cost of activities designed
to enhance the probability of his survival in office is also lower, and he
will have greater opportunity to increase the present value of his pecuniary
and nonpecuniary sources of income by increasing his job security. For
example, he will have greater incentive and opportunity to reward
subordinates for their loyalty to him, to institutionalize decision making
in order to dissipate responsibility for errors, and indeed, to institutionalize
tenure (Alchian 1959). As a result, the manager of a political firm
may be expected to survive longer in office.

² Employers’ incentive to enforce contracts may be expected to vary inversely with
the political level (e.g., local, state, federal) of the political organization in question.

³ The stream of benefits received by the manager of a political firm is less closely
related to the future profitability of the firm, since he is unable to fully capitalize and
capture job-related gains that will accrue in the future. Accordingly, he attaches a lower
value to such future gains relative to present gains than would otherwise be the case,
and he has the incentive to increase the latter at the expense of the former. Among other
things, this implies that he will adopt lower discount rates in evaluating the firm’s
investment options (De Alessi 1969). Dams will be bigger and investments more capital
consuming than would otherwise be the case. Note that, for exactly the same reasons,
employers also have less incentive to police such deviations.

⁴ Mr. Jewell, director of public relations of the National Association of Regulatory
Utility Commissioners, was queried. After looking into the matter, he stated that the
salaries of executives of regulated firms normally are not reviewed by regulators.


II. The Data

The hypothesis that managers of government-owned firms have longer
tenure than the managers of privately owned, government-regulated
business firms was evaluated over the period from 1962 to 1971 using
data drawn from the electric power industry.

Time period. The choice of period was narrowed by the observation
that prior to 1962 the Federal Power Commission did not collect the
necessary information (e.g., the names of managers) for government-
owned electric utilities. Although it might have been possible to obtain
data for years prior to 1962 by sending questionnaires to the individual
firms, it did not seem imperative to do so. The sample period ended in
1971, the last year for which data were on file when this study began.

The samples. Two random samples without replacement were chosen.
One sample consisted of 100 government-owned electric companies,
mostly municipals, drawn from the 489 firms listed in the index of
Statistics of Publicly Owned Electric Utilities in the United States, 1962 (U.S.
Federal Power Commission, 1963b). The population did not include
federal utilities. The other sample, also consisting of 100 firms, was drawn
from the 223 firms listed in the index of Statistics of Privately Owned Electric
Utilities in the United States, 1962 (U.S. Federal Power Commission, 1963a).

Both samples excluded firms which did not report the desired
information in 1962 (four government and four private firms), which
ceased to report prior to 1971 (13 private firms), and which shared
identical management with firms already in the sample (two government
and 10 private firms).

Data. The basic data consisted of the name of the top executive of
each firm for each year from 1962 to 1971. The initial intention had been
to observe the tenure experience of individuals at several different levels
of top management. Government-owned electric companies, however,
only report the name of their top executive (e.g., superintendent,
manager) to the Federal Power Commission. As a result, comparison
between samples was limited to that slot. The possibility of systematic
differences in the tenure experience of individuals at different levels of
top management was explored within the privately owned electric firms
using data on the top three executives (president and below) for the
first 50 firms drawn.

Length of tenure was indexed by the number of times that the name
of the manager changed during the sample period. A change was deemed
to occur if the name reported in one year was different from that reported
for the preceding year and if the name had not appeared before. This
last condition, designed to give individuals credit for coming back as
well as for hanging on, had a negligible effect on the test results. The
frequency of changes is shown in table 1.


TABLE 1

Frequency of Change in Top Executive of Firms in the
Electric Power Industry, 1962-71

                       Frequency of Change
Ownership                                            Sample
Category       0     1     2     3     4     5        Size

Political     36    42    17     3     1     1         100
Private       18    59    18     4     1     0         100


III. The Evidence 

Given the nature of the data (table 1), the χ² test seemed eminently
suitable to evaluate the null hypothesis that the tenure of managers is
the same for government and for privately owned electric companies.⁵
The test results (χ² = 8.89, df = 3) indicate that the null hypothesis
may be rejected at the 3 percent level of significance (two-tail).⁶ Indeed,
a comparison of the frequency distributions shown in table 1 suggests
that managers of government-owned electric companies enjoy longer
tenure, as hypothesized earlier.
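
The reported statistic can be checked directly from the frequencies in table 1. The following is a minimal Python sketch, not the author's own computation: it assumes the last three columns (3, 4, and 5 changes) are pooled into one cell, as the accompanying footnote describes, and it uses the closed-form upper-tail probability of the chi-square distribution that holds for 3 degrees of freedom. The function names are illustrative.

```python
import math

# Frequencies of 0, 1, 2, 3, 4, 5 changes of top executive (table 1).
observed = {
    "political": [36, 42, 17, 3, 1, 1],
    "private": [18, 59, 18, 4, 1, 0],
}


def collapse_tail(row, keep):
    """Keep the first `keep` cells and pool all remaining cells into one."""
    return row[:keep] + [sum(row[keep:])]


def pearson_chi_square(rows):
    """Pearson chi-square statistic for an r x c contingency table."""
    row_totals = [sum(r) for r in rows]
    col_totals = [sum(c) for c in zip(*rows)]
    grand_total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(rows):
        for j, obs in enumerate(row):
            expected = row_totals[i] * col_totals[j] / grand_total
            stat += (obs - expected) ** 2 / expected
    return stat


def chi_square_upper_tail_df3(x):
    """Upper-tail probability of a chi-square variate with exactly 3 df."""
    return math.erfc(math.sqrt(x / 2)) + math.sqrt(2 * x / math.pi) * math.exp(-x / 2)


# Pool the columns for 3, 4, and 5 changes, leaving four columns (df = 3).
table = [collapse_tail(observed[k], 3) for k in ("political", "private")]
statistic = pearson_chi_square(table)
p_value = chi_square_upper_tail_df3(statistic)
print(f"chi-square = {statistic:.2f}, p = {p_value:.3f}")
```

Under these assumptions the statistic rounds to the 8.89 given in the text, with a probability of about 3 percent.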

⁵ If degrees of freedom > 1, the χ² test may be used only if fewer than 20 percent of
the cells have an expected frequency of less than five and if no cell has an expected
frequency of less than one. The cells in the last two columns of table 1 fail to meet these
conditions. The prescribed remedy is to combine adjacent columns until the
requirements are met (Siegel 1956, p. 110), and this was done by consolidating the last three
columns of table 1.

⁶ If a change is noted every time that the name of the manager is different from that
reported for the preceding year, disregarding whether the name had appeared before,
then the test (χ² = 10.13, df = 3) allows rejection of the null hypothesis at an even
lower level of significance (2 percent, two-tail).

According to table 1, 36 percent of the government-owned firms did
not change their top executive over the entire sample period, compared
with 18 percent for privately owned firms. To examine the statistical
significance of this difference, the data were combined in a 2 × 2
contingency table (table 2).⁷ An additional advantage of the 2 × 2
classification is that it allows an unequivocal examination of the
directional hypothesis. The χ² statistic, adjusted for continuity, was computed.
The test results (χ² = 7.33, df = 1) are statistically significant at the
0.005 level (one-tail), and support the hypothesis that the managers of
government-owned electric companies enjoy longer tenure than the
managers of privately owned electric companies.


TABLE 2

Frequency of Change in Top Executive of Firms in the
Electric Power Industry, 1962-71

Ownership                               Sample
Category      No Change     Change       Size

Political         36           64         100
Private           18           82         100
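
The continuity-adjusted statistic for the 2 × 2 table can be reproduced in a few lines. This Python sketch assumes the adjustment is the usual Yates correction (the text says only "adjusted for continuity") and obtains the one-tail probability by halving the two-tail chi-square probability for 1 degree of freedom; the function name is illustrative.

```python
import math


def yates_chi_square(a, b, c, d):
    """Continuity-corrected chi-square for the 2 x 2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    numerator = n * (abs(a * d - b * c) - n / 2) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


# Table 2: political firms 36 no change / 64 change; private firms 18 / 82.
statistic = yates_chi_square(36, 64, 18, 82)

# For 1 df the two-tail probability is erfc(sqrt(x / 2)); halve it for one tail.
two_tail = math.erfc(math.sqrt(statistic / 2))
one_tail = two_tail / 2
print(f"chi-square = {statistic:.2f}, one-tail p = {one_tail:.4f}")
```

With the table 2 counts this rounds to the 7.33 reported in the text.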

It is possible that the tenure experience of a top executive is
systematically different from that of his immediate subordinates and that,
if the experience of the latter were taken into account, the test results
would be less favorable to the hypothesis. Although there is no obvious
theoretical justification for this argument, it seemed useful to examine its
validity insofar as the data permitted. Accordingly, the frequency of
change of the top three executives of privately owned electric companies
was collated over the sample period.

A simple examination of the data (table 3) suggests that the tenure
profiles are quite similar. This is not surprising, since higher-level
managers may be expected to move up together. Be that as it may, the
results are not inconsistent with the analysis presented earlier in this
paper.

TABLE 3

Frequency of Change in Three Top Executives of
Privately Owned Firms in the Electric Power Industry, 1962-71

                       Frequency of Change
Rank of                                              Sample
Executive      0     1     2     3     4     5        Size

Top            8    27    12     3     0     0          50
Second         6    30     8     5     0     1          50
Third          6    25    11     5     2     1          50


Although the managers of government-owned electric firms enjoy
longer tenure, it is at least arguable that this may be due to factors other
than the ownership of the firm. In the electric power industry, most
of the political firms are municipal and, relative to the private firms,
are generally smaller⁸ and purchase larger amounts of the power they
distribute.⁹ If smaller firms have a simpler hierarchical structure, their
chief executives might be promoted to that position earlier in life and
thus have a longer expected tenure. Similarly, if firms which purchase
a larger portion of the power they distribute present their managers with
simpler decisions and thus fewer possibilities of error, then their chief
executives would also have a longer expected tenure.

⁷ The data shown in table 2 are the same for both definitions of change.

⁸ Using total assets and electricity produced (as defined below) as alternative measures
of size, the t-test for the difference between two means was used to evaluate the null
hypothesis that the average size of private firms is smaller than or equal to the average
size of political firms. The results (t = 20.6 for output and 7.3 for assets) allow rejection
of this hypothesis at less than the 0.0005 level: in the electric power industry, private
firms generally are larger than political firms.

⁹ Of the firms sampled, one private (1 percent) and 48 political (48 percent) purchased
all of the power they distributed.

To examine the possible importance of these considerations, the tenure
experience within each sample was related to the size of the firm and to
the amount of electricity purchased. Average tenure (T), defined as the
length of the sample period divided by the number of chief executives
over the period, was regressed upon the size of the firm, defined as the
amount of electricity produced (P), and upon the amount of electricity
received (R).¹⁰ Note that P and R together account for the total amount of
electricity distributed by each firm.
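
To make the definition of average tenure concrete, the sketch below applies it to the frequencies in table 1. It rests on two assumptions that are not stated in the text: the sample period is counted as 10 years (1962 through 1971 inclusive), and a firm with k recorded changes is taken to have had k + 1 chief executives. The names used are illustrative only.

```python
SAMPLE_YEARS = 10  # assumption: 1962 through 1971 inclusive

# Number of firms with 0, 1, 2, 3, 4, 5 changes of top executive (table 1).
change_counts = {
    "political": [36, 42, 17, 3, 1, 1],
    "private": [18, 59, 18, 4, 1, 0],
}


def average_tenure(changes):
    """T for one firm: sample period divided by the number of chief executives."""
    return SAMPLE_YEARS / (changes + 1)  # assumption: executives = changes + 1


def mean_average_tenure(counts):
    """Mean of T across firms, given the count of firms at each change level."""
    total = sum(n * average_tenure(k) for k, n in enumerate(counts))
    return total / sum(counts)


mean_political = mean_average_tenure(change_counts["political"])
mean_private = mean_average_tenure(change_counts["private"])
print(f"political: {mean_political:.2f} years, private: {mean_private:.2f} years")
```

Under these assumptions the mean of T comes to roughly 6.4 years for the political sample and 5.5 years for the private sample.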

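$$T_{\text{mun}} = 6.62469 - \underset{(1.04)}{0.02686}\,P - \underset{(0.98)}{0.07663}\,R, \qquad R^2 = 0.041 \tag{1}$$

$$T_{\text{pvt}} = 5.65197 - \underset{(1.12)}{0.00461}\,P + \underset{(0.43)}{0.00548}\,R, \qquad R^2 = 0.013 \tag{2}$$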

The results (the t-statistic is shown in parentheses) are encouraging.
Although the size variable has the right sign, the level of significance is
high: 15 percent in the case of the political sample and 13 percent in the
case of the private sample (one-tail test).¹¹ The variable for kilowatt-hours
(kwh) received takes the opposite sign in the two regressions and
is not statistically significant in either case. Moreover, the coefficients of
determination are not impressive.
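
The one-tail significance levels quoted for the size variable can be approximated from the t-statistics alone. This sketch substitutes the standard normal distribution for Student's t, which is an assumption made here on the grounds that the samples are fairly large; exact t-based probabilities would differ slightly.

```python
import math


def one_tail_p_normal(t_stat):
    """Upper-tail probability of a standard normal variate at t_stat."""
    return 0.5 * math.erfc(t_stat / math.sqrt(2))


# t-statistics of the size variable P in the two tenure regressions.
p_political = one_tail_p_normal(1.04)
p_private = one_tail_p_normal(1.12)
print(f"political: {p_political:.2f}, private: {p_private:.2f}")
```

The approximation gives about 0.15 and 0.13, in line with the percentages stated above.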

¹⁰ The kwh produced and kwh received were measured by the average of the data
reported in Statistics for 1962 and for 1970, the latter being the last year for which
published data were available. The data, rounded to the nearest 1,000, were divided by
100,000 for scaling.

¹¹ As a further check, the regression was run using total assets (the average of the data
reported in Statistics for 1962 and for 1970) as an alternative measure of size (A). Assets,
rounded to the nearest 1,000, were also divided by 100,000 for scaling. The results are
practically identical to those obtained using output:

$$T_{\text{mun}} = 6.62594 - \underset{(0.96)}{0.29628}\,A - \underset{(0.88)}{0.07273}\,R, \qquad R^2 = 0.040 \tag{1A}$$

$$T_{\text{pvt}} = 5.64780 - \underset{(1.14)}{0.06412}\,A + \underset{(0.47)}{0.00609}\,R, \qquad R^2 = 0.013 \tag{2A}$$



To explore matters further, the samples were pooled and the regression was
computed with a dummy variable for ownership added. As shown in equation (3),
the ownership variable (D) has the predicted sign

$$T = 5.74353 - \underset{(1.27)}{0.00572}\,P + \underset{(0.09)}{0.00131}\,R + \underset{(1.61)}{0.66686}\,D, \qquad R^2 = 0.040 \tag{3}$$

and is the only one which is statistically significant at the 5 percent level
or less. The size variable has the predicted sign (managers of larger
firms have shorter tenure), but the relationship observed could have
occurred by chance with a probability of 10 percent. The variable for
kwh received also has the predicted sign, but it is not statistically different
from zero. The coefficient of determination again is relatively small.
Although additional tests would be useful,¹² these results are not
inconsistent with the hypothesis that the managers of political firms have
longer tenure.

IV. Summary and Conclusions

Recent theoretical contributions have suggested that the utility-maximization
hypothesis may usefully be extended to predict the choices of
decision makers within government-regulated and government-owned
business firms. Preliminary empirical results seem to support this
approach. Although the beginning is auspicious, the precise nature of the
constraints on individual decision makers associated with alternative
property rights structures and the range of application of the theory
have yet to be fully established.

According to the analysis presented in this paper, the utility-maximization
hypothesis implies that managers of government-owned
business firms have longer tenure in office than managers of regulated,
privately owned firms. The implication was tested using data drawn from
the electric power industry over the period from 1962 to 1971. Although
additional tests would be helpful, the evidence fails to deny it.

This study supports present extensions of the utility-maximization
hypothesis and provides additional evidence regarding the range of
application of the theory and the nature of the constraints on individual
decision makers.

¹² For example, other specifications of the tenure-size relationship could be examined
to provide additional assurance that size has been fully taken into account. Better results
might also be obtained by increasing the variation in tenure, perhaps via information
prior to 1962, and by introducing additional variables. Thus, some index of a manager’s
performance should be helpful. Although the improvement expected did not seem large
enough relative to the cost to justify more elaborate tests with these data, the points
being raised would provide a useful base for further research.



References

Alchian, A. A. “Private Property and the Relative Cost of Tenure.” In The Public
Stake in Union Power, edited by P. D. Bradley. Charlottesville: Univ. Virginia
Press, 1959.
---. “Some Economics of Property Rights.” Politico 30 (December 1965):
816-29.
Alchian, A. A., and Kessel, R. A. “Competition, Monopoly, and the Pursuit of
Money.” In Aspects of Labor Economics. Special Conference Series, vol. 14.
Princeton, N.J.: Princeton Univ. Press (for Nat. Bur. Econ. Res.), 1962.
Averch, H., and Johnson, L. L. “Behavior of the Firm under Regulatory
Constraint.” A.E.R. 52 (December 1962): 1052-1069.
Baumol, W. J., and Klevorick, A. K. “Input Choices and Rate-of-Return
Regulation: An Overview of the Discussion.” Bell J. Econ. and Management
Sci. 1 (Autumn 1970): 162-90.
Davies, D. G. “The Efficiency of Public versus Private Firms: The Case of
Australia’s Two Airlines.” J. Law and Econ. 14 (April 1971): 149-65.
De Alessi, L. “A Utility Analysis of Post-Disaster Cooperation.” Papers on
Non-Market Decision-Making 3 (Fall 1967): 85-90.
---. “Implications of Property Rights for Government Investment Choices.”
A.E.R. 59 (March 1969): 13-24.
---. “Government Ownership and Regulation: Theory and the Evidence
from the Electric Power Industry.” Unpublished manuscript, 1972.
---. “Private Property and Dispersion of Ownership in Large Corporations.”
J. Finance 28 (September 1973): 839-51.
Eckert, R. D. “Regulatory Commission Behavior: Taxi Franchising in Los
Angeles and Other Cities.” Ph.D. dissertation, University of California, Los
Angeles, 1968.
Hindley, B. “Separation of Ownership and Control in the Modern Corporation.”
J. Law and Econ. 13 (April 1970): 185-221.
Mayer, T. “The Distribution of Ability and Earnings.” Rev. Econ. and Statis. 42
(May 1960): 189-95.
Moore, T. G. “The Effectiveness of Regulation of Electric Utility Prices.”
Southern Econ. J. 36 (April 1970): 365-75.
Niskanen, W. A. Bureaucracy and Representative Government. Chicago:
Aldine-Atherton, 1971.
Peltzman, S. “Pricing in Public and Private Enterprises: Electric Utilities in
the United States.” J. Law and Econ. 14 (April 1971): 109-48.
Siegel, S. Nonparametric Statistics for the Behavioral Sciences. New York:
McGraw-Hill, 1956.
Stigler, G. J., and Friedland, C. “What Can the Regulators Regulate? The Case
of Electricity.” J. Law and Econ. 5 (October 1962): 1-16.
Takayama, A. “Behavior of the Firm under Regulatory Constraint.” A.E.R.
59 (June 1969): 255-60.
U.S., Federal Power Commission. Statistics of Privately Owned Electric Utilities in
the United States, 1962. Washington: Government Printing Office, 1963. (a)
---. Statistics of Publicly Owned Electric Utilities in the United States, 1962.
Washington: Government Printing Office, 1963. (b)
Zajac, E. E. “A Geometric Treatment of Averch-Johnson’s Behavior of the
Firm Model: Reply.” A.E.R. 62 (March 1972): 142.



Utility Maximization Sufficient for 
Competitive Survival: Comment 

Robert Piron 

Oberlin College and University of California, Berkeley


In a recent issue of this Journal, Ladd (1969) demonstrates that utility
maximization sufficiently guarantees competitive “survival”; that is, it
is not necessary to assume the profit maximization criterion. If he had
proceeded further in following his logic, however, he would have shown
that his model implies (a) that profit maximization also sufficiently
guarantees competitive survival and (b) that dropping the profit maximization
postulate, given his assumptions, changes nothing in the
theoretical sense (though some interesting empirical issues may well be
raised). These two observations stem from (c) the theoretical identity of
profit maximization and utility maximization at the critical point in
Ladd’s model. Thus, his conclusion becomes trivial and does not
constitute substantive criticism of traditional wisdom.

Using Scitovsky’s classic construction (1952), Ladd argues as follows
(see fig. 1). Let $M_1(L, K_1) = M_1$ be the implicit form of the equation
of the “minimum” indifference curve I ($K_1$ being the minimum acceptable
level of satisfaction). Since the net income function for both the
profit maximizer and the utility maximizer results from the maximization
of M subject to the production function for various values of the parameter
L, say L*, we can write $M = M(L)$.

Thus (we ignore second-order conditions; Ladd does not), the necessary
conditions for utility and profit maximization are, respectively,

$$dM(L)/dL = dM_i(L, K_i)/dL \tag{1}$$

and

$$dM(L)/dL = dM_1(L, K_1)/dL. \tag{2}$$

If (1) and (2) are satisfied for every value of $dM(L)/dL$ at the same value
of L, the “Scitovsky condition” for double maximization for all L is

$$dM_1(L, K_1)/dL = dM_i(L, K_i)/dL, \tag{3}$$

and a firm satisfying (1) and π = 0 also satisfies (2) and π = 0, where
“π = 0 means the firm is restricted to points on curve I: the zero-profit
indifference curve” (p. 481). So far, so good. But, since π = 0 constrains
the range of relevance to indifference curve I, the converse of the previous



Fig. 1. M measures entrepreneurial money income, L measures entrepreneurial
leisure, Q measures the physical output of the firm.

result is also true: a firm satisfying (2) and π = 0 also satisfies (1) and
π = 0. This is easily seen as follows. If the firm is maximizing profits
at the level of zero, it must mean that we are at a point such as b'. Here,
as contrasted with the case for profit maximization at some positive (or
negative) level of economic profits, the tangents to the net income
function and the indifference curve are coincident and not merely parallel.
But, since the tangents are coincident, this means the indifference curve
and the net income function must be tangent at b', which is precisely
the condition for utility maximization, and, therefore, the two
conditions become indistinguishable. Thus, by the constraining condition
π = 0, utility maximization implies profit maximization, and,
simultaneously, vice versa.¹ Thus, once the zero-profit constraint is effective,


¹ The equation

$$\frac{dM(L)}{dL} = -\frac{\partial U_1/\partial L}{\partial U_1/\partial M} \tag{4}$$

states that the slopes of the net income curve and the minimum indifference curve are
equal to each other. As written, this is Ladd’s equation (11), which was derived for the
case of utility maximization. But this must imply

$$\frac{dM(L)}{dL} = dM_1(L, K_1)/dL = -U_L/U_M \tag{5}$$

from the text above. Thus, when there is only one relevant indifference curve (the
minimum curve I), the profit- and utility-maximizing conditions imply each other
necessarily, a trivial case of the Scitovsky condition, which in general holds for all L.




the zero-profit equilibrium is formally identical with the utility-maximizing
equilibrium by the rules of the Scitovsky model, which dictate
that “normal” profit be found as the vertical distance between the
horizontal axis and the minimal indifference curve. It seems that what
Ladd has done, therefore, is to find a case of double maximization which
need not necessarily imply the “Scitovsky case,” though it is implied
by it.
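
The step from zero maximum profit to tangency can be written out explicitly. The sketch below assumes, in keeping with the Scitovsky construction as described here, that pure profit at leisure level L is the vertical distance between the net income function and the minimum indifference curve. The symbols $\pi(L)$ and $L_b$ (the value of L at point b') are labels introduced for this illustration and do not appear in Ladd's paper.

```latex
\begin{align*}
\pi(L) &= M(L) - M_1(L, K_1)
  && \text{(pure profit as vertical distance; assumed)} \\
\pi'(L_b) = 0 &\iff \frac{dM(L_b)}{dL} = \frac{dM_1(L_b, K_1)}{dL}
  && \text{(profit maximization: equal slopes)} \\
\pi(L_b) = 0 &\iff M(L_b) = M_1(L_b, K_1)
  && \text{(zero profit: a common point)} \\
\max_L \pi(L) = 0 &\implies M(L) \le M_1(L, K_1) \ \text{for all } L
  && \text{(no attainable point lies above curve I)}
\end{align*}
```

A common point with a common slope is tangency, and the last line says that no point on the net income function reaches a higher indifference curve than I, so $K_1$ is the largest attainable level of satisfaction. This is the sense in which the profit-maximizing and utility-maximizing conditions coincide at b'.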

Therefore, point b' in figure 1 (imagine indifference curve II and
others above it as not being vertically displaced from curve I) would be
a case of “double maximization” without the “Scitovsky condition”
holding. Does it matter if (3) does not hold? Not really, since regardless
of the assumptions made concerning the argument of the utility function,
long-run competitive equilibrium (π = 0) will occur at exactly that
value of L which would hold if (3) were true. Theoretically, therefore,
it is as if the competitive firm were a profit maximizer (and a “double
maximizer” at that) in the long run, and one suspects the triviality of
Ladd’s conclusion. Finally, we need only appeal to the principle of
Ockham’s razor at this point and ask, What plausible criterion would a
competitor maximize if not profit? Revenue, perhaps? Not if we want a
finite result. Market share? Impossible. Security? The model does not
define it in any meaningful terms. Whatever the criterion, the model
incorporating it must be more “complicated” than the profit maximization
model, and since long-run results, as we have seen, are totally
indistinguishable from one another, the choice of models seems clear on
methodological grounds.


References 

Ladd, G. W. “Utility Maximization Sufficient for Competitive Survival.”
J.P.E. 77, no. 4 (July/August 1969): 478-82.
Scitovsky, T. “A Note on Profit Maximization and Its Implications.” In Readings
in Price Theory. Homewood, Ill.: Irwin, 1952.



On the Economics of a University: Toward 
the Determination of Marginal Cost of 
Teaching Services 

Shlomo Brovender 
University of Pittsburgh


I. Introduction 

The current interest in the economic behavior of institutions of higher
learning is motivated both by their financial plight and by the realization
that economic theory is not well adapted to analyze the problems of
nonprofit institutions. The present publications in the economics of higher
education tend to consider the university in the aggregate as a single,
homogeneous plant. The aim of this paper is to consider the university
on the disaggregated level of departments and programs, and to suggest
an alternative method of costing university teaching services (see also
Brovender 1970).

II. The Distinction between Departments and Programs

Traditionally the university is viewed as a locus of departments, where
each department pursues its own ends in undergraduate and graduate
teaching, research, and other activities. I assume that the primary
outputs of the department are the number of credit hours it produces in a
trimester or a year¹ and that the cost of a unit of output can be measured
in terms of average cost per credit hour.²
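
As an illustration of this cost measure, here is a minimal Python sketch that computes average cost per credit hour as average salary divided by average productivity (credit hours per full-time equivalent faculty member). The department figures are hypothetical and are not taken from the Pittsburgh study; the function and parameter names are likewise illustrative.

```python
def average_cost_per_credit_hour(salary_bill, fte_faculty, total_credit_hours):
    """Average salary divided by average productivity."""
    average_salary = salary_bill / fte_faculty
    average_productivity = total_credit_hours / fte_faculty
    return average_salary / average_productivity


# Hypothetical department: $600,000 salary bill, 40 FTE faculty,
# 12,000 credit hours produced in the period.
cost = average_cost_per_credit_hour(600_000, 40, 12_000)
print(cost)
```

Because the FTE terms cancel, the measure reduces to the salary bill divided by total credit hours; keeping the two ratios separate shows how a change in either average salary or average productivity moves the cost.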

In this paper I try to change the focus of attention from departments
to programs. For my purposes the program is defined by the final degree
conferred on the student upon graduation, and conceptually many
departments participate in each program (before, only one department
was usually associated with the program).

This paper is based on a study conducted at the University of Pittsburgh in 1970. I
wish to thank Daniel B. Rathbun, who as the director of Institutional Planning and
Budgets in the University of Pittsburgh made this study possible, and Mark Perlman
for his advice and support. All errors and shortcomings are of course mine.

¹ I do not consider here the problems of output of research and community services
but strictly the teaching activities.

² If we consider the ratio of total credit hours to full-time equivalent faculty (FTE) to
be a measure of average productivity, then average cost per credit hour = (salary bill/
FTE)/(total credits/FTE) = average salary/average productivity.


III. The Program Matrix

Assume that we can construct the following matrix, which I refer to as
the A Matrix.

The A Matrix is a rectangular matrix of coefficients in which the rows
denote the academic programs and the columns denote the academic
departments. The $a_{ij}$ coefficients within the matrix are the average credit
hours per period of time³ required for a degree. Hence, the matrix
represents the curriculum structure of the university. Consider table 1.

The relevant credit coefficients are those along the row vector (program
vector) and not along the column vectors. A program major “buys”
(uses) credits from different departments, but departments neither buy
credits from nor “sell” (offer) credits to other departments. This is a very
important distinction that shows the validity of distinguishing between
departments, which are both “production” entities and administrative
entities, and “programs,” which are functional entities. When we say
that the program major “buys” credits from other departments, essentially
what we mean is that the program major imposes a load on other
departments, and the extent of the load on each department will depend
upon the number of students in the program and the size of the
coefficients $a_{ij}$ within the matrix.
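
The load calculation described above can be sketched as a weighted column sum of the A Matrix. The following is a minimal illustration only: it assumes the 3 x 3 segment of credit-hour coefficients that the paper reports in table 5, while the enrollment figures and the function name `department_loads` are hypothetical and introduced purely for the example.

```python
# Rows = programs, columns = divisions (departments); each entry is the
# number of credit hours a typical student of the program takes in the
# division. Values: the 3 x 3 segment reported in table 5 of the paper.
A = [
    [69.0, 18.5, 19.9],  # humanities program
    [34.7, 58.5, 18.9],  # natural sciences program
    [34.9, 20.6, 51.5],  # social sciences program
]

# Hypothetical number of students per program (an assumption, not from the paper).
students = [100, 50, 80]


def department_loads(a_matrix, enrollment):
    """Credit-hour load on each department j: sum over programs i of students_i * a_ij."""
    n_departments = len(a_matrix[0])
    return [
        sum(enrollment[i] * a_matrix[i][j] for i in range(len(a_matrix)))
        for j in range(n_departments)
    ]


loads = department_loads(A, students)
for name, load in zip(["humanities", "natural sciences", "social sciences"], loads):
    print(f"{name}: {load:.0f} credit hours")
```

Reading the result column by column shows how a change in one program's enrollment shifts the load on every department from which that program draws credits, which is the interaction the row-vector view is meant to capture.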

In the A Matrix the sum of each row,

$$\sum_{j=1}^{N} a_{ij},$$

will be of about the same magnitude, since the number of credits which a
student is allowed to take in any period of time is constrained
institutionally; that is, the student must take at least a certain minimum
and at most a certain maximum of credits (14-16 credit hours
for a full load).

The size of the matrix will depend on the number of identifiable 


TABLE 1
A Matrix

                          Academic Departments (Source)
Program (Destination)     1        2        ...      N
1                         a_11     a_12     ...      a_1N
2                         a_21     a_22     ...      a_2N
...                       ...      ...      ...      ...
M                         a_M1     a_M2     ...      a_MN

³ The time dimension is important, since we can construct matrices for trimester, year,
and 4 years, and by level of undergraduate/graduate instruction.







programs and departments, but the numbers of departments and programs
are not necessarily equal.⁴

The distribution of the coefficients $a_{ij}$ within the matrix is postulated
as follows. The largest coefficients will be distributed along the main
diagonal ($i = j$). This is expected, since along the main diagonal the
program major and the department which offers the major coincide.
Off the main diagonal we expect to find an area of concentration of the
coefficients.⁵ As we move farther away to the right or the left of the main
diagonal, the coefficients will tend to be smaller and irregular. For
example, we can expect a student in premedicine to take English courses,
but we can hardly expect a major in English to take premedicine courses.
Similarly, when we aggregate all the departments into larger units such
as divisions or schools, we would expect to find a higher degree of
concentration and interdependency within schools than between schools.
This results from the fact that some programs or schools tend to impose
loads on the rest of the university while they do not offer, in a reciprocal
way, credits to majors from other programs. So, the higher the
degree of specialization of a school, the less the degree of interaction we
can anticipate with the rest of the university.

We expect that the coefficients $a_{ij}$ within the matrix will tend to be
relatively stable within the row vector over time. To put it differently,
we do not expect that the curriculum requirements will tend to change
rapidly from period to period.

Table 2 shows an actual A Matrix derived from a sample of 1,691
graduating students in the University of Pittsburgh in 1968-69.


IV. Cost Considerations

Let $c_i = f(x_i)$ be a cost function, where $c_i$ = salary bill in department $i$,
and $x_i$ = number of credits in department $i$. Then $dc_i/dx_i = f'(x_i) = c'$
will be the marginal cost of producing 1 credit hour in department $i$,
and $c_i/x_i = f(x_i)/x_i = c$ will be the departmental average cost.
Furthermore, $x_i = x_{i1} + x_{i2} + x_{i3}$, where $x_{i1}$ = lower-level
undergraduate credits, $x_{i2}$ = upper-level undergraduate credits, and $x_{i3}$ =
graduate credits.⁶
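
The relation between the marginal and the average cost defined above can be illustrated with a short sketch. It assumes, purely for illustration, a linear cost function c = b0 + b1 * x; the function name and the numerical values are hypothetical and are not estimates from the study.

```python
def average_and_marginal_cost(b0, b1, x):
    """For the linear cost function c = b0 + b1 * x, return (average cost, marginal cost).

    Average cost is c / x; marginal cost is the derivative dc/dx, which is
    the constant b1 for a linear function.
    """
    if x <= 0:
        raise ValueError("x (number of credits) must be positive")
    total_cost = b0 + b1 * x
    return total_cost / x, b1


# Hypothetical values chosen only to show the mechanics.
avg, marg = average_and_marginal_cost(b0=400.0, b1=10.0, x=100.0)
print(f"average cost = {avg:.2f}, marginal cost = {marg:.2f}")
```

Under this assumed functional form, a positive intercept places the average cost above the marginal cost at every output level, and the gap narrows as the number of credits grows.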


⁴ Since some departments, such as ROTC and continuing education, are service
departments which do not have majors.

⁵ The A Matrix refers mainly to undergraduate education. In graduate programs we
can expect a higher degree of specialization within departments and therefore less
interdependence. It is postulated that 90-95 percent of all coefficients will be along the main
diagonal.

⁶ The credit hours used in the following regression equations were considered to be of
equal weights. The use of credit hours as a measure of output involves the following
problems: (1) determination of the absolute number of credits and primarily the
distinction between $x_{i2}$ and $x_{i3}$, and (2) development of a nonarbitrary weighting scale.
Actually one of the purposes of the regression analysis is to develop a measure of relative
costliness among $x_{i1}$, $x_{i2}$, and $x_{i3}$ credits.



TABLE 2
[A Matrix for the sample of graduating students, University of Pittsburgh, 1968-69; table body not recoverable.]




TABLE 3
Results of Multiple Regression Analysis
Total Salaries = b_0 + b_1 (Total Undergraduate) + b_2 (Graduate)

Group               b_0           Total Undergrad   Graduate
                    (1)           (2)               (3)           (4)
Humanities          249.4920      4.5527            74.8682       .78
                    (77.8472)     (1.6169)          (25.9845)
Natural sciences    414.7910      9.0277            30.3118       .60
                    (132.7140)    (2.6964)          (1.6174)
Social sciences     -42.8606      18.9147           17.2558       .78
                    (109.6140)    (2.6505)          (15.1325)

Note: Numbers in parentheses = standard errors.





TABLE 4
Results of Multiple Regression Analysis
Total Salaries = b_0 + b_1 (Total Credits)

Group               b_0           Total Credits
                    (1)           (2)               (3)
Humanities          373.9870      8.1254            .73
                    (66.3854)     (0.8994)
Natural sciences    431.5740      11.4350           .56
                    (132.8490)    (1.7621)
Social sciences     -42.8411      18.7621           .78
                    (107.1200)    (2.1237)

Note: Numbers in parentheses = standard errors.


It should be emphasized that the above cost considerations follow the
departmental definition of locus of activity and not the program definition.
For exposition purposes only, tables 3 and 4 give the results of six
regressions which attempted to estimate a cost function of the type
described above in two variations.


V. Synthesis 

The existence of the A Matrix and both the cost coefficients from the
regression equations (c') and the average cost coefficients (c) will
enable us to cost university programs on both an average-cost basis and a
marginal-cost basis.

For exposition purposes I will use a segment of 3 x 3 from table 2,
with the corresponding coefficients from tables 3 and 4, to calculate the
average program cost and “marginal” program cost⁷ for a “typical”
student in the divisions of humanities, natural sciences, and social sciences.

⁷ The coefficients of the regression equations are taken to represent the “marginal
cost” of the division.





TABLE 5
Composite Table

                    No. of Credit Hours                     Cost per Credit Hour ($)
                                                                     MC* (Coefficients of)
                    Human.   Nat. Sci.  Soc. Sci.  Total    ATC†     Table 3    Table 4
                    (1)      (2)        (3)        (4)      (5)      (6)        (7)
Humanities          69.0     18.5       19.9       107.4    15.00    4.55       8.13
                                                                     (1.62)     (0.90)
Natural sciences    34.7     58.5       18.9       112.1    18.87    9.03       11.44
                                                                     (2.70)     (1.76)
Social sciences     34.9     20.6       51.5       107.0    17.87    18.91      18.76
                                                                     (2.65)     (2.12)

Source: Cols. 1-4 from table 2; col. 6 from table 3, col. 2; col. 7 from table 4, col. 2.
Note: Numbers in parentheses = standard errors.
* Marginal cost.
† Average cost per credit hour in the respective divisions.


TABLE 6
Calculation of Program Cost

                               Total Program Costs ($)       Program Cost per Credit Hour ($)
                    Credits    AC       MC1      MC2         AC       MC1       MC2
Humanities          107.4      1,736    855      1,143       16.19    7.97      10.66
                                                                      (1.24)    (0.76)
Natural sciences    112.1      1,982    1,043    1,305       17.68    9.30      11.64
                                                                      (1.56)    (1.02)
Social sciences     107.0      1,832    1,318    1,485       17.12    12.32     13.88
                                                                      (1.47)    (1.12)

Note: AC = average cost; MC = marginal cost. MC1 refers to coefficients from table 3, and MC2 to
coefficients from table 4. Numbers in parentheses = standard errors.


The data in table 5 enable us to calculate the program costs in the
humanities, natural sciences, and social sciences. It should be realized
that in essence the so-called program cost is nothing more than the
weighted unit cost of the program, that is, the number of credits which
were delivered by each “administrative” division (or department) times
its unit cost for producing 1 credit hour.⁸

Table 6 gives the results of these exercises. The total program costs
are equalized for the difference in the number of credits which are
represented in the reduced matrix. Now we are in a position to consolidate
the arguments.
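
The weighted-unit-cost calculation described above can be sketched as follows. This is a minimal illustration that takes the rounded entries of table 5 as inputs; the variable and function names are hypothetical. Because the inputs are rounded, the results need not reproduce every entry of table 6 exactly.

```python
# Credit hours taken by a "typical" student of each program in each division
# (table 5, cols. 1-3). Division order: humanities, natural sciences,
# social sciences.
CREDITS = {
    "Humanities": [69.0, 18.5, 19.9],
    "Natural sciences": [34.7, 58.5, 18.9],
    "Social sciences": [34.9, 20.6, 51.5],
}

# Unit cost per credit hour in each division (table 5, cols. 5-7).
UNIT_COST = {
    "AC": [15.00, 18.87, 17.87],   # divisional average cost
    "MC1": [4.55, 9.03, 18.91],    # regression coefficients, table 3
    "MC2": [8.13, 11.44, 18.76],   # regression coefficients, table 4
}


def program_cost(credit_row, unit_costs):
    """Return (total program cost, program cost per credit hour).

    The total is the sum over divisions of credits taken in the division
    times that division's unit cost; dividing by the row total of credits
    gives the credit-weighted unit cost of the program.
    """
    total = sum(a * c for a, c in zip(credit_row, unit_costs))
    return total, total / sum(credit_row)


for program, row in CREDITS.items():
    for basis, costs in UNIT_COST.items():
        total, per_credit = program_cost(row, costs)
        print(f"{program:17s} {basis:3s} total = {total:8.1f}  per credit hour = {per_credit:5.2f}")
```

Dividing by the row total is what makes programs with different numbers of credits in the reduced matrix comparable on a per-credit-hour basis.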


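⁸ The program cost is equal to $\sum_{j} a_{ij} u_j$, where $a_{ij}$ is the number of credits delivered to program $i$ by division (or department) $j$ and $u_j$ is the unit cost of producing 1 credit hour in $j$.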










TABLE 7
Program Cost (in Dollars per Credit Hour)

                    Department    Program    Program    Program
                    AC            AC         MC1        MC2
Humanities          15.00         16.19      7.97       10.66
                                             (1.24)     (0.76)
Natural sciences    18.87         17.68      9.30       11.64
                                             (1.56)     (1.02)
Social sciences     17.87         17.12      12.32      13.88
                                             (1.47)     (1.12)

Note: AC = average cost; MC = marginal cost. Numbers in parentheses = standard errors.



TABLE 8
Deviation of Marginal Cost from Program Average Cost

                    Program Average Cost    MC1     MC2
Humanities          100.0                   49.2    65.8
Natural sciences    100.0                   52.6    55.8
Social sciences     100.0                   72.0    81.1


The first column in table 7 represents the customary way in which
university services are priced; that is, the cost is the departmental cost
(divisional cost) of producing 1 credit hour. According to this customary
calculation, the humanities courses are the cheapest, while the natural
sciences, as some may agree, are the most expensive, and social sciences
are intermediate.

The second column represents the program-weighted average costs.
According to this calculation the order of costliness does not change. The
humanities programs are still the cheapest and natural sciences are the
most expensive, but the range of differences among the programs is
reduced substantially.

The last two columns in table 7 demonstrate the “marginal program
cost” by utilizing the regression coefficients from tables 3 and 4. These
calculations point immediately toward two important relationships.
First and foremost, the marginal costs per credit hour are substantially
below both the department average cost and the program average
cost. And second, we observe a reversal in the costliness of the programs.
The social sciences, rather than the natural sciences, have become the
most expensive program.

Table 8 summarizes the extent of deviation of the marginal cost from
the average program cost.⁹
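
The index in table 8 can be sketched as each marginal program cost expressed as a percentage of the program average cost. The snippet below is a minimal illustration that uses the rounded per-credit-hour figures of table 7; the names are hypothetical. For the natural sciences MC2 entry the ratio of the table 7 figures works out to about 65.8, which does not match the 55.8 shown in table 8, and the sketch makes no attempt to resolve that difference.

```python
# Program cost per credit hour from table 7: (program AC, MC1, MC2).
TABLE_7 = {
    "Humanities": (16.19, 7.97, 10.66),
    "Natural sciences": (17.68, 9.30, 11.64),
    "Social sciences": (17.12, 12.32, 13.88),
}


def cost_index(marginal, average):
    """Marginal cost as a percentage of average cost (average cost = 100)."""
    return 100.0 * marginal / average


for program, (ac, mc1, mc2) in TABLE_7.items():
    print(f"{program:17s} MC1 = {cost_index(mc1, ac):5.1f}  MC2 = {cost_index(mc2, ac):5.1f}")
```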

⁹ The fact that the marginal cost is smaller than the average cost may be indicative
of the optimal level of output and the relative levels of utilization of capacity.









VI. Summary and Conclusions 

The preceding sections have suggested that analyzing the cost of services
of a university can be rewarding, particularly if modern techniques are
used. The conventional average costing method, which considers the
department vectors as the relevant cost accounting unit, is not able to
provide a good indication of the costliness of the teaching activities
for the following reasons: (1) The average cost does not permit
differentiation between the teaching and nonteaching activities that are
carried out in the department. (2) It is not able to differentiate between
the levels of teaching (undergraduate or graduate instruction). (3) It
does not take into account the random elements that contribute to the
cost variability between departments (such as seniority structure,
technology of teaching, department size, and market conditions which affect
salaries). Average-cost accounting identifies the department with the
program and therefore neglects to consider the extent to which the
interaction between departments and programs at any given moment affects
the level of average cost.

I think that the proposed approach overcomes most of the deficiencies
of the average costing method, primarily because it emphasizes the crucial
role of the interaction between departments and programs in the
university.


Reference 

Brovender, Shlomo. “Interactions between Departments and Programs in
Analyzing a University as an Economic Unit: The University of Pittsburgh
in the 1960’s.” Ph.D. dissertation, Univ. Pittsburgh, 1970.



Book Review 


The Channels of Monetary Effects on Interest Rates. By Phillip Cagan.
New York: Columbia University Press, 1972. $5.00.

The stated purpose of this book is to examine, theoretically and empirically, two
types of monetary effects on interest rates, the portfolio and the credit effect. A
third effect, the impact of inflation on nominal interest rates, or the Fisher effect,
is also mentioned. Most of the book is devoted to an analysis of the portfolio
and credit effects. Two additional chapters, which to me are less interesting and
will not be discussed here, present a simple mathematical model of monetary
effects on interest rates and an empirical investigation of the lags in monetary
effects on interest rates and income.

Under the portfolio effect, portfolios are rearranged in response to a rise in the
rate of growth of money until interest rates have changed sufficiently to make the
quantity of money demanded equal to the existing stock. The changed interest
rate affects income, which feeds back on desired money balances. An increase in
the money growth rate ultimately produces an increase in the rate of growth of
nominal income, but the new equilibrium is at the same real rate of interest.
Under the credit effect, increased money growth is viewed as producing an
increased supply of loanable funds. The larger the supply of loanable funds, the
lower is supposed to be the interest rate.

The weakest part of Cagan’s argument is his attempt in chapter 1 to make sense
out of the credit effect. The discussion relies on the concept of “loanable funds.”
Cagan writes: “The concept of ‘loanable funds’ here is all-inclusive and is meant
to be the flow of funds or credit (I use the terms interchangeably) appropriate to
the determination of the general level of interest rates in the economy. Only
net purchases of financial claims by each person or business are included; such
purchases can be financed by income receipts, net reductions in money balances,
or the creation of new money” (p. 3, n. 2, emphasis in original). Simultaneous
changes in assets and liabilities for an individual economic unit must be netted
out, since such changes do not affect the excess demand for funds; similarly,
exchanges of one asset for another, or of one liability for another, must be netted
out.

There is, I believe, a serious problem with Cagan’s argument. First, note that
only income receipts not spent on goods and services are available for net purchases
of financial claims. Now, consider the following example. Miss X has a flow of
nonspending of $1,000 per year which she invests in bonds issued by Mr. Y, who
is spending $1,000 per year more than his income receipts. If the two should marry
and become one economic unit while maintaining the same total of spending, has
the flow of loanable funds declined by $1,000? If so, should we expect there to be
an effect on market interest rates?

Marriages and mergers may or may not affect the aggregate of nonspending by
the partners, but such events certainly do affect the size of the flow of loanable
funds as defined by Cagan. Furthermore, any method of aggregation that nets
out intrasector claims, as is done in the Federal Reserve’s Flow of Funds Accounts,


raises the same issue. Finally, the problem cannot be avoided by taking the
difference between aggregate income and aggregate spending on goods and services;
this figure is simply the statistical discrepancy in the national income accounts!

The flow of loanable funds may either rise or fall as interest rates change,
depending on whether the demand or the supply function has shifted. But Cagan
never attempts to measure the total flow of loanable funds. Instead, he works with
that part of the flow accounted for by new money creation. In chapter 2 he argues
that whether the credit effect exists depends on the disposition of the revenue from
new money creation. The first part of this argument is an analysis of the revenue
from money creation. The revenue from government currency issues is examined,
and then the revenue from deposit expansion under a variety of assumptions
involving monopoly versus competitive banking. Much of this discussion is designed
to show that revenues (bank profits) from money creation are smaller than they
appear at first blush because of the resources used in providing the services with
which banks bid for deposits.

The second part of chapter 2 consists of a discussion of the disposition of the
revenue from money creation. Cagan discusses the disposition of the government’s
revenue in terms of the extent of distribution effects between the government and
the private sector and concludes that “no general conclusions can be drawn about
the direction or magnitude of the effects.” Similarly, the effects of private
currency issues and deposit expansion also depend on whether distribution effects
exist. A rise in deposit expansion generates inflation, which transfers real resources
from the holders of deposits to bank owners; a net effect on the supply of loanable
funds can only occur if the gainers and losers react asymmetrically.

Cagan’s argument would, I believe, have been more powerful if he had begun
his analysis with the very simple assumption of random distributions of
government fiat money to the private sector. This simple assumption has the advantage
of letting the revenue from money creation accrue to the private sector. The effect
on the supply of loanable funds is then clearly a matter of distribution effects.
Cagan’s elaborate analysis of commercial banks is relevant for the question of the
amount of revenue but is largely irrelevant for the question of which way the
distribution effects will go. Furthermore, if the government raises revenue through
money creation rather than through taxation, the problem is still one of
distribution effects within the private sector because a tax on real money balances has been
substituted for a tax on, say, income. Distribution effects between the private sector
and the government are relevant only if money creation is a substitute for
government bond issues.

The implications of the assumption of random money distributions have already
been explored in the context of money growth models. In terms of the exposition
by Johnson (1969), under some assumptions a higher rate of money creation can
accelerate real investment and produce a higher equilibrium capital/labor ratio
accompanied by a lower real rate of interest. The mechanism producing this
result is that inflation lowers the demand for real balances, which in turn lowers
the saving going into real balances and raises the saving going into capital
formation. The growth model mechanism is the exact opposite of that discussed by
Cagan. Cagan’s concern is whether money creation adds to the supply of real
loanable funds rather than whether it reduces the demand for real loanable funds
by reducing the demand for real money balances. However, both Cagan and
Johnson stress that any long-run effects that do exist appear by virtue of the
assumption that no interest is paid on money.

The empirical work in chapter 3 is designed to show that monetary growth does
indeed have a negative impact effect on interest rates. The method used is to
divide the reference cycle (the business cycle with turning points defined by the
NBER) into nine stages. Stage I is the trough, stage V the peak, and stage IX the




next trough. (Stage IX is stage I of the next reference cycle.) The level of the
interest rate and the monthly rate of change of money ($M_2$) are averaged for each
stage. The change in the stage average for the interest rate is then regressed on the
change in the stage average for money growth. Six different interest rate series
were used; their correlations with money growth range from -.16 to -.45.

In chapter 4 Cagan attempts to test for the credit effect. He constructs a
consolidated monetary system balance sheet by consolidating the accounts of the
Federal Reserve, the Treasury, and the commercial banks. The fully consolidated
balance sheet has three principal assets and four principal liabilities. The assets
are (a) gold (plus special drawing rights and foreign exchange at the present
time), (b) commercial bank loans and investments other than Treasury securities,
and (c) physical assets owned by the Treasury, Federal Reserve, and commercial
banks. The liabilities are (d) deposits held by the public in commercial banks,
(e) Federal Reserve and Treasury currency outside the monetary system, (f)
Treasury securities outside the monetary system, and (g) the net worth of the
Treasury, the Federal Reserve, and the commercial banks. Credit granted by the
consolidated monetary system to the remainder of the economy is defined to equal
item b less item f. Call this variable E, and its average monthly change over a
reference cycle stage, dE.

Cagan runs regressions of the form $\Delta i = a_0 + a_1 \Delta(dE/M) + a_2 \Delta(dM/M)$,
where $\Delta$ refers to the difference between the stage averages. The $a_1$ coefficient is
supposed to represent the credit effect. Using the sample period 1919-66, Cagan
finds that the credit effect is 10-40 percent of the monetary effect depending on
the interest rate series used (i.e., the $a_1$ coefficient is 10-40 percent of the $a_2$
coefficient). Both $a_1$ and $a_2$ are negative, as expected.

This argument seems to me to confuse two meanings of “credit effect.” Cagan’s
argument in chapters 1 and 2 has nothing to do with whether money creation is
associated with an increase in the earning assets of the consolidated monetary
system. There is no change in E if new money is distributed randomly from
airplanes or systematically through tax cuts. The argument underlying “credit
effect A” is that money creation will depress the real rate of interest by producing
distribution effects (“forced saving”), whereas under the portfolio effect the real
rate would eventually return to its initial level with a new higher equilibrium
inflation rate. Cagan’s test is relevant to “credit effect B,” the argument that it is
credit creation rather than money creation that drives economic activity. The
extreme form of this argument is that money creation per se does nothing, while
money creation through government deficits, or through new bank lending, is
expansionary.

The argument for credit effect B is in fact very close to the Keynesian argument
that autonomous expenditures drive economic activity. An exogenous rise in E
can occur without a change in the money stock (the sum of balance sheet items d
and e) if, for example, the government reduces its expenditures so that government
debt is retired and net worth rises. The decline in autonomous expenditures that
increases E depresses interest rates according to both the Keynesian model and
Cagan’s regressions.¹ With the exception of international capital flows, there is
no way for E to change over time with M fixed without there being an associated
change in private or government expenditures (or taxes). In a general equilibrium
model with three goods (commodities, money, and securities), analysis may be
conducted in terms of any two of the three markets. The argument for credit
effect B is simply the same as the argument that both monetary and fiscal policy
affect interest rates.

¹ Whether income changes depends on the extent to which the interest rate change
crowds out (or in) other expenditures.





Another version of credit effect B is that bank credit plays a special role. Cagan
tested two versions of this theory by defining a variable E' to equal total
commercial bank loans and investments plus the Federal Reserve’s securities portfolio,
and by running a multiple regression splitting E' into commercial bank and
Federal Reserve components. In both versions the credit effects are estimated to
be somewhat larger than before. I find these results unconvincing. Bank credit can
change relative to the money stock only as a result of changes in the currency/
deposit and reserve/deposit ratios.² Both these ratios are probably related to
interest rates, and such relationships seem more plausible explanations than
“credit effects.”

It appears, then, that Cagan has not in fact tested credit effect A. However, it is
not clear that it is possible to distinguish credit effect A from the portfolio effect.
Since both effects are properly defined in terms of the real rate of interest, a test
for a long-run credit effect A is inevitably entangled in testing for the Fisher
effect. In principle, distinguishing between credit effect A and the portfolio effect
in the short run might depend on distinguishing monetary effects on the level
from those on the rate of change of the interest rate. Under the portfolio effect the
interest rate ought to change so long as there is monetary disequilibrium; if
(equilibrium) money demand differs from money supply primarily because of
changes in the latter, then it might be reasonable to argue that the portfolio effect
should show up as a negative relationship between the rate of change of the
interest rate and the rate of change of the money stock. Credit effect A, on the
other hand, should appear as a relationship between the level of the interest rate
and the rate of change of money. Unfortunately, all of the tests reported in this
book relate the level of the interest rate to the rate of growth of money, and so no
comparison along the above line is possible.

What have we learned from all this? The credit-effect arguments make sense
to me only if I substantially restate Cagan’s argument, and I am not at all sure
that my restatement reflects what he had in mind. I cannot think of any purpose
for which I would use the estimated coefficients in the regressions of interest rates
on money growth. The primary value of Cagan’s empirical work would seem to
be that there can be little remaining doubt about the major direction of causality
between the money stock and the interest rate. From what we know about the
operation of the U.S. monetary system, if the money stock were an endogenous
variable, then surely the short-run correlation between money and interest should
be positive. Cagan has examined this issue in almost every conceivable way and
has found that the short-run correlation between the interest rate and money
growth is negative. This finding may not have been Cagan’s aim, but monetarists
should be happy to have this one securely salted away.

William Poole 

Federal Reserve Bank of Boston 

Reference 

Johnson, H. G. “Money in a Neo-classical One-Sector Growth Model.” In Essays
in Monetary Economics. Cambridge, Mass.: Harvard Univ. Press, 1969.

² With the growth in nondeposit liabilities in recent years we might include such
liabilities as a separate item in the consolidated balance sheet of the monetary system.
In constructing the variable E this item would be subtracted out; E would be unaffected
by equal growth in bank loans and bank nondeposit liabilities. However, nondeposit
liabilities might not be subtracted out in constructing E'. There would not appear to be
a good reason for this treatment except insofar as improvements in financial markets,
including intermediation, reduce the demand for money. Given the money stock, a
change in bank credit is more likely a symptom than a cause.



An Econometric Evaluation of 
Sulfur Taxes 


James M Griffin 

University of Pennsylvania and University of Houston


In contrast to the standard comparative static analysis of alternative
policy instruments for dealing with air pollution, the present study
provides a dynamic framework of analysis through the use of an
econometric model. An econometric model of the electric utility industry is
utilized to calculate the welfare gains associated with the imposition
of a sulfur tax as enumerated in the Pure Air Tax Act of 1972. The
dynamic properties of a recent proposal by Baumol and Oates which
would combine an ambient air standard and a charges policy are also
examined.


I. Introduction 

There is considerable doubt as to what the appropriate policy tools for
dealing with sulfur oxide emissions are. Recent proposed legislation such
as the Pure Air Tax Act of 1972 calls for a fixed tax rate.¹ Alternatively,
several economists, including Baumol and Oates (1971), propose the use
of both an ambient air standard and a tax which the pollution control
agency would adjust periodically to achieve target reductions in pollution.
At least four major problem areas constrain meaningful comparisons of
these alternative policy instruments. Applied welfare analysis is
circumscribed both by the lack of reliable estimates of the damage function and
by deficiencies in the estimation of control costs. Furthermore, Ayres and
Kneese (1969) suggest that second-best considerations are likely to be


I wish to express thanks to F. G. Adams, George Daly, R. William Thomas, Tom
Mayor, and Henry Steele for numerous helpful comments. This project was partially
financed by a Faculty Development Research Grant from the University of Houston
and by a grant from the Energy Policy Project of the Ford Foundation.

¹ Specifically, this proposal calls for a 15¢/pound tax on sulfur emissions beginning
January 1, 1976, for those areas that do not meet the primary standards and a 10¢/pound
tax effective in those areas that meet the primary standards but not the secondary
standards as set forth in the Clean Air Act (see Congressional Record 1972).


important, implying an inherent weakness of the traditional partial-
equilibrium approach to welfare analysis. Finally, while the theoretical
literature examines alternative policy instruments, it does so within a
comparative static framework, thereby overlooking the inherently
dynamic nature of the pollution control problem.

This paper seeks to demonstrate how the overly restrictive assumptions
of comparative static analysis can be remedied by an econometric model.
As well as providing a dynamic framework, an econometric modeling
approach provides a method of estimating control costs. Specifically,
simulations with an econometric model of the electric utility industry
(Griffin 1974a) are presented to assess the likely welfare effects of the
Pure Air Tax Act of 1972 and the dynamic properties of the Baumol and
Oates approach. Study of the electric utility industry is particularly
appropriate because in 1970 power plants contributed 54 percent of the
nation's sulfur oxide emissions.²

Theoretically, one should be able to recommend a tax on pollutant
emissions which would minimize the sum of the damage costs, the
polluters' control costs, and the regulatory agency's administrative costs.
Unfortunately, our understanding of these cost relationships is generally
not sufficient to allow such a calculation. Considerable effort has been
devoted to determining the damage function of sulfur dioxide and
particulate emissions through analysis of property-value data. These
studies generally obtain the expected negative relation between property
values and pollution levels,³ but the estimated coefficients are highly
sensitive to the specification of the model. There is also some question as
to the shape of the marginal damage function. Ridker and Henning
(1967) found the function to be flat, while Anderson and Crocker indicate
that it declines over low and intermediate ranges of pollution (Crocker
1971, p. 70). While recognizing its importance, this paper does not attempt
to add to the damage function literature but, rather, draws upon existing
studies when necessary. Fortunately, the policy conclusions of this study
hold for a wide variety of assumptions about damage functions.

In contrast to the attention given the estimation of damage costs, the
problems of estimating control costs have been generally overlooked.
Perhaps because engineering data are a necessary input, the estimation
of control costs has tended to be perceived as strictly an engineering
question. For example, in the case of a sulfur tax, the estimation of control
costs has been viewed merely as the cost of reducing a fuel of high-sulfur
content to low-sulfur content after some specific period of transition.⁴
Yet economic theory would suggest the possibilities of factor input
substitution and final product substitution by the consumer, which effects
would tend to reduce control costs.

² For an informative review, see U.S. Congress (1970, pp. 172-88).

³ See Ridker (1967), Ridker and Henning (1967), Anderson and Crocker (1970),
Wieand (1970), and Crocker (1971).

⁴ For an example, see Hausgaard (1970).

In addition to affecting the estimation of control costs, the presence of
various lagged substitution effects points to weaknesses in the traditional
comparative static analysis. If pollution charges are to be adjusted
annually to achieve some ambient air standard, knowledge of the dynamic
as well as long-run multipliers associated with the various substitution
effects is required. From experiences with stabilization policy, it is
conceivable that certain policy instruments may have destabilizing effects
because such instruments are sensitive to unknown and variable dynamic
phenomena. Since the econometric model used in this study attempts to
characterize both current and lagged-period substitution effects, it is
appropriate to consider the dynamic implications of proposals to vary
taxes in order to achieve stated emission levels.
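
The concern about destabilizing instruments can be illustrated with a minimal sketch. It is not the econometric model of this study: it assumes a hypothetical partial-adjustment process in which emissions close a fixed fraction of the gap to a long-run level that falls linearly with the tax, and an agency that changes the tax each year in proportion to the gap between emissions and target. All names and parameter values are illustrative assumptions.

```python
def simulate_tax_adjustment(target, base_emissions, long_run_response,
                            adjustment_speed, gain, years):
    """Return a list of (tax, emissions) pairs, one per year.

    Hypothetical dynamics (assumptions, not the paper's equations):
      long-run emissions = base_emissions - long_run_response * tax (floored at 0)
      emissions close a fraction `adjustment_speed` of the gap each year
      the agency changes the tax by `gain` times (emissions - target)
    """
    tax = 0.0
    emissions = base_emissions
    path = []
    for _ in range(years):
        long_run = max(0.0, base_emissions - long_run_response * tax)
        emissions += adjustment_speed * (long_run - emissions)
        path.append((tax, emissions))
        # Raise the tax when emissions exceed the target, lower it otherwise.
        tax = max(0.0, tax + gain * (emissions - target))
    return path


if __name__ == "__main__":
    cautious = simulate_tax_adjustment(60.0, 100.0, 2.0, 0.5, 0.2, 30)
    aggressive = simulate_tax_adjustment(60.0, 100.0, 2.0, 0.5, 4.0, 30)
    print("cautious agency, final year:", cautious[-1])
    print("aggressive agency, first years:", aggressive[:4])
```

With a small adjustment coefficient the tax settles at the level that meets the target; with a large one the agency overreacts to emissions that have not yet fully responded, and the tax swings between high values and zero.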

Section II presents a simplified description of the basic structural
relationship present in the econometric model as well as the assumptions
and estimation techniques underlying it. In Section III, the probable
welfare and distributional effects of the recently proposed sulfur tax are
estimated for alternative technological assumptions and tax rates. The
dynamic properties of the Baumol and Oates approach are considered in
Section IV as taxes are adjusted annually to meet some specific sulfur
emission constraint. In Section V, factors omitted from this study, such
as regional and seasonal effects and second-best considerations, are
discussed. A brief summary is given in Section VI.


II. The Model

The simulation results reported herein are based on an econometric
model of the electric utility industry (Griffin 1974a) that was subsequently
modified in Griffin (1974b) to consider the sulfur tax issue. The interested
reader is referred to the above two sources for a detailed description of the
model. The description here is intended to acquaint the reader with the
basic characteristics of the model and to illustrate the major substitution
possibilities, which will become important in Section IV.

After modification for sulfur tax analysis in Griffin (1974b), the model
consists of 33 equations of which 12 are stochastic. Of the 27 exogenous
variables, the four major macroeconomic variables are linked to the
Wharton Annual and Industry Forecasting Model (Preston 1972), which
provides simulation inputs to 1981. With the exception of one equation,
which follows MacAvoy (1969) in using a cross-section time series panel,
the sample consists of annual observations for the period 1950-71. To
distinguish short-run and long-run elasticities, Almon polynomial
distributed lags are estimated in several equations using maximization of the
coefficient of determination as the basis for the choice of lag structure.




Two-stage least squares is the principal estimation approach because of
the numerous simultaneous relationships in the model. A 5 percent t-test
criterion is used for inclusion of variables that are first determined as
relevant on theoretical grounds.
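
As a minimal sketch of how a polynomial distributed lag separates short-run from long-run effects, the following assumes hypothetical polynomial coefficients (not estimates from the model). The lag weights are constrained to lie on a low-order polynomial in the lag index; the impact effect is the weight at lag zero, and the long-run effect is the sum of all weights.

```python
def almon_weights(poly_coeffs, n_lags):
    """Lag weights beta_k = sum_j a_j * k**j for k = 0, ..., n_lags.

    `poly_coeffs` holds the polynomial coefficients a_0, a_1, ...
    (illustrative values only; the estimated coefficients are not
    reported in this section).
    """
    return [
        sum(a * k ** j for j, a in enumerate(poly_coeffs))
        for k in range(n_lags + 1)
    ]


def short_and_long_run(weights):
    """Return (impact effect, cumulative long-run effect) of a unit change."""
    return weights[0], sum(weights)


if __name__ == "__main__":
    # Hypothetical second-degree polynomial over three lag periods.
    weights = almon_weights([1, 2, -1], 2)
    print(weights, short_and_long_run(weights))
```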

In applying a model to long-run simulations for 15 years beyond the
sample period, a number of pitfalls are encountered. First, the choice of
values for exogenous variables is subject to considerable error, and
unfortunately the policy implications are not independent of this choice. A
second source of concern is that the estimated structure based on a
restricted range of observations will prove deficient over an extended
sample range. As shown in Griffin (1974a), the simulation period utilizes
ranges of exogenous variables well beyond those in the sample period.
This is especially true of the exogenous values for the respective fossil fuel
prices. Unfortunately, little can be done except to warn the reader of the
potential gravity of this problem and to label the results as rough
approximations.⁵

The electricity model contains endogenous determination of the
demands for electrical power, the choice of the primary fuel input sources
to meet these generation requirements, and determination of electricity
prices. In the model, prices of fuel and capital inputs, which are
exogenous, play an important role. Factor input prices not only affect the
choice of fuel type and the efficiency of conversion to electricity, but
indirectly affect the price of electricity and, consequently, the demand for
electricity. Sulfur tax analysis is achieved by adjusting the various fossil
fuel prices for a given tax rate and exogenous set of technological control
cost assumptions.

To acquaint the reader with the model and yet avoid the detail of
Griffin (1974a, 1974b), the model is condensed here from 33 equations to
17 and presented in general functional form. This condensation implies
(1) that numerous less important endogenous variables appear as
explanatory variables but do not reappear as dependent variables, (2) that
only major identities are included, and (3) that some behavioral equations
appear in forms other than those in which they were estimated. Exogenous
variables are indicated by an asterisk following the variable. The model
is best characterized as consisting of the following four blocks: electricity
demand, fuel source for generation, electricity price determination, and
welfare calculation for sulfur taxes. No attempt is made to endogenize
the demand for factor inputs other than fuel. Consequently, the
simulations presume that capacity will grow at a rate sufficient to meet
generation requirements.


⁵ Chow tests for stability of coefficients over the sample period were performed, but
they provide only weak evidence for observations over much different sample ranges.




The demand for electricity is separated into two distinct functional
forms. In equation (1), the residential demand for electricity, DR,
depends on distributed lags, β(L) and γ(L),⁶ involving per capita real
disposable income, YD*/N*, and the average residential price of
electricity, PR, deflated by the GNP deflator, P*; the residential stock of air
conditioners, KR; and population, N*:

$$DR = f\left[\beta(L)\,\frac{YD^*}{N^*},\; \gamma(L)\,\frac{PR}{P^*},\; KR,\; N^*\right] \qquad (1)$$

The residual, consisting primarily of industrial and commercial
demands for electricity, DI, is a function of the level of economic activity
as measured by real GNP, X*, the stock of commercial air conditioning,
KC,⁷ and a distributed lag on the average industrial and commercial
price of electricity, PI, deflated by the GNP deflator, P*:

$$DI = f\left[X^*,\; KC,\; \rho(L)\,\frac{PI}{P^*}\right] \qquad (2)$$

Given a determination of residential and industrial demands for
electricity, we can determine total generation requirements to be met by
electric utilities, TG, from the following relationship by specifying net
imports, Z1*, transmission losses, Z2, and own-use generation by
industries, Z3*:


TG = DR + DI - Z1* + Z2 - Z3* (3)

It also follows in equation (4) that total generation by electric utilities,
TG, must be met from one of the following five sources: nuclear
generation, NG; natural-gas-fired generation, GG; oil-fired generation, OG;
coal-fired generation, CG; and hydroelectric generation, HG*:

TG = NG + GG + OG + CG + HG* (4) 

In equation (5), nuclear generation, NG, depends on the previous level
of installed nuclear capacity, which is reflected in the previous year's
nuclear generation, NG₋₁, and the increment in generation requirements
to be met by nuclear and fossil fuels, Δ(TG - HG*), multiplied by the


⁶ β(L) is the lag operator indicating a distributed lag on X as follows: β(0)X +
β(1)X₋₁ + β(2)X₋₂ + ... + β(n)X₋ₙ. Note that the subscripts below X indicate the number
of periods lagged.

⁷ Both residential and commercial stocks of air conditioning are endogenous and
depend on lagged income and GNP variables, respectively. Their direct inclusion in
(1) and (2) allows the adjustment of the user cost of fossil capital for peak shaving in (6).




share of new nuclear capacity relative to nuclear and fossil capacity
additions, ΔCN/(ΔCN + ΔC):

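$$NG = NG_{-1} + \Delta(TG - HG^*)\cdot\frac{\Delta CN}{\Delta CN + \Delta C}\left[\left(\frac{PNF^*}{PFF\cdot HR}\right)_{-7},\; \left(\frac{PNK^*}{PFK^*}\right)_{-7}\right] \qquad (5)$$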

The share of new nuclear capacity is assumed to depend on the relative
prices of nuclear fuel, PNF*, relative to fossil fuel, PFF·HR,⁸ and the
relative capital prices of nuclear, PNK*, and fossil, PFK*. The
subscripts indicate that new capacity expansions depend on relative prices 7
years previously.

Before explaining the generation rate by the various fossil fuels, it is
instructive to determine the efficiency with which Btu inputs of fossil fuel
are converted into kilowatt-hours of electricity. This efficiency measure
is characterized by the heat rate, HR, which indicates the number of
Btu's of fuel input needed to generate a kilowatt-hour of output. The
heat rate, HR, depends on distributed lags involving the rate of new fossil
capacity additions, ΔC/C*, and the user cost of capital in fossil plants
adjusted for peak-shaving requirements, UC, relative to the average price
of fossil fuels, PFF:

$$HR = f\left[\lambda(L)\,\frac{\Delta C}{C^*},\; \mu(L)\,\frac{UC}{PFF}\right] \qquad (6)$$

Given the exogenous value for hydroelectric generation, HG*, the rate
of nuclear generation, NG, in (5), and the determination of total
generation, TG, in (3), we now need to determine oil, coal, and gas generation
in (4). Natural-gas generation, GG, is assumed to depend on the
conversion efficiency, HR, and on supply conditions of "low-priority" gas for
boiler fuel uses, AG*:

GG = f(HR, AG*) (7)

In equation (8), coal generation, CG, depends on the conversion
efficiency of fossil fuels, HR, the remaining market to be met by coal or fuel
oil, OG + CG, a dummy sulfur pollution control variable, DU*, for the
period 1969-71, and a distributed lag on the price differential between
fuel oil and coal, PO* - PC*:

CG = f[HR, OG + CG, DU*, φ(L)(PO* - PC*)] (8)

Given HG*, NG, GG, CG, and TG, oil generation, OG, is treated as a
mathematical residual and solved for in equation (4). The assumption


⁸ PFF is the average price per million Btu's, which must be adjusted by the heat rate,
HR, a measure of conversion efficiency, in order to provide a fossil fuel cost in
kilowatt-hours.




that oil is the marginal fuel source certainly is not inconsistent with recent
experience.
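
The two accounting identities, (3) and (4), can be sketched as follows. The function and argument names are illustrative, and the numerical values in the usage lines are hypothetical; only the identities themselves come from the text.

```python
def total_generation(dr, di, z1_net_imports, z2_losses, z3_own_use):
    """Equation (3): TG = DR + DI - Z1* + Z2 - Z3*."""
    return dr + di - z1_net_imports + z2_losses - z3_own_use


def oil_generation(tg, ng, gg, cg, hg):
    """Equation (4) solved for OG, the residual (marginal) fuel source."""
    return tg - ng - gg - cg - hg


if __name__ == "__main__":
    # Hypothetical quantities in billions of kilowatt-hours.
    tg = total_generation(dr=500, di=900, z1_net_imports=10,
                          z2_losses=120, z3_own_use=60)
    og = oil_generation(tg, ng=100, gg=350, cg=700, hg=250)
    print(tg, og)
```

Treating oil as the residual means that any error in the other generation equations accumulates in OG, which is one reason the assumption about the marginal fuel matters for the sulfur results.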

The electricity price block of the model relates the fuel choices and the
exogenous values for the prices of the various fuels to the price of
electricity. Equation (9) expresses the simple identity that the average price
of fossil fuel in Btu's, PFF, which appears in (5) and (6), depends on the
prices of natural gas, PG*, fuel oil, PO*, and coal, PC*, weighted
appropriately:


$$PFF = PG^*\,\frac{GG}{GG + OG + CG} + PO^*\,\frac{OG}{GG + OG + CG} + PC^*\,\frac{CG}{GG + OG + CG} \qquad (9)$$


Similarly, in equation (10), we calculate the average price of all fuel 
inputs on a kilowatt-hour output basis, PF, by weighting appropriately 
the price of nuclear fuel, PNF*, and the price of fossil fuel adjusted to an 
output basis, PFF• HR. 


$$PF = PNF^*\,\frac{NG}{TG} + (PFF\cdot HR)\,\frac{GG + OG + CG}{TG} \qquad (10)$$
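
A minimal sketch of the two price-averaging identities follows. The fossil weights in the second function are taken to be fossil generation over total generation, which is an assumption about the form of equation (10); the heat rate is expressed in million Btu per kilowatt-hour so that PFF·HR is a cost per kilowatt-hour. All numerical values are hypothetical.

```python
def average_fossil_price(pg, po, pc, gg, og, cg):
    """Equation (9): generation-weighted average price of fossil fuel
    (same units as the input prices, e.g. cents per million Btu)."""
    fossil = gg + og + cg
    return (pg * gg + po * og + pc * cg) / fossil


def average_fuel_price(pnf, pff, hr_mmbtu_per_kwh, ng, gg, og, cg, tg):
    """Equation (10), assuming fossil generation / TG as the fossil weight.

    pnf is the nuclear fuel price per kilowatt-hour; pff * hr converts the
    fossil price per million Btu to a price per kilowatt-hour.
    """
    fossil = gg + og + cg
    return pnf * ng / tg + pff * hr_mmbtu_per_kwh * fossil / tg


if __name__ == "__main__":
    pff = average_fossil_price(pg=30.0, po=40.0, pc=20.0, gg=1.0, og=1.0, cg=2.0)
    pf = average_fuel_price(pnf=0.2, pff=pff, hr_mmbtu_per_kwh=0.01,
                            ng=1.0, gg=1.0, og=1.0, cg=2.0, tg=5.0)
    print(pff, pf)
```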


The average residential price of electricity, PR, is treated as a markup
over these unit fuel costs, PF, capital costs, AUC,⁹ and a variable to
reflect the declining rate schedule as per customer purchases increase,
DR/N*:

$$PR = f\left(PF,\; AUC,\; \frac{DR}{N^*}\right) \qquad (11)$$


Similarly, the average industrial price is treated as a markup over costs
with a scale proxy, DI/X*:

$$PI = f\left(PF,\; AUC,\; \frac{DI}{X^*}\right) \qquad (12)$$


The welfare block of the model calculates the net welfare gains arising
from a sulfur tax. In equation (13), control costs are calculated as a
function of the use of coal and oil inputs, OG, CG, and HR, and the exogenous
control costs per 1 million Btu's for oil and coal, CCO* and CCC*:


CC = f(CCO*, OG, CCC*, CG, HR) (13)


As illustrated later, the welfare losses, WL, from a tax consist of the
control costs, CC, plus the loss in consumer surplus, f( ), which depends on


⁹ Based on a weighted average of past values of the user cost of capital for utilities as
provided by the annual Wharton model (Preston 1972).




price and quantity differentials from a base case assuming the air to be a
free good:

WL = CC + f(PI - PI*, DI* - DI, PR - PR*, DR* - DR, P*) (14)

The GNP deflator, P*, is included to provide welfare calculations in
constant 1971 dollars.

In order to calculate the welfare gains from imposing a sulfur tax, it
is first necessary to calculate the emissions of sulfur from a given tax.
Merely by using coal and oil generation, OG and CG, together with the
heat rate, HR, the Btu inputs of the two sulfur-bearing fuels are determined.
Sulfur oxide emissions, SU, can then be calculated given the fuel inputs
and the percentage sulfur in each fuel remaining after control processes,
%SC* and %SO*, as follows:

SU = f(%SC*, CG, %SO*, OG, HR) (15)

Welfare gains, WG, resulting from the improvement in air quality simply
equal the incremental damage cost of a ton of sulfur, IDC*, multiplied
by the reduction in sulfur emissions as a result of the tax, (SU* - SU):

WG = IDC*(SU* - SU) (16)

Net welfare gains, NWG, are simply the difference between welfare gains,
WG, and welfare losses, WL:

NWG = WG - WL (17) 
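
Equations (16) and (17) amount to simple arithmetic, sketched below. The loss in consumer surplus is passed in as a number because its functional form is not given in this section; the input values in the usage lines are hypothetical.

```python
def welfare_gain(idc, su_base, su_taxed):
    """Equation (16): WG = IDC* (SU* - SU), with SU* the base-case emissions."""
    return idc * (su_base - su_taxed)


def net_welfare_gain(wg, control_costs, consumer_surplus_loss):
    """Equation (17): NWG = WG - WL, where WL = CC + loss in consumer surplus."""
    welfare_loss = control_costs + consumer_surplus_loss
    return wg - welfare_loss


if __name__ == "__main__":
    # Hypothetical example: damage cost in dollars per ton, emissions in tons.
    wg = welfare_gain(idc=580, su_base=10, su_taxed=4)
    print(wg, net_welfare_gain(wg, control_costs=1000, consumer_surplus_loss=200))
```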

Sulfur tax simulations are easily performed because the prices of all
fuels are exogenous to the model. For a given tax rate, the prices of fuel
oil and coal are adjusted to reflect their respective control costs, CCO*
and CCC*, as well as the tax paid on the residual sulfur content, which
involves the tax rate applied to the percentage remaining sulfur, %SC*
and %SO*. Note that the assumption of exogenous fuel prices, independent
of the sulfur tax rate, is somewhat unrealistic, but to endogenize such
elements is well beyond the scope of this paper.
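
The price adjustment can be sketched as follows. The conversion from percentage sulfur by weight to pounds of sulfur per million Btu requires a heat content per pound of fuel, which is an input assumed here for illustration; the function names and the numbers in the usage lines are hypothetical and are not taken from the model.

```python
def sulfur_lb_per_mmbtu(pct_sulfur, btu_per_lb):
    """Pounds of sulfur contained in one million Btu of fuel."""
    return (pct_sulfur / 100.0) * 1_000_000 / btu_per_lb


def taxed_fuel_price(base_price, control_cost, tax_cents_per_lb,
                     pct_sulfur_after_control, btu_per_lb):
    """Fuel price in cents per million Btu after control costs and sulfur tax.

    base_price and control_cost are in cents per million Btu; the tax is
    levied only on the sulfur remaining after the control process.
    """
    residual = sulfur_lb_per_mmbtu(pct_sulfur_after_control, btu_per_lb)
    return base_price + control_cost + tax_cents_per_lb * residual


if __name__ == "__main__":
    # Hypothetical fuel: 1 percent residual sulfur, 10,000 Btu per pound.
    print(taxed_fuel_price(40.0, 14.5, 15.0, 1.0, 10_000))
```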

The model includes many of the substitution effects discussed in the
introduction that depend on relative factor and product prices. For
example, in addition to the substitution of low-sulfur fuel for a high-sulfur
fuel, a sulfur tax will affect the price differential between coal and fuel
oil, causing an interfuel substitution effect in (8). Similarly, a sulfur tax
will raise the price of fossil fuel, PFF, and induce a substitution for
nuclear generation in (5). The model also includes the possibility of
substitution between fuel inputs and capital inputs, as the heat rate in (6)
depends on the relative prices of capital and fossil fuel. The lagged price
variables in both the residential and industrial demand equations imply
that consumers will respond to the higher electricity prices by substituting





TABLE 1

Selected Short- and Long-Run Elasticity Estimates
(Calculated about Sample Means)

Variables                         Short-Run Elasticity   Long-Run Elasticity

DR | PR/P                                -0.06                  -0.52
DI | PI/P                                -0.04                  -0.51
ΔCN/(ΔCN + ΔC) | PNF/(PFF·HR)             0                     -1.40
ΔCN/(ΔCN + ΔC) | PNK/PFK                  0                     -0.86
HR | UC/PFF                               0.03                   0.30
CG | PC                                  -0.02                  -0.20
CG | PO                                   0.02                   0.24


other goods for electricity-intensive goods. The treatment of substitution
effects is incomplete, since interfuel substitution among coal, fuel oil,
nuclear, and the other power sources (natural gas and hydroelectricity)
does not occur in the model. This omission is not serious, however, as
supply conditions are likely to restrict their substitutability.

While the model includes the primary substitution effects, it is important
to realize that these substitution effects do not occur instantaneously. To
illustrate this, table 1 reports the elasticities of various variables in the
model with respect to given factor or product price changes for 1 year
(a short-run elasticity) and after all adjustments (a long-run elasticity).
The estimates are calculated about sample means. For example, the first
entry shows the elasticity of residential electricity demand, DR, with
respect to the real price of electricity, PR/P. These estimates indicate that
substantial substitution effects are present in the long run but, in the
short run, choices as to fuel consumption and electricity demand are
highly circumscribed by existing capital configurations.

III. The President's Sulfur Tax

In this section, we consider the welfare and distributional effects of the
Pure Air Tax Act of 1972 as they relate to electric utility generation. The
model is simulated over the period 1972-85 with varying
technological assumptions about sulfur control costs and with alternative
sulfur tax rates; then these results are compared with a base case solution
in which the air is treated as a free good. The relative differences between
the alternative simulations are probably much more informative from a
policy perspective than the absolute welfare losses and gains, which are
conditional on quite restrictive assumptions.

¹⁰ For a comparison of these estimates with other studies such as by Anderson (1972)
and Wilson (1971), see Griffin (1974a).








Fig. 1. Welfare effects of sulfur tax



Estimates of welfare gains and losses, which are calculated in the manner
suggested by Harberger (1971), are particularly dependent on
determination of the appropriate damage function. Welfare gains are defined as
those resulting from the reduction in sulfur emissions, while welfare
losses are the social costs in terms of control costs, losses in consumer
surplus, and administrative costs. The net welfare gain is the difference
between the two components. This analysis presumes that sulfur control
is the least-cost alternative; for example, neither plants nor residents can
relocate at lower costs. The administrative costs of the tax are excluded
from the calculation both because of inadequate data and because of their
insignificant effects on the principal conclusions of this section.

Figure 1 depicts the welfare losses resulting from the tax assuming a
flat long-run marginal cost function. The LRAC denotes the long-run
average cost function for electricity generation before the imposition of
the tax. After the imposition of the tax, the new long-run average cost
function becomes LRAC′ as the costs of the taxes, area BCDE, and
control costs, area ABEF, are internalized. While the industry's average costs
increase by AC, the incremental welfare loss consists of the control costs,
ABEF, and the loss in consumer surplus, DFG. These sources of welfare
loss comprise the two shaded areas in figure 1. Of course, these calculations
presume that the full amount of the control costs and taxes are reflected
in both residential and industrial electricity prices.
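
The two components of welfare loss can be approximated with a short calculation. This is a sketch under assumptions not confirmed by the text: demand is taken to be approximately linear over the relevant range, control costs are measured on post-tax output, and the consumer surplus triangle has the full price increase (control cost plus tax per unit) as its height. Tax payments are treated as a transfer and excluded.

```python
def welfare_loss(unit_control_cost, unit_tax, q_before, q_after):
    """Approximate welfare loss from a tax under a flat long-run cost curve.

    Returns (control_costs, consumer_surplus_triangle, total).
    The rectangle and triangle correspond loosely to areas ABEF and DFG;
    that mapping is an assumption made for illustration.
    """
    control_costs = unit_control_cost * q_after
    price_rise = unit_control_cost + unit_tax
    triangle = 0.5 * price_rise * (q_before - q_after)
    return control_costs, triangle, control_costs + triangle


if __name__ == "__main__":
    # Hypothetical units: cost per kilowatt-hour and output in kilowatt-hours.
    print(welfare_loss(unit_control_cost=2.0, unit_tax=1.0,
                       q_before=100.0, q_after=90.0))
```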

Estimates of welfare gains resulting from reduced sulfur emissions are
particularly suspect because of the uncertainty surrounding the damage
function. Following Ridker and Henning's results, we assume that the
marginal damage function is flat over the ranges considered in this
analysis. Furthermore, a marginal damage cost of $580 per ton of sulfur
in 1971 dollars is assumed, as it is the average social cost per ton of





sulfur emissions implicit in the Environmental Protection Agency's
estimate of $16.1 billion damage from all forms of air pollution.¹¹ Because
of the dubious methodological procedures for this estimate, the reader
may want to substitute his own estimate of the marginal damage cost.

Before a comparison of sulfur emission profiles for various tax rates can
be made, the cost curves for removing sulfur from fuel oil and coal must
be specified. In this context, engineering cost data are a necessary input.
The process technology for the desulfurization of fuel oil is well known,
and the cost data are reliable. Detailed private estimates¹² of the cost
function were used which postulate that the marginal control cost function
for fuel oil desulfurization is a step function with two steps.¹³

The technology for economically reducing the sulfur content of coal
is not well developed. By the process of wet cleaning, high-sulfur coals
can be reduced to 1 percent sulfur, but this appears to have only limited
applicability. Flue-gas desulfurization experiments are being conducted
with a wet limestone scrubbing process as well as several others, but
neither the exact cost nor effectiveness is known. For this reason, the
model was simulated under a variety of assumptions, from "not
available" to costs ranging from 14.5¢/10⁶ Btu ($3.50 per ton of coal) to
27.1¢/10⁶ Btu ($6.50 per ton of coal).¹⁴ The most likely case at this point
appears to be the simulation at 14.5¢/10⁶ Btu ($3.50 per ton). In all
simulations, the process is assumed to remove 87 percent of the sulfur.
While there is the possibility of opening new low-sulfur coal mines, the
average sulfur content of coal before control is assumed to remain constant
at its 1964 rate (2.3 percent).¹⁵
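
The paired cost quotations above (cents per million Btu and dollars per ton of coal) imply a heat content per ton of coal, which can be backed out with the following sketch. The function names are illustrative; the only inputs are the figures quoted in the text.

```python
def implied_mmbtu_per_ton(dollars_per_ton, cents_per_mmbtu):
    """Heat content (million Btu per ton) implied by a pair of cost quotes."""
    return dollars_per_ton * 100.0 / cents_per_mmbtu


def cents_per_mmbtu_to_dollars_per_ton(cents_per_mmbtu, mmbtu_per_ton):
    """Convert a cost per million Btu into a cost per ton of coal."""
    return cents_per_mmbtu * mmbtu_per_ton / 100.0


if __name__ == "__main__":
    low = implied_mmbtu_per_ton(3.50, 14.5)
    high = implied_mmbtu_per_ton(6.50, 27.1)
    print(round(low, 2), round(high, 2))
```

Both pairs imply roughly 24 million Btu per ton, so the two sets of units are mutually consistent.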

¹¹ See Council on Environmental Quality (1972, pp. 103-7) and Barrett and Waddell
(1973).

¹² Private estimates by a large petroleum refining firm are believed superior to public
estimates such as those in Hausgaard (1971).

¹³ On the first step, the resid desulfurization process can reduce the sulfur content of a
barrel of fuel oil from 2.5 to 0.9 percent sulfur at a cost of 42¢/barrel or 8¢/pound of
sulfur removed. To reduce the sulfur content of fuel oil from 0.9 to 0.3 percent, vacuum
gas oil desulfurization must be utilized at a cost of 14.5¢/pound of sulfur removed.
The process technology to reduce sulfur content below 0.3 percent apparently does not
exist except by blending low-sulfur crude oils (< 0.3 percent), the supply of which is
quite limited. Thus, the marginal cost curve is assumed to become vertical at the point
where the sulfur content of fuel oil approaches 0.3 percent.

¹⁴ One of the more recent estimates of the flue-gas desulfurization costs is given in the
National Economic Research Associates discussion of electric power in Council on
Environmental Quality (1972a). According to my rough calculations based on their data,
the cost of sulfur removal they are implicitly assuming is $2.30/ton of coal. This is based
on taking 7 percent of average revenue per kilowatt-hour (1.7¢/kilowatt-hour), which is
1.2 mills/kilowatt-hour. Of this 1.2 mills/kilowatt-hour, fixed costs for cooling towers and
flue-gas processes constitute 0.75 mills/kilowatt-hour, the flue-gas component of fixed
costs being 0.6 mills. Combined fixed and variable costs are 0.95 mills/kilowatt-hour
(0.6 + 0.35 mills per kilowatt-hour), which implies a fuel cost of 9.5¢/10⁶ Btu, or $2.30/
ton of coal with standard Btu content. This estimate is believed to be highly optimistic.

¹⁵ Note that no attempt was made to differentiate coals by sulfur content. The U.S.
average sulfur content based on U.S. Bureau of Mines (1966, p. 39) was used.





A base case solution to the model was calculated for the period 1972-85
under the assumption that pollution controls or taxes would not be
utilized. This simulation provides a benchmark by which to evaluate the
effects of a sulfur tax. The exogenous assumptions and the annual results
for key endogenous variables in this simulation are given in Griffin
(1974b). Of particular interest is the estimated annual emissions of sulfur
in millions of tons based on the assumption of no sulfur taxes or controls.
Despite the rapid growth in nuclear capacity, sulfur emissions increase
markedly as generation by fossil-fired plants continues to expand.

Table 2 gives the principal results of nine alternative simulations which
vary in their assumptions about the tax rate and the cost of flue-gas
desulfurization. With the exception of the tax adjustment and control cost
variants, the same exogenous variables are utilized as in the base case
simulation.

It is assumed that the tax becomes operative in 1976. In
addition to a 15¢/pound sulfur tax, alternative calculations are made
for tax rates of 10¢ and 20¢/pound.¹⁶ The cost and sulfur content (after
control) for fuel oil desulfurization are similar except in cases 1 and 2, in
which, under the 10¢/pound tax, only the use of the resid desulfurization
process (which costs 8¢/pound of sulfur removed) is economic.

Table 2 reveals that in all cases the cumulative net welfare gains
discounted back to 1976 at 7 percent annually are not only positive but of
substantial magnitude. The magnitude of these numbers is in large part
due to the combination of (1) a large estimate of damages ($580 per ton)
relative to the tax ($200-$400 per ton), (2) the sustained growth of sulfur
oxide emissions in the base case solution, and (3) the effectiveness of the
sulfur removal processes. The differences between the various cases
depend primarily on the availability and cost of flue-gas desulfurization.
Net welfare gains are the lowest in cases 1, 3, and 6, in which it is assumed
that an economically feasible flue-gas desulfurization process is
unavailable. In the cases in which this process is available, the discounted welfare
gains over the 10-year period range from $31 to $42 billion.
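
Two pieces of arithmetic underlie this paragraph: converting a tax in cents per pound of sulfur to dollars per ton, and discounting an annual stream of net welfare gains back to 1976 at 7 percent. The sketch below uses a short ton of 2,000 pounds, which reproduces the $200-$400 range quoted above; the annual values in the usage lines are hypothetical, not simulation results.

```python
def cents_per_pound_to_dollars_per_ton(cents_per_pound, pounds_per_ton=2000):
    """Convert a sulfur tax from cents per pound to dollars per (short) ton."""
    return cents_per_pound * pounds_per_ton / 100.0


def present_value(annual_values, first_year, base_year=1976, rate=0.07):
    """Discount a stream of annual values back to `base_year`.

    annual_values[i] is the value realized in year first_year + i.
    """
    return sum(
        value / (1.0 + rate) ** (first_year + i - base_year)
        for i, value in enumerate(annual_values)
    )


if __name__ == "__main__":
    print(cents_per_pound_to_dollars_per_ton(10), cents_per_pound_to_dollars_per_ton(20))
    # Hypothetical net welfare gains (billions of dollars) for 1977 and 1978.
    print(present_value([107.0, 114.49], first_year=1977))
```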

More important than the absolute size of these net welfare gains is the
implication that even if the actual damage rate is one-half the assumed
rate of $580 per ton, the net welfare gains in all cases would be positive.
In fact, if the reader wishes to substitute his own estimate of the
incremental damage cost for the EPA estimate, these can be calculated simply

¹⁶ The 10¢/pound calculation was made because under the Nixon proposal, some
areas that meet the primary standards would qualify for the lower rate, so that on the
average the effective tax rate will be between 10¢ and 15¢/pound. The 20¢/pound tax
calculation is made to conform roughly to the Proxmire proposal, which would begin
with a 5¢/pound tax in 1972 and an increment of 5¢/pound each year until it reaches
a rate of 20¢/pound in 1975. The Proxmire proposal makes no provision for differential
rates depending on the air quality of different geographic areas.



[Table 2. Principal results of the nine alternative sulfur tax simulations, by tax rate and cost of flue-gas desulfurization; the tabulated values are not recoverable from the source.]


by adjusting welfare gains proportionately.¹⁷ The new welfare gains can
then be subtracted from welfare losses, which are unaffected, to obtain
net welfare gains.
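
The proportional adjustment can be sketched in a few lines. Because the marginal damage function is assumed flat, welfare gains scale linearly with the incremental damage cost while welfare losses are unchanged. The function name and the numbers in the usage line are hypothetical.

```python
def rescaled_net_gain(welfare_gain_at_assumed, welfare_loss, new_idc,
                      assumed_idc=580.0):
    """Net welfare gain after substituting a different damage cost per ton.

    welfare_gain_at_assumed is the gain computed at `assumed_idc`;
    welfare_loss (control costs plus consumer surplus loss) is unaffected.
    """
    return welfare_gain_at_assumed * new_idc / assumed_idc - welfare_loss


if __name__ == "__main__":
    # Hypothetical case: gain of 40 and loss of 8 (billions of dollars).
    print(rescaled_net_gain(40.0, 8.0, new_idc=290.0))
```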

In cases involving available flue-gas desulfurization and similar
technology, the net welfare gains are only slightly larger for the cases with
higher tax rates (e.g., cases 2, 4, and 7 and cases 5 and 8). Substitution
possibilities other than intrafuel substitution have little effect on sulfur
levels in this situation, as their effects are operative on fuels with sulfur
contents reduced by 87 percent. For example, the substitution of a unit
of nuclear generation for a unit of coal generation with only insignificant
sulfur content has little effect on sulfur emissions. Still another factor
contributing to this result is the assumption of homogeneous sulfur content
of coal in the model. In practice, higher tax rates would force the
installation of flue-gas desulfurization for plants burning lower-sulfur coals, so
that somewhat higher welfare gains would be expected from the higher
tax rates, ceteris paribus.

In terms of the physical reduction of sulfur emissions, a comparison of
the average annual sulfur emissions with the base case simulation
indicates marked reductions. For example, if an economic flue-gas
desulfurization process is available, the sulfur emission level from electric utilities in
1985 may be less than one-fifth of today's level.

To demonstrate the point raised earlier with respect to the possibility
of substitution between fuels, between capital and fuel, and between other
goods and electricity-intensive goods, the percentage of the sulfur
reduction attributable solely to the substitution of low-sulfur for high-sulfur
fuel of the same type is reported.¹⁸ The differences between total sulfur
reduction and that due to intrafuel sulfur substitution are attributable to
the other forms of substitution frequently overlooked in analyses of this
type.¹⁹ The importance of these three substitution effects varies
significantly depending on the technology. In cases 1, 3, and 6, which assume no
economic flue-gas desulfurization process, these three substitution effects
account for about two-thirds of the change. In these cases, fuel oil and
nuclear fuel have a decided competitive advantage over coal and induce
a substantial interfuel substitution effect. This is apparent from the
cumulative change in coal, fuel oil, and nuclear use given in the last three

¹⁷ For example, if the incremental damage cost is set at $290/ton, the welfare gains
in table 2 are reduced by 50 percent. With control costs unchanged, the discounted net
welfare gains in billions of dollars equal the following for the nine cases respectively:
3.44, 13.13, 4.57, 13.14, 7.38, 5.19, 13.24, 7.51, and 7.60.

¹⁸ These were based on the coal and fuel oil purchases in each simulation multiplied
by the sulfur differential as a result of control. Calculations of this sort are by necessity
somewhat arbitrary, as the partial effects do not necessarily sum to the total,
due to interaction effects.

¹⁹ For example, see Hausgaard (1971). For an interesting study incorporating part
of these substitution effects, see Chapman (1973).




rows. In the remaining cases, the three substitution effects are less
important, as the reduction in the level of emissions is primarily attributable
to intrafuel substitution. This seems plausible, because these substitution
effects now have an impact on fuels with a much lower percentage sulfur
composition, so that their effects are mitigated.

While the net welfare gain may be the critical variable to the
economist, the politician is likely to be more concerned with the distributional
effects of a tax. Assuming these costs are fully passed forward to residential
and industrial users, estimates of the increase in the residential price of
electricity range from 3.7 to 8.0 percent. For industrial use, price increases
vary from 5.1 to 11.4 percent. Despite significant distributional effects,
the area representing consumer surplus, DFG, is generally small; at most
it constitutes $0.4 billion of the $2.06 billion welfare loss in case 6.

As would be expected, sulfur taxes have a pronounced effect on fuel
oil, coal, and nuclear sales, as revealed by comparing cumulative sales with
the base case solution. In all cases, nuclear generation will increase, but
especially where taxes are high and flue-gas desulfurization is expensive.
The petroleum industry stands to gain a marked competitive advantage
over coal only in the cases of high tax rates and expensive flue-gas
desulfurization (cases 3, 6, and 9). In all cases, coal sales will deteriorate, but
the coal industry is well aware, no doubt, of the long-run unfeasibility of
the present situation, which is implicit in the base case solution. Thus,
from the coal industry's viewpoint, the critical variable is not so much the
tax rate but the development of a cheap flue-gas desulfurization process.


IV. Appropriate Policy Instruments in a Dynamic Framework

The modeling approach emphasizing dynamic substitution possibilities
also gives insights into the selection and effectiveness of air pollution
control instruments. In the Pigouvian tradition, economists frequently prefer
a tax as the optimal control instrument. However, sole reliance on a tax
requires a detailed knowledge of both the damage and the control cost
functions over time. An alternative approach suggested by Baumol and
Oates is to utilize both a tax and an ambient air standard. This approach
may produce suboptimal resource allocation, but it has an administrative
cost advantage of requiring no knowledge of the damage or control cost
function. Baumol and Oates (1971, p. 45) optimistically conclude:
"Experience would soon permit the authorities to estimate the tax levels
appropriate for the achievement of a target reduction in pollution."
Excluding the trivial case of an infinite tax, is it a simple matter to vary
taxes in such a manner? Since the control cost function depends on lagged
as well as current tax rates, would the agency be required to make
substantial rate adjustments over time?





TABLE 3

"Perfect Knowledge" Sulfur Tax Adjustments
(Dollars per Pound)

Year     ZSG (0%/Year)     1.0%/Year     2.0%/Year
1976          0.15            0.15          0.15
1977         16.70           11.11          6.03
1978         11.08            8.04          5.72
1979          3.86            0.15          0.15
1980          8.01            3.00          0.15
1981         11.27            6.74          0.15
1982         12.47            6.70          0.15
1983         14.34            6.77          0.15
1984         11.73            0.15          0.15
1985         16.69            2.59          0.15
1986         21.96           15.80          0.15
1987         13.86            0.15          0.15
1988         13.76            0.15          0.15
1989         21.03            2.61          0.15
1990         23.22            8.32          0.15


Suppose, after initiating a 15¢/pound sulfur tax, that in 1976 society
selects a standard of zero sulfur emission growth, ZSG, or some variant
of it, and chooses to use annual tax variations to achieve this air standard.
The model is then solved for the necessary tax adjustments assuming the
regulatory agency has "perfect knowledge" of the control cost function.
This simulation uses exogenous assumptions similar to those used in
table 2 along with the initial condition of a 15¢/pound tax in 1976, and
the technology implicit in case 4 of table 2. The tax rate is endogenized.
Table 3 illustrates the necessary variations in taxes for ZSG, for a 1
percent/year emission growth, and for a 2 percent/year sulfur emission
growth. The results indicate that substantial year-to-year
modifications would have to be made in the tax rate and that the more ambitious
the standard, the greater the tax adjustments necessary. The relatively
large magnitude of the tax adjustments required should not be taken too
seriously, because the standards selected are quite ambitious by any
measure.²¹ Furthermore, at high tax rates, one would expect greater
substitution possibilities than those implicit in a model estimated over a
much smaller range of price variations. New process technology in the
form of sulfur removal equipment (which is exogenous to the model) also
could be expected to develop in response to the above tax rates, thereby
reducing the tax adjustments necessary.

²⁰ The sample period simulations were extended to 1990 to illustrate that the cyclical
phenomena appear to continue even over the longer period. The exogenous values were
simply extrapolated for the period 1985-90.

²¹ The initial target in 1976 is set at 1.63 million tons of sulfur emissions compared
with an estimated 13.1 million tons of sulfur in the base case solution without air pollution
controls in 1976.









The most interesting result is that the necessary tax adjustments exhibit
a cyclical behavior which is not apparent a priori. The cyclical variations
arise from the fact that lagged electricity prices and lagged relative fuel
prices in the model have a greater impact on current sulfur emissions than
the current period's prices, which include the effect of the current tax
rate. Consequently, the problem with using tax rates as a short-run policy
instrument is much more complex than the case in which there are no
long-run effects, and tax rates can be adjusted continuously until
equilibrium is reached. The problem becomes one of dynamic optimization
rather than static optimization.
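
The mechanism can be illustrated with a deliberately crude sketch. It is not the econometric model used in this paper: it assumes a hypothetical linear relation in which emissions in a year equal a baseline minus `a` times the current tax minus `b` times last year's tax, and it solves each year for the tax that holds emissions at a fixed target, subject to a floor. All names and numbers are illustrative assumptions.

```python
def required_taxes(baseline, target, a, b, initial_tax, floor):
    """Tax path holding emissions at `target` in a toy one-lag model.

    Assumed (hypothetical) relation for year t:
        emissions[t] = baseline[t] - a * tax[t] - b * tax[t - 1]
    The tax is never allowed to fall below `floor`.
    """
    taxes = [initial_tax]
    for base in baseline[1:]:
        # Tax needed this year, given the carry-over effect of last year's tax.
        needed = (base - target - b * taxes[-1]) / a
        taxes.append(max(floor, needed))
    return taxes


baseline = [10.0] * 8  # flat uncontrolled emissions, arbitrary units

# Lagged effect larger than current effect: the required tax swings
# between the floor and a high rate.
lag_dominant = required_taxes(baseline, 4.0, a=1.0, b=1.5,
                              initial_tax=0.15, floor=0.15)

# Current effect larger than lagged effect: the swings die out.
current_dominant = required_taxes(baseline, 4.0, a=1.5, b=1.0,
                                  initial_tax=0.15, floor=0.15)

print([round(t, 2) for t in lag_dominant])
print([round(t, 2) for t in current_dominant])
```

In this toy setting a high tax in one year overshoots the target in the next, so the required rate falls back to the floor and then jumps again, which is qualitatively the pattern in table 3. When the current effect dominates, the same recursion settles toward a constant rate.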

Several more general implications emerge from the preceding exercise.
First, the traditional static approach to pollution questions by economic
theorists is likely to ignore the inherently dynamic nature of the problem.
This result also has important policy implications. The sale of transferable
pollution licenses²² is likely to be a more efficient (less expensive in terms
of information requirements) approach to obtaining target quantities of
pollution control than the Baumol and Oates approach. Presumably,
market forces would produce prices more similar to those in table 3 than
those set by an agency with imperfect knowledge as to the lagged
substitution effects. Another interpretation of these results is to raise the question
of whether the benefits of obtaining short-run pollution targets exceed the
costs. Even if the Dales proposal generated ticket prices approximating
the tax rates in table 3, such wide cyclical variations in the tax rates may
periodically generate substantial excess capacity of pollution control
equipment, thereby creating offsetting welfare losses. The problems with
using taxes to achieve year-to-year changes in sulfur emissions are in
many respects analogous to those faced in stabilization policy. However,
in the case of stabilization policy, there are significant social benefits to
correct policy, while the welfare benefits of meeting some arbitrary
pollution standard are unclear, if possible. While the use of taxes to
achieve year-to-year adjustments in pollution levels seems dubious, it
does not follow that tax policies designed to hold taxes at a constant rate
for long periods are also suspect. To the contrary, the results of table 2
indicate that appropriate tax rates held at sustained levels serve as a very
efficient policy instrument.


V. Other Factors 

The preceding aggregate analysis provides useful general guidelines;
nevertheless, there are several issues which should be considered in
formulating an optimal policy. By aggregating over space and over the

²² For a further discussion by an early advocate of this approach, see Dales (1968,
pp. 92-97).



period of 1 year, the model may mask potentially [illegible]
and seasonal [illegible]. In practice, a tax [illegible] on
regional factors rather than on some national rate applicable to all
regions. For example, the relationship between quantity of sulfur
emissions and air quality measures may vary substantially across regions for
climatological reasons. The marginal damage function depends on
population densities, which vary with geographic region. Regional
differences in per capita income are likely to affect the damage function,
as pollution abatement is a superior good. Also, tastes probably differ
between regions, depending in large part on individuals' economic
interdependence with the local polluting industry. Thus, there are a number
of variables, such as climatological factors, population densities, income
levels, and tastes, which are likely to produce geographically distinct
damage functions. Despite these geographical differences, present
proposals appear to give at most only modest recognition of these problems.
They call for either a uniform tax rate (e.g., Proxmire proposal) or a
differential tax rate (e.g., Nixon proposal), which would allow for a
10¢/pound tax in regions where the primary air quality standard is met
and a zero tax in regions where the secondary air quality standard is met.
In addition, if the marginal damage function is not horizontal, some
consideration should be given to differential tax rates, depending on the
seasonal variation in electricity output. With the increasing peak load
electricity requirements in the summer, tax rates may have to be raised
to reflect higher summer damage costs.

Following Harberger (1971), one might attempt to approximate
second-best effects by adjusting the net welfare gains in table 2 to include
the welfare gains or losses associated with distortions in industries related
to electricity. Wilson (1971) and Anderson (1972) show a small but
significant cross-elasticity of demand between natural gas and electricity,
pointing to one source of distortion. The administered pricing system for
natural gas appears to have resulted in a situation in which at current
consumption levels the long-run supply price exceeds the administered
price. A sulfur tax will increase the price of electricity, thereby shifting
outward the demand schedule for natural gas, raising production in the
short run, and exacerbating the welfare loss. Likewise, if the
environmental costs of nuclear generation are not internalized, sulfur taxes will
increase nuclear generation and produce some additional welfare loss. This
paper will not attempt to estimate these additional losses, because they
are not likely to change the principal conclusion of table 2: that the
imposition of sulfur taxes should result in a net welfare gain.

Nevertheless, the additional welfare loss does have implications for the
tax rate chosen. A comparison of net welfare gains between tax rates of
15¢ and 20¢/pound for similar technology (case 3 with 6, case 4 with 7,
case 5 with 8) indicates only a slight preference for the 20¢/pound tax.










[illegible] 15¢/pound rate due to the [illegible]
desulfurization were certain to be available [illegible],
a 10¢/pound tax (case 2) would be preferable for similar reasons.


VI. Summary

The extension of econometric modeling to problems of environmental
policy offers a potentially fruitful and relatively unexplored area of
research. The dynamic framework, which includes various substitution
possibilities, appears to offer some useful policy insights despite a number
of restrictive assumptions. Over a wide range of marginal damage rates
and technological assumptions, the Pure Air Tax Act of 1972 should result
in a net welfare gain. However, it does not follow that the proposed tax
rate is optimal, as the preceding section raises a number of unanswered
questions. The pervasive nature of the dynamic substitution effects
implicit in the model suggests skepticism with proposals using tax
adjustments to maintain some given air quality standard on a year-to-year
basis. Much more serious thought must be given the issue of selecting the
appropriate policy instruments in a dynamic context.


References 

Anderson, Kent P. "Residential Demand for Electricity: Econometric Estimates
for California and the United States." RAND Report R-905-NSF. Santa
Monica, Calif.: RAND Corp., January 1972.
Anderson, R. J., Jr., and Crocker, T. D. "Air Pollution and Housing: Some
Findings." Paper no. 264. Lafayette, Ind.: Inst. Res. Behavioral Econ. and
Management Sci., Purdue Univ., January 1970.
Ayres, R. U., and Kneese, A. V. "Production, Consumption, and Externalities."
AER 59 (June 1969): 283-97.
Barrett, L. B., and Waddell, T. E. "Cost of Air Pollution Damage: A Status
Report." Washington: Environmental Protection Agency, February 1973.
Baumol, W. J., and Oates, W. E. "The Use of Standards and Prices for Protection
of the Environment." Swedish J. Econ. 73 (March 1971): 42-54.
Chapman, D. "Internalizing an Externality: A Sulphur Emission Tax and the
Electric Utility Industry." Paper presented at the February 1973 M.I.T.
Conference on Energy.
Congressional Record. Vol. 118. Proxmire Bill (S. 3057), January 24, 1972, S. 276;
Nixon Proposal, February 8, 1972, H. 841, 92d Cong., 2d sess., 1972.
Council on Environmental Quality. The Economic Impact of Pollution Control.
Washington: Environmental Protection Agency, March 1972. (a)
---. Environmental Quality, 1971. Washington: Environmental Protection
Agency, March 1972. (b)
Crocker, T. D. "Urban Air Pollution Damage Functions: Theory and
Measurements." Final Report to the National Air Pollution Control Administration,
CPA 22-69-52. Madison, Wis., 1971.





Dales, J. H. Pollution, Property, and Prices. Toronto: Univ. Toronto, 1969.
Griffin, J. M. "The Effects of Higher Prices on Electricity Consumption." Bell J.
(1974), in press. (a)
---. "Energy Use and Environmental Quality: A Look at Prospects and
Policy Alternatives." In Studies in Taxes and Energy, edited by Gerard Brannon.
Boston: Ballinger, 1974, in press. (b)
Harberger, Arnold C. "The Basic Postulates for Applied Welfare Economics:
An Interpretive Essay." J. Econ. Literature 9 (September 1971): 785-97.
Hausgaard, O. "Proposed Tax on Sulphur Content of Fossil Fuels." Public
Utilities Fortnightly, September 16, 1971, pp. 27-32.
MacAvoy, Paul W. Economic Strategy for Developing Nuclear Breeder Reactors.
Cambridge, Mass.: M.I.T., 1969.
Preston, Ross. The Wharton Annual and Industry Forecasting Model. Philadelphia:
Univ. Pennsylvania, 1972.
Ridker, R. G. Economic Cost of Air Pollution. New York: Praeger, 1967.
Ridker, R. G., and Henning, J. A. "The Determinants of Residential Property
Values with Special Reference to Air Pollution." Rev. Econ. and Statis. 49
(May 1967): 246-57.
U.S., Bureau of Mines. Sulphur Content of United States Coals. Information Circular
8312. Washington: Bur. Mines, 1966.
U.S., Congress, Senate, Subcommittee of the Committee on Public Works.
Hearings on Air Pollution, 1970, pt. 1, 91st Cong., 2d sess., 1970.
Wieand, K. F. "Property Values and the Demand for Clean Air: Cross-Section
Study for St. Louis." Ph.D. dissertation, Washington Univ., 1970.
Wilson, John W. "Residential Demand for Electricity." Q. Rev. Econ. and Bus.
2 (Spring 1971): 7-22.



Searching for the Lowest Price When the 
Distribution of Prices Is Unknown 


Michael Rothschild 

Princeton University


Search theory, which purports to explain how individuals behave when
they have imperfect or incomplete market information, has received
much attention recently. Economists have derived a number of results
characterizing the effects of various changes on optimal-search behavior.
Almost without exception, these results depend on the untenable
assumption that searchers know the probability distribution from which they
are searching. This paper studies the effect of assuming instead that
searchers learn about the probability distribution while they search from
it. Not invariably, but in many instances, the qualitative properties of
optimal-search strategies (and thus the behavior of those who follow
them) are the same as in the simpler case when the distribution is
assumed known.


I. Introduction 

Stories about search occupy a central position in the "new
microeconomics of inflation and employment." This accounts for much of the
recent work by economists¹ analyzing variants of the following problem:
A man is considering purchase of some good which is sold at different


A preliminary version of this paper was presented at the NBER conference on
stochastic control at the University of Chicago, June 1973. I benefited from discussions
with David Blackwell, Gregory Chow, Stanley Fischer, Emmett Keeler, Robert E. Lucas,
Daniel McFadden, Roy Radner, and Yoram Weiss. This research was supported by the
Industrial Relations Section of Princeton University, the Mathematical Social Science
Board, and the National Science Foundation.

¹ See Stigler (1961, 1962), McCall (1965, 1970), Nelson (1970), Gastwirth (1971),
Gronau (1971), Gould (1972), Salop (1973), Telser (1973), Whipple (1973), and
Kohn and Shavell (1974).



stores at different prices. He can elicit price quotations from the various
sellers by paying a fee.² What search strategy will the man follow?

Economists are interested in the rules which searchers follow, because
these rules determine the demand functions that sellers in such markets
face and thus, in part, the nature of the markets themselves. Although
the only way to settle the question of what rules searchers follow is by
observation, very little empirical work has been done on this problem.³
Instead, researchers have proposed search rules that reasonable
consumers might follow and have examined their properties. Stigler, who is
largely responsible for introducing this topic into economic theory,
suggested that the individual should visit n stores, obtain price quotations
from each one, and then buy from the lowest-price store. The expected
price paid will be

$$M_n = \int_0^\infty [1 - F(p)]^n \, dp,$$

where F(p) is the distribution of prices. If such a procedure is followed,
the only decision variable is n, the number of stores visited. Stigler pointed
out that the familiar marginal calculations of microeconomic theory will
suffice to determine n. Clearly, $M_n$ is a decreasing function of n, while the
expected gain from searching,

$$G_n = M_{n-1} - M_n = \int_0^\infty [1 - F(p)]^{n-1} F(p) \, dp,$$

also decreases with n. Therefore, n should be chosen so that

$$G_n \ge c > G_{n+1}.$$
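
As a rough numerical illustration of the fixed-sample-size rule, the sketch below approximates the expected minimum price and the marginal gain by midpoint integration and keeps adding stores while the next quotation is still worth its cost. The uniform price distribution on [0, 1], the cost c = 0.04, and all function names are illustrative assumptions rather than part of the original analysis.

```python
def expected_min_price(cdf, n, upper, steps=20000):
    """Approximate M_n, the integral of [1 - F(p)]^n over [0, upper]."""
    h = upper / steps
    return sum((1.0 - cdf((i + 0.5) * h)) ** n for i in range(steps)) * h


def marginal_gain(cdf, n, upper):
    """G_n = M_{n-1} - M_n, the expected saving from the n-th quotation."""
    return (expected_min_price(cdf, n - 1, upper)
            - expected_min_price(cdf, n, upper))


def optimal_sample_size(cdf, c, upper, n_max=1000):
    """Add stores while the next quotation's expected saving covers c.

    At least one store is always visited, since the good must be bought.
    """
    n = 1
    while n < n_max and marginal_gain(cdf, n + 1, upper) >= c:
        n += 1
    return n


def uniform_cdf(p):
    # Assumed example distribution: prices uniform on [0, 1].
    return min(max(p, 0.0), 1.0)


n_star = optimal_sample_size(uniform_cdf, c=0.04, upper=1.0)
print(n_star, round(expected_min_price(uniform_cdf, n_star, 1.0), 4))
```

For uniform prices the integrals have closed forms, with expected minimum price 1/(n + 1) and marginal gain 1/(n(n + 1)), so a cost of 0.04 selects four stores: the fourth quotation saves 0.05 in expectation and the fifth only about 0.033.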

This rule has several interesting properties.

1. If all its potential customers follow this rule, then a firm faces a
well-behaved demand function; expected sales are a nonincreasing function of
the price it charges.

2. Customers' search behavior is a function of the cost of search c and
the distribution of prices F(·). Thus, it is possible to do comparative

² This fee, the cost of search, is generally interpreted as the cost of visiting a store and
obtaining a price quotation. It could also include the cost of doing without the item in
question while search continues, in which case the fact that search takes time would
have to be formally incorporated in the model (see n. 10 below). Other generalizations,
rather trivial from the formal point of view, are possible. Price and cost may be
measured in utility rather than money. However, the utility function must be linear, so
this is a small generalization. The same framework may be used to analyze search for
jobs when wages (or the utility of jobs) are random and unknown. I shall largely ignore
these matters of interpretation.

³ A cynic might suspect that the lack of empirical work reflects a lack of interest in
the real world on the part of those concerned with search rules. A glance at the authors
in n. 1 above suggests that this is not the only reason. It seems to me (and my interest
in such matters is admittedly casual) that the problem of determining characteristics
of search rules empirically is a very difficult one.




statics so as to examine the consequences of changes in F(·) and c. The
most important results of these exercises are:

3. If costs of search increase, the amount of search decreases.

4. As prices become more dispersed, expected total costs decrease. This
follows from the fact that, for all n, $M_n$ decreases as the distribution
becomes more variable.⁴ Other things equal, customers prefer to draw from
riskier distributions. This preference for risk or uncertainty on the part of
those normally deemed risk averse can explain apparently odd or
perverse phenomena. For an application to the theory of migration, see
David (1973).

Interesting as these results are, they depend on people following a
particular search rule, and not a particularly attractive one at that.
Although fixed-sample-size rules have a certain intuitive appeal, they are
not the best search procedures and are in some circumstances simply silly.
A person who rigidly follows a fixed-sample-size rule will, even if he gets
a price quotation less than the cost of search, keep on sampling until his
quota of price quotations is fulfilled. It is thus comforting to know that
the optimal-search rule has all the attractive properties of the
fixed-sample-size rule. The optimal rule is sequential (after receiving each
price quotation, the searcher decides whether to continue searching or to
accept the quoted price) and is characterized by a reservation price; there
is a price R such that the searcher will accept any price less than or equal
to R while he will reject a price higher than R. Once again, familiar
economic reasoning suffices to determine R. If the lowest price the
customer has received to date is S, then the expected gain from searching
once more is

$$g(S) = \int_0^S (S - p) \, dF(p) = \int_0^S F(p) \, dp.$$

⁴ This is generally demonstrated by showing that the derivative of $M_n$ with respect
to a dispersion parameter is negative for particular parametric families of distributions.
However, the result is much more general. Let F(p, t) be a family of distribution
functions with support in [0, B] indexed by t. Then, as Diamond and Stiglitz (1974)
have shown, increases in t correspond to increases in risk in the sense of Rothschild
and Stiglitz (1970) if

$$\int_0^B dF(p, t) = 1, \tag{i}$$

$$\int_0^y F_t(p, t) \, dp \ge 0, \qquad 0 \le y \le B, \tag{ii}$$

and

$$\int_0^B F_t(p, t) \, dp = 0. \tag{iii}$$

If h(p) is any increasing function, then

$$\int_0^B h(p) F_t(p, t) \, dp \le 0.$$

Note that

$$M_n(t) = \int_0^B [1 - F(p, t)]^n \, dp$$

and

$$\frac{\partial M_n}{\partial t} = -n \int_0^B [1 - F(p, t)]^{n-1} F_t(p, t) \, dp \le 0,$$

since $-n[1 - F(p, t)]^{n-1}$ is increasing in p.

The optimal rule is to search whenever the cost of an additional search is
less than the expected gain from that search, so that R must satisfy

$$g(R) = c. \tag{1}$$
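
Condition (1) is easy to solve numerically. The sketch below is a minimal illustration, assuming prices are uniform on [0, 1] and a cost of 0.02 per quotation; the function names are hypothetical. It finds R by bisection and then checks that the expected total cost of following the rule equals R, as note 5 asserts.

```python
def expected_gain(cdf, s, steps=2000):
    """g(S), the integral of F(p) over [0, S], by the midpoint rule."""
    h = s / steps
    return sum(cdf((i + 0.5) * h) for i in range(steps)) * h


def reservation_price(cdf, c, upper, tol=1e-10):
    """Solve g(R) = c by bisection on [0, upper]; g is nondecreasing in S."""
    lo, hi = 0.0, upper
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if expected_gain(cdf, mid) < c:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def uniform_cdf(p):
    # Assumed example distribution: prices uniform on [0, 1].
    return min(max(p, 0.0), 1.0)


c = 0.02  # assumed cost per quotation
r = reservation_price(uniform_cdf, c, upper=1.0)

# With uniform prices each quotation is accepted with probability r and the
# accepted price averages r / 2, so expected total cost is c / r + r / 2.
total_cost = c / r + r / 2
print(round(r, 6), round(total_cost, 6))
```

In the uniform case g(S) = S²/2, so the reservation price is the square root of 2c, here 0.2, and the expected total cost of 0.02/0.2 + 0.2/2 is also 0.2.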

This reservation-price rule has the four properties listed above. That
properties 1 and 2 hold is obvious. Increasing c increases the reservation
price and thus lowers the intensity of search, so that 3 holds. Kohn and
Shavell (1974) have shown that increasing dispersion lowers the
reservation price and thus total expected costs,⁵ so that 4 holds for this rule as
well. They do this by showing that if F(p, t) is, as in note 4 above, a
family of price distributions which becomes more dispersed as t increases,
then

$$g(S, t) = \int_0^S F(p, t) \, dp$$

is an increasing function of t. Therefore, if

$$c = g(R, t) = g(R', t')$$

with t' > t, then R' < R. Thus, the reservation price decreases as price
variability increases, or

5. Increased price dispersion increases the intensity of search. This
last result, which does not hold in general for fixed-sample-size rules,⁶
is a kind of stability property which could be used in a complete model
(one which explained price distributions endogenously) to show the
existence of an equilibrium distribution of prices. It should turn out in
most sensible models that increased search activity will decrease price
dispersion.⁷
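
Properties 4 and 5 can be checked in a simple case. The sketch below assumes prices are uniform on an interval centered at 0.5, so that widening the interval is a mean-preserving increase in dispersion; the cost of 0.01 and the function names are illustrative assumptions. For prices uniform on [a, b], g(S) = (S - a)²/(2(b - a)) on that interval, so condition (1) gives R in closed form.

```python
import math


def reservation_price_uniform(center, half_width, c):
    """Reservation price when prices are uniform on center +/- half_width."""
    a = center - half_width
    width = 2.0 * half_width
    if width < 2.0 * c:
        # Below this spread g(b) < c and no price inside the support solves (1).
        raise ValueError("spread too narrow for an interior reservation price")
    return a + math.sqrt(2.0 * c * width)


def expected_searches_uniform(center, half_width, c):
    """Expected number of quotations, 1 / F(R), under the reservation rule."""
    a = center - half_width
    r = reservation_price_uniform(center, half_width, c)
    return (2.0 * half_width) / (r - a)


c = 0.01  # assumed cost per quotation
for half_width in (0.1, 0.3, 0.5):
    r = reservation_price_uniform(0.5, half_width, c)
    n = expected_searches_uniform(0.5, half_width, c)
    print(half_width, round(r, 4), round(n, 2))
```

As the interval widens, the reservation price (and with it expected total cost) falls while the expected number of quotations rises, which is the pattern described by properties 4 and 5.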

These results depend on the assumption that the searcher behaves as if
he knows the distribution of prices. In any economic context, this is a
very bad assumption. Little is known about the nature of price
distributions, and it seems absurd to suppose that consumers know them with any
⁵ Total expected costs to a searcher following a reservation-price rule are just equal
to the reservation price. This follows from the fact that expected costs do not change as
the searcher continues to sample. The logic of rule (1) might be stated as: Accept any
price less than the expected costs of continuing to search; otherwise continue.

⁶ If, in the notation of n. 4, $G_n(t) = M_{n-1}(t) - M_n(t)$, then it is not true that
$G_n(t)$ increases with t whenever F(p, t) satisfies (i), (ii), and (iii). I suspect that most
increases in price dispersion which do not lead to piling up of probability mass at the
extreme points of the distribution will increase $G_n(t)$, but I have not yet been able to
formulate this precisely.

⁷ For a discussion of the need for such a complete model, see Rothschild (1973).




degree of accuracy.⁸ Since the major reason for believing that searchers
follow optimal sequential rules is that they are optimal, it is important
that their cost-minimizing properties not depend crucially on their being
based on correct knowledge of the price distribution. This is unfortunately
not true. Gastwirth (1971) explored the robustness of optimal-reservation-
price rules. He found that modest specification errors could lead to
dramatic increases in the expected number of searches and in the expected
cost of buying. For example, someone who chose a reservation price on
the assumption that prices were distributed uniformly on the unit interval
(F(x) = x) when they really were distributed according to a right-
triangular distribution on the same interval (F(x) = x²) would on the
average incur roughly twice the total costs and search five times as much
as he would if he were correctly informed. If results 1-5 are to be salvaged,
the problem of what the searcher should do if he does not know the price
distribution must be attacked. This can be done in two ways: by exploring
the properties of reasonable rules of thumb, or by characterizing optimal
rules. Telser (1973) took the first approach. He calculated (using Monte
Carlo techniques) the expected costs of various search rules, which had
properties 1-4, against several differently shaped price distributions
and compared these costs with those which would ensue if the searcher
followed the naive rule of taking the first price offered him.
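
The order of magnitude of the Gastwirth example can be reproduced with a short calculation. The text does not state the search cost behind the quoted figures, so the value c = 0.01 below is an assumption chosen for illustration, and the function name is hypothetical. The searcher who believes prices are uniform sets R from R²/2 = c; the correctly informed searcher, facing F(x) = x², sets R from R³/3 = c.

```python
def expected_outcome(reservation, k, c):
    """Expected searches and total cost when the true CDF is F(x) = x**k on
    [0, 1] and the first price at or below `reservation` is accepted."""
    accept_prob = reservation ** k
    searches = 1.0 / accept_prob
    # Mean of an accepted price: density k * x**(k - 1) truncated at R.
    mean_accepted = k * reservation / (k + 1)
    return searches, c * searches + mean_accepted


c = 0.01                          # assumed cost per quotation
r_believed = (2 * c) ** 0.5       # solves R**2 / 2 = c under the uniform belief
r_correct = (3 * c) ** (1 / 3)    # solves R**3 / 3 = c under the true F(x) = x**2

misinformed = expected_outcome(r_believed, 2, c)
informed = expected_outcome(r_correct, 2, c)

print(round(misinformed[0] / informed[0], 2))  # ratio of expected searches
print(round(misinformed[1] / informed[1], 2))  # ratio of expected total costs
```

Under this assumed cost the misinformed searcher samples a little under five times as often and pays a little under twice as much in expectation, in line with the figures quoted above. The informed searcher's expected total cost also equals his reservation price, consistent with note 5.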

This paper takes the second tack. Optimal-search rules from unknown
distributions are derived and characterized.⁹ The results of this exercise
are as follows. In Section II the problem of a man who knows prices
belong to some finite set, but does not know how they are distributed, is
formally described. Section III discusses an important example, the case
where the prior distribution is a Dirichlet. In the next section, dynamic
programming is used to derive the optimal strategy. It is possible to
parameterize the problem so that the optimal-valuation functions are
continuous, a fact which is exploited in Section V, where it is shown in
Theorem I that if a person follows the optimal strategy, search terminates
after a finite number of searches. This is used to prove that property 3
holds for search from unknown distributions, that is, that search decreases

⁸ I suspect they vary a great deal. Characteristics of empirical wage distributions,
which are related to (but distinct from) distributions of wage offers, are unstable over
occupations and across cities (Buckley 1969).

⁹ The statistical literature on optimal-search rules is extensive. Breiman (1964),
DeGroot (1970, chap. 13), and Chow, Robbins, and Siegmund (1971) are surveys of
increasing sophistication. Some of the results of this paper appear to be new, particularly
Theorems I, II, and III. The work most closely related to that presented here is that of
Yahav (1966) and DeGroot (1968). Yahav considers the general problem of searching
with recall (the searcher is allowed to return to stores previously sampled) from an
unknown distribution, shows that optimal rules exist under quite general circumstances,
and discusses how they could be approximated. DeGroot completely solves the problem
faced by a man searching (with or without recall) from a normal distribution of known
variance and unknown precision.




as its cost increases. In Section VI it is shown (in Theorem II) that if
prior beliefs are Dirichlet the optimal-search rule has a reservation-price
property. Searchers will accept a price if and only if it is less than some
particular price $p_R$. The reservation price is a function of the searcher's
beliefs. It changes as his beliefs are revised in the light of experience.
Section VII is devoted to proving, in Theorem III, an analogue of
properties 4 and 5 for the Dirichlet case.

The most important results of this paper are that for the example of
Section III, optimal-search rules from unknown distributions have the
same qualitative properties as optimal rules from known distributions.
Since it is easy to construct examples for which this is not true, it is
natural to ask how general these results are. The final section considers
this question. I believe them to be more general than it may appear, as
they do not seem to depend on prices being confined to a finite set.
However, they are still quite special, as the proofs depend on the process of
revising beliefs to accommodate new information having a particular
(and not terribly natural) local property. Whether similar results hold
if this assumption is abandoned is an open and difficult question.


II. Preliminaries

Consider a man trying to buy a good at the lowest total expected cost.
At the beginning of each period, this man pays c and receives a price
quotation; upon receiving it, he must decide whether to accept it or to
pay (at the beginning of the next period) c again and receive another
price quotation. Total costs include the price actually paid as well as the
costs of search.¹⁰ I assume that the man does not have the privilege of
recall, that is, that an offer once spurned cannot be taken up again. As is
well known, this assumption is innocent (it has no effect on the optimal
strategy) when the price distribution is assumed known. It is not innocent
in the present case.

For simplicity, suppose that there are only a finite number of prices
$p_1, p_2, \ldots, p_n$; label them in ascending order so that

$$p_1 < p_2 < \cdots < p_{n-1} < p_n. \tag{2}$$

The probability distribution of prices is a multinomial distribution. It
is completely characterized by the vector $\Pi$ whose $i$th element $\Pi_i$ is the
probability that the $i$th price is chosen. Since $\Pi$ is a probability
distribution,

$$\Pi \in \Delta = \Big\{(x_1, \ldots, x_n) \in \mathbb{R}^n \;\Big|\; x_i \ge 0,\ \sum_i x_i = 1\Big\}.$$

¹⁰ Given that the process takes place in time, perhaps it would be more logical to
minimize discounted expected costs. This would introduce no serious complications.
See Kohn and Shavell (1974) for a discussion of the effect of varying the discount rate
on search rules.




Previous work has assumed that the searcher knows $\Pi$. Instead, I assume
that he has a prior distribution $F(\cdot)$ over $\Delta$. As he continues to search,
he gathers more information about the distribution of prices, which he
assimilates by updating his prior according to Bayes's rule. All necessary
information about his experience is contained in the statistics

$$N = (N_1, \ldots, N_n),$$

where $N_i$ is the number of times price $i$ has been observed.

It is convenient to parameterize his experience slightly differently. If
$S(N) = \sum_i N_i$, then $\mu$, the vector whose $i$th element is

$$\mu_i = \frac{N_i}{S(N)}, \tag{3}$$

represents the relative frequency with which each price has been observed,
while

$$\rho = S(N)^{-1} \tag{4}$$

is the reciprocal of the total number of prices he has observed; therefore, $(\mu, \rho)$
also contains all the information the searcher has accumulated. This
parameterization permits a distinction between the content of this
information, represented by $\mu$, and its precision, represented by $\rho$. With
these definitions, new information is assimilated as follows. With the
observation of price $i$, $(\mu, \rho)$ becomes

$$h_i(\mu, \rho) = \left(\frac{\mu_1}{1+\rho}, \ldots, \frac{\mu_i + \rho}{1+\rho}, \ldots, \frac{\mu_n}{1+\rho}, \frac{\rho}{1+\rho}\right). \tag{5}$$
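The consistency of the updating rule (5) with the definitions (3) and (4) can be checked directly. The following is a minimal sketch, not part of the original paper: the function names `to_mu_rho` and `update` and the sample counts are hypothetical, and exact rational arithmetic is used only so that the comparison is free of rounding error.

```python
from fractions import Fraction


def to_mu_rho(counts):
    """Map counts N to (mu, rho) following definitions (3) and (4)."""
    total = sum(counts)
    mu = [Fraction(n, total) for n in counts]
    rho = Fraction(1, total)
    return mu, rho


def update(mu, rho, i):
    """Apply rule (5) after observing the price with 0-based index i."""
    new_mu = [m / (1 + rho) for m in mu]
    new_mu[i] = (mu[i] + rho) / (1 + rho)
    return new_mu, rho / (1 + rho)


# Hypothetical experience: price 1 seen twice, prices 2 and 3 once each.
mu, rho = to_mu_rho([2, 1, 1])
mu_after, rho_after = update(mu, rho, 2)      # observe price 3 once more
mu_direct, rho_direct = to_mu_rho([2, 1, 2])  # recount from scratch
```

Updating through (5) and recounting from the raw tallies give the same pair, which is the sense in which $(\mu, \rho)$ carries all the information in $N$.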


There is a slight technical difficulty with this convention. The updating
rule (5) is not consistent with the definitions (3) and (4) when $N = 0$.
This problem may be finessed. Suppose $X = (x_1, \ldots, x_n)$ and
$Z = (z_1, \ldots, z_n)$; define

$$X^Z = x_1^{z_1} x_2^{z_2} \cdots x_n^{z_n}. \tag{6}$$

A man with initial beliefs $F(\Pi)$ and no experience is no different from a
man with initial beliefs $F'(\Pi)$ and experience $N = (N_1, \ldots, N_n)$, where
$F'(\Pi)$ is a probability distribution over $\Delta$ satisfying $\Pi^{N}\,dF'(\Pi) = K\,dF(\Pi)$
for some positive constant $K$ or, using the definitions (3) and (4),

$$\Pi^{\mu/\rho}\,dF'(\Pi) = K\,dF(\Pi).$$

In the sequel, we largely ignore the prior distribution $F(\cdot)$ and focus
instead on the information

$$(\mu, \rho) \in \Gamma = \Delta \times [0, 1], \tag{7}$$

which is updated according to (5).




Corresponding to any $(\mu, \rho)$ is a vector $\lambda(\mu, \rho)$ whose $i$th component
is given by

$$\lambda_i(\mu, \rho) = \frac{\int \Pi_i\, \Pi^{\mu/\rho}\, dF(\Pi)}{\int \Pi^{\mu/\rho}\, dF(\Pi)}. \tag{8}$$

The term $\lambda(\mu, \rho)$ is a probability distribution which represents the
searcher's expected beliefs in that he would take a small bet on the
proposition that the next price observed would be $p_i$ at the odds $\lambda_i(\mu, \rho)$
to $[1 - \lambda_i(\mu, \rho)]$. Since $\lambda(\mu, \rho)$ may be considered the index of a
posterior distribution based on a sample size of $\rho^{-1}$ which converges to a
normal distribution with mean equal to the sample mean as sample size
increases,

$$\lim_{\rho \to 0} \lambda_i(\mu, \rho) = \mu_i. \tag{9}$$
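For a prior with only finitely many support points the integrals in (8) reduce to weighted sums, which makes the formula easy to evaluate. The sketch below is illustrative only: the function `predictive`, the two-point prior, and its weights are hypothetical choices, not taken from the paper.

```python
from math import prod


def predictive(prior, counts):
    """Return lambda from (8) for a prior with finitely many support points.

    prior  -- list of (weight, pi) pairs, each pi a price distribution
    counts -- the exponent vector N = mu / rho (times each price was seen)
    """
    # Posterior weight of each support point: prior weight times Pi^N.
    weights = [w * prod(p ** n for p, n in zip(pi, counts)) for w, pi in prior]
    norm = sum(weights)
    return [
        sum(w * pi[i] for w, (_, pi) in zip(weights, prior)) / norm
        for i in range(len(counts))
    ]


# Hypothetical two-point prior over three prices.
prior = [(0.5, (0.0, 0.0, 1.0)), (0.5, (0.99, 0.01, 0.0))]
before = predictive(prior, (0, 0, 0))   # no experience
after = predictive(prior, (0, 1, 0))    # the middle price seen once
```

With this prior a single observation of the middle price rules out the first support point, so the predictive probabilities jump from a mixture of the two distributions to the second one alone.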

III. An Example

A simple and important example will illustrate the nature of the searcher's
problem and clarify the meaning of $N$, $(\mu, \rho)$, and $\lambda(\mu, \rho)$. Suppose that
the searcher's prior is a Dirichlet distribution. Since the Dirichlet is the
conjugate prior of the multinomial distribution, the posterior distribution
will also be a Dirichlet. The Dirichlet is an $n$-parameter distribution,
completely characterized by the numbers $N = (N_1, \ldots, N_n)$.¹¹ The
properties of the Dirichlet are best illustrated by the following parable,
which describes a problem exactly equivalent to that faced by the
searcher. Assume the $N_i$ are integers. In an urn there are $S(N) = \sum_i N_i$
pieces of paper; $p_1$ is written on $N_1$ of these slips, $p_2$ on $N_2$ of them, and
so on. A man draws from the urn at random. If he draws a slip with $p_i$
on it he may either pay $p_i$ or pay $c$, return the slip to the urn, place another
slip with $p_i$ on it in the urn, and draw from the urn again. That is, if he
chooses to continue sampling, he faces the same problem as before except
that the parameter describing the urn is

$$h_i(N) = N + e_i, \tag{10}$$

where $e_i$ is the vector with 1 in the $i$th place and 0 elsewhere. The
probability of getting price $p_i$ from an urn with parameter $N$ is just

$$\lambda_i(N) = \frac{\int \Pi_i\, dF(\Pi)}{\int dF(\Pi)} = \frac{N_i}{S(N)}.$$


¹¹ See, for example, DeGroot (1970, pp. 49-51) for a discussion of the properties of
the Dirichlet distribution.

¹² I am indebted to David Blackwell for pointing out to me this way of looking at
Dirichlet distributions.




The quantity $S(N)$ measures how fast these probabilities will change with
successive drawings. If $S(N)$ is small, new drawings will alter the
composition of the urn considerably; if $S(N)$ is large, they will hardly affect
it. There is no reason why the $N_i$ have to be integers. The above problem
is equivalent to the searcher's problem for any positive $N$ as long as the
probability of drawing $p_i$ from an urn with parameter $N$ is $N_i/S(N)$ and
$N$ is updated by (10) when $p_i$ is observed. Letting $\mu_i = N_i/S(N)$ and
$\rho = S(N)^{-1}$, it is easy to check that the updating formula (10) is
consistent with (5) and that $\lambda_i(\mu, \rho) = \lambda_i(N) = N_i/S(N) = \mu_i$.

It is important to realize that this story does not describe how price
quotations are generated. In fact, there is a real price distribution which
generates price quotations. However, the searcher does not know this
distribution. His knowledge of the distribution is described by $N$, the
composition of the urn. If price quotations are really generated by $\Pi$,
then the strong law of large numbers states that the proportion of prices
in the urn will, with probability 1, converge to $\Pi$.
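The distinction between the urn (the searcher's beliefs) and the true distribution can be illustrated by simulation. This is a sketch under stated assumptions: the true distribution `true_pi`, the initial urn, the number of draws, and the seed are all hypothetical values chosen for illustration.

```python
import random


def simulate_urn(true_pi, counts, draws, seed=0):
    """Generate quotes from true_pi and add one slip to the urn per quote.

    Returns the final urn proportions N_i / S(N).
    """
    rng = random.Random(seed)
    counts = list(counts)
    for _ in range(draws):
        # The quote comes from the true distribution, not from the urn.
        i = rng.choices(range(len(true_pi)), weights=true_pi)[0]
        counts[i] += 1          # rule (10): N becomes N + e_i
    total = sum(counts)
    return [n / total for n in counts]


true_pi = [0.2, 0.5, 0.3]                      # hypothetical true distribution
shares = simulate_urn(true_pi, counts=[1, 1, 1], draws=20000)
```

After many quotes the urn proportions lie close to `true_pi`, whatever the initial composition, as the strong law of large numbers suggests.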

The Dirichlet case is more general than it might appear. I have shown
that if the prior is Dirichlet then it is possible to parameterize experience
so that $\lambda_i(\mu, \rho) = \mu_i$. As I show below, this property characterizes the
Dirichlet completely. Thus, it follows from (9) that as experience
accumulates, all searchers come to behave as if their priors were Dirichlet.

Proposition 1: A searcher has a Dirichlet prior if and only if it is possible
to parameterize his experience so that $\lambda_i(\mu, \rho) = \mu_i$ for all $(\mu, \rho) \in \Gamma$.

Proof: The "only if" part of the proposition has already been
demonstrated. Suppose $F(\cdot)$ is a probability measure on $\Delta$ and that, when
$\lambda_i(\mu, \rho)$ is defined by (8), $\lambda_i(\mu, \rho) = \mu_i$. There is a Dirichlet distribution
$P(\cdot)$ so that, when $\hat\lambda(\mu, \rho)$ is defined analogously, $\hat\lambda(\mu, \rho) = \lambda(\mu, \rho)$. It
follows that for all $N \ge 0$

$$\frac{\int \Pi^{N+e_i}\, dF(\Pi)}{\int \Pi^{N}\, dF(\Pi)} = \frac{\int \Pi^{N+e_i}\, dP(\Pi)}{\int \Pi^{N}\, dP(\Pi)}.$$

It is easy to see by induction that

$$\int_\Delta \Pi^{N}\, dF(\Pi) = \int_\Delta \Pi^{N}\, dP(\Pi) \tag{11}$$

for all nonnegative integer $N$. The Stone-Weierstrass theorem implies
that

$$\int_\Delta u(\Pi)\, dF(\Pi) = \int_\Delta u(\Pi)\, dP(\Pi)$$

for all bounded continuous functions $u(\cdot)$ defined on $\Delta$. Thus, by
theorem 1.3 of Billingsley (1968), $F(\cdot)$ and $P(\cdot)$ coincide.




IV. The Optimal Strategy

It is now possible to describe the optimal strategy of a man whose
knowledge of prices is represented by the parameters $(\mu, \rho)$, which are updated
according to (5). This is done in the standard way, by induction. Let

$$V_0(\mu, \rho) = \sum_i \lambda_i(\mu, \rho)\, p_i \tag{12a}$$

and

$$V_T(\mu, \rho) = \sum_i \lambda_i(\mu, \rho) \min\{p_i,\ V_{T-1}[h_i(\mu, \rho)] + c\}. \tag{12b}$$

The quantity $V_{T-1}(\mu, \rho)$ is the minimum expected cost incurred by a
searcher with prior experience $(\mu, \rho)$ who is allowed to search at most $T$
times but must accept the $T$th price offer made to him. It is easy to see by
induction that $V_T(\mu, \rho) \le V_{T-1}(\mu, \rho)$ and $V_T(\mu, \rho) \ge p_1$ for all $T$, so
that the $V_T(\mu, \rho)$ converge. Let

$$V(\mu, \rho) = \lim_{T \to \infty} V_T(\mu, \rho).$$

Then $V(\mu, \rho)$ satisfies

$$V(\mu, \rho) = \sum_i \lambda_i(\mu, \rho) \min\{p_i,\ V[h_i(\mu, \rho)] + c\}, \tag{13}$$

an equation which defines the optimal policy: If $p_i$ is drawn when beliefs
are $(\mu, \rho)$, accept if

$$p_i \le V[h_i(\mu, \rho)] + c; \tag{14}$$

otherwise elicit another price offer.
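The recursion (12a)-(12b) is straightforward to evaluate when beliefs are Dirichlet, because then $\lambda_i = N_i/S(N)$ and updating is $N \to N + e_i$. The following is a minimal sketch under those assumptions; the prices, the search cost, and the function names `value` and `accepts` are hypothetical and chosen only for illustration.

```python
from functools import lru_cache

PRICES = (1.0, 2.0, 3.0)   # hypothetical prices p_1 < p_2 < p_3
COST = 0.1                 # hypothetical search cost c


@lru_cache(maxsize=None)
def value(counts, horizon):
    """V_T from (12a)-(12b) for Dirichlet beliefs described by counts N."""
    total = sum(counts)
    if horizon == 0:
        # (12a): the next quote must be accepted.
        return sum(n * p for n, p in zip(counts, PRICES)) / total
    v = 0.0
    for i, (n, p) in enumerate(zip(counts, PRICES)):
        nxt = counts[:i] + (counts[i] + 1,) + counts[i + 1:]
        # (12b): pay p_i now, or pay c and continue with updated beliefs.
        v += (n / total) * min(p, value(nxt, horizon - 1) + COST)
    return v


def accepts(counts, i, horizon):
    """Finite-horizon version of rule (14), for horizon >= 1."""
    nxt = counts[:i] + (counts[i] + 1,) + counts[i + 1:]
    return PRICES[i] <= value(nxt, horizon - 1) + COST


v1 = value((1, 1, 1), 1)
decisions = [accepts((1, 1, 1), i, 1) for i in range(3)]
```

With one further quote allowed and the urn $(1, 1, 1)$, this searcher accepts the two lower prices and rejects the highest, and `value` is nonincreasing in the horizon, as stated in the text.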
Proposition 2: $V(\mu, \rho)$ is continuous.

This fact, although of little interest itself, is the basis of the proof of the
next section and justifies the use of extreme examples in the sections that
follow. It is important to realize that all the proofs given hold for any
$(\mu, \rho) \in \Gamma$, including such strange boundary values as $(\mu = e_i, \rho = 0)$ or
$(\mu = e_i, \rho = 1)$, where $e_i$ is the vector whose $i$th component is 1 while all
other components are equal to 0.

Proof: Since $\lambda_i(\mu, \rho)$ is continuous, so is $V_0(\mu, \rho)$. A simple induction
establishes that $V_T(\mu, \rho)$ is continuous for all $T$. It remains to show that
the $V_T$ converge uniformly.

Let $V_T^{+}(\mu, \rho)$ be the expected total cost to a searcher who for $T$ periods
follows the policy of accepting the elicited price only if (14) is satisfied;
if after $T$ periods no price has been accepted, the searcher must pay $p_n$.
The quantity $V_T^{-}(\mu, \rho)$ is defined similarly except that if he has not chosen
after $T$ periods he receives $p_1$. From their definitions it follows that

$$V_T^{+}(\mu, \rho) \ge V_T(\mu, \rho) \ge V(\mu, \rho) \ge V_T^{-}(\mu, \rho).$$

It will suffice to show that $|V_T^{+}(\mu, \rho) - V_T^{-}(\mu, \rho)|$ converges to 0
uniformly:

$$|V_T^{+}(\mu, \rho) - V_T^{-}(\mu, \rho)| \le \theta_T(\mu, \rho)\, p_n,$$

where $\theta_T(\mu, \rho)$ is the probability, calculated according to the prior
$(\mu, \rho)$, that sampling will not terminate after $T$ periods. However, from
the definition of $\theta_T(\mu, \rho)$ it must be that

$$\theta_T(\mu, \rho)\, Tc \le V_T(\mu, \rho) \le p_n,$$

so $\theta_T(\mu, \rho) \le p_n/(Tc)$ and $\theta_T(\mu, \rho)$ converges to 0 uniformly. This
completes the proof.

V. Finiteness of Search 

In this section it is shown that search will cease after a finite time. The idea
of the proof (for which I am indebted to Robert Lucas) is a straightforward
one. Regardless of his beliefs, a person will always accept $p_1$. If
$p_1$ does not appear for a long time, then he will come to believe that $p_2$ is
the lowest possible price and will of course accept it. Thus, there is a $t_2$
such that after $t_2$ trials either the person has stopped sampling or he will
be willing to accept $p_2$. Similar arguments establish the existence of
$t_3, t_4, \ldots, t_n$ with the same property. After $t_n$ trials, the searcher has
either stopped or will accept any price that occurs.

Theorem I: There is a number $t$ such that a person following the
optimal strategy will have stopped searching after $t$ trials.

Proof: The theorem is not a probability statement. Let $\Omega$ be the set of
all infinite sequences of the prices $p_1, \ldots, p_n$ and let $\omega$ denote an element
of $\Omega$. I shall show that there is a $t$ such that for any $\omega$ sampling will have
stopped by the time the $t$th price is quoted. Before I do so, some notation
is necessary. For any $\omega$, let $\omega_r$ denote its $r$th coordinate. Since $(\mu, \rho)$ are
updated as functions of $\omega$, we may define $(\mu, \rho)[\omega(t)]$ as the values of
$(\mu, \rho)$ obtained after observation of $\omega_1, \ldots, \omega_t$.

Define the acceptance set $A_i$ by $A_i = \{(\mu, \rho) \in \Gamma \mid p_i < V[h_i(\mu, \rho)] + c\}$.
It follows from (9) that if $K_2 = \{(\mu, \rho) \in \Gamma \mid \mu_1 = \rho = 0\}$, then $K_2 \subset A_2$.
Since $A_2$ is open and $\Gamma$ compact, there is an $\varepsilon > 0$ such that if
$(\mu_1, \ldots, \mu_n, \rho) \in \Gamma$ and $\mu_1 + \rho < \varepsilon$, then $(\mu_1, \ldots, \mu_n, \rho) \in A_2$. We may
choose $t_2$ such that if

$$\omega_r \ne p_1 \quad \text{for } r < t_2, \tag{15}$$

then $\mu_1[\omega(t_2)] + \rho[\omega(t_2)] < \varepsilon$; thus if $\omega$ satisfies (15), $(\mu, \rho)[\omega(t_2)] \in A_2$.
Furthermore, if $\omega$ satisfies (15) and $\omega_{s_1} = p_2$ for $s_1 > t_2$, then sampling
will have terminated by $s_1$, since either $\omega_r = p_1$ for some $t_2 < r < s_1$ or
$(\mu, \rho)[\omega(s_1)] \in A_2$.
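The choice of $t_2$ in the argument above can be made concrete in the Dirichlet case. If the searcher starts with counts $N$ and then receives $t$ quotes none of which is $p_1$, the definitions (3) and (4) give $\mu_1 + \rho = (N_1 + 1)/(S(N) + t)$, so a suitable $t_2$ is the first $t$ at which this falls below $\varepsilon$. The sketch below assumes a value for $\varepsilon$, which the proof only shows to exist; the function name and inputs are hypothetical.

```python
from fractions import Fraction


def trials_until_small(n1, total, eps):
    """Smallest t with mu_1 + rho < eps after t quotes that are never p_1.

    With counts, mu_1 = n1 / (total + t) and rho = 1 / (total + t).
    """
    t = 0
    while Fraction(n1 + 1, total + t) >= eps:
        t += 1
    return t


# Hypothetical inputs: p_1 seen once in three quotes, eps assumed to be 1/10.
t2 = trials_until_small(n1=1, total=3, eps=Fraction(1, 10))
```

The bound depends on $\omega$ only through the fact that $p_1$ never appears, which is why the same $t_2$ works for every sequence satisfying (15).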





Now let $K_3 = \{(\mu, \rho) \in \Gamma \mid \mu_1 = \mu_2 = \rho = 0\}$. Clearly $K_3 \subset A_3$
and, by arguments analogous to those used above, there exists a $t_3$ such
that if $\omega$ satisfies (15) and

$$\omega_r \ne p_1,\ \omega_r \ne p_2 \quad \text{for } t_2 \le r < t_3,$$

then

$$(\mu, \rho)[\omega(t_3)] \in A_2 \cup A_3.$$

Continuing in this manner, we establish the existence of a $t_n$ such that for
all $\omega$ either sampling has stopped by $t_n$ or

$$(\mu, \rho)[\omega(t_n)] \in \bigcup_{i=1}^{n} A_i.$$

Letting $t = t_n$ completes the proof.

Theorem I has three important implications. The first is substantive.
The proof was based on the notion that those who fail to find low prices
in their initial searches despair of ever finding them and become willing
to accept high prices. In other words, persons who continue searching
become less finicky as they search more. This seems an important property
of search models. For example, in two very different papers, Kasper
(1967) and Diamond (1971) presume such behavior and examine its
consequences. They do not prove that searchers do behave this way;
instead they state that economic theory (diminishing returns) implies
they should. Gronau (1971), Gould (1972), and Salop (1973) provide
proofs that searchers' standards fall the longer they remain in the market.
Their proofs work for different reasons than the one given here. In the
Gould and Gronau models, search takes time, which is a scarce resource
whose real value rises as more of it has been used; in effect, the cost of
search increases as search goes on. In the Salop model, searchers' standards
fall because searchers first explore the more attractive possibilities.
If these prove unsatisfactory, they demand less of what remains.

The other consequences of Theorem I are somewhat more technical.
The theorem suggests that computing optimal-search rules and the
expected costs of following them is a finite problem; thus, it should be
possible to compute the loss from following such ad hoc rules as those
discussed by Telser (1973) instead of following optimal rules. Finally, it
shows that to prove propositions about $V(\mu, \rho)$ it is sufficient to prove
them for all $V_T(\mu, \rho)$. This technique is used repeatedly in the sequel.
For example, it may be used to prove that property 3 holds in general.

Proposition 3: As the costs of search increase, search decreases.

Proof: Since, according to the optimal-search rule, the search continues
whenever $p_i > V[h_i(\mu, \rho)] + c$, it will suffice to show that if $V(\mu, \rho)$ is
the value of having information $(\mu, \rho)$ when costs are $c$ and $\hat V(\mu, \rho)$ is
the value of having the same information when costs are $\hat c > c$, then
$\hat V(\mu, \rho) \ge V(\mu, \rho)$. Theorem I implies that this will be true if, for all $T$,
$\hat V_T(\mu, \rho) \ge V_T(\mu, \rho)$. This is clearly true if $T = 0$; that it is true for
all $T$ follows by induction from (12b), the definition of $V_T$.


VI. Reservation Prices 


When the distribution of prices is known, optimal-search rules are
characterized by a reservation price: there is a price $p_R$ such that the
searcher will accept all prices less than or equal to $p_R$ and reject all prices
greater than $p_R$. When the distribution of prices is unknown, acceptable
prices change as information changes, so that optimal Bayesian search
procedures cannot be characterized by a single reservation price. It is,
however, worthwhile to ask whether or not they have a reservation-price
property, that is, whether for every state of information $(\mu, \rho)$ there is a
$p_R(\mu, \rho)$ such that prices below $p_R(\mu, \rho)$ are accepted and those above it are
rejected. If the acceptance sets of all customers in a market are
characterized by reservation prices, then the demand function of each seller
in that market is well behaved: if he raises prices, expected sales will not
increase. Raising prices may increase sales if customers' acceptance sets
are not characterized by reservation prices.

In general, optimal Bayesian search rules do not have the reservation-price
property. Counterexamples are easy to construct. Suppose there are
three prices, \$1.00, \$2.00, and \$3.00, and that the cost of search is \$0.01.
Prior beliefs admit the possibility of only two distributions of prices.
Either all prices are \$3.00 or they are distributed between \$1.00 and \$2.00
in the proportions 99 to 1. A man with these beliefs should accept a price
of \$3.00 (as this is a signal that no lower prices are to be had) and reject
a quote of \$2.00 (which indicates that the likelihood that a much better
price will be obtained on another draw is high).
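The arithmetic behind this counterexample can be checked with a short computation. The sketch below rests on one observation: with these beliefs a single quote reveals which of the two distributions is the true one, so the continuation cost after any quote is that of optimal search from a known distribution. The function name `value_known` and the fixed-point iteration are illustrative choices, not the paper's method.

```python
def value_known(prices, probs, c, tol=1e-12, max_iter=10_000):
    """Expected cost of optimal search from a known distribution.

    Iterates v -> sum_i q_i * min(p_i, v + c) to its fixed point.
    """
    v = sum(q * p for p, q in zip(prices, probs))
    for _ in range(max_iter):
        v_next = sum(q * min(p, v + c) for p, q in zip(prices, probs))
        if abs(v_next - v) < tol:
            return v_next
        v = v_next
    return v


PRICES = (1.0, 2.0, 3.0)
COST = 0.01
ALL_HIGH = (0.0, 0.0, 1.0)      # every quote is 3.00
MOSTLY_LOW = (0.99, 0.01, 0.0)  # 1.00 and 2.00 in the proportions 99 to 1

# One quote reveals which distribution is the true one.
posterior = {1.0: MOSTLY_LOW, 2.0: MOSTLY_LOW, 3.0: ALL_HIGH}
accept = {p: p <= value_known(PRICES, posterior[p], COST) + COST for p in PRICES}
```

A quote of 2.00 is rejected because continuing costs only about 1.01 in expectation, while a quote of 3.00 is accepted because continuing costs 3.01; the acceptance set is therefore not an interval of low prices.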

It is easy to see what makes this counterexample work. Price quotations
have value as information. If my beliefs are $(\mu, \rho)$, then the informational
value of getting price $i$ is just $V[h_i(\mu, \rho)]$. In the counterexample above,
the differences in the information value of prices far exceed the differences
in the prices themselves. If differences in the value of price information
are less than differences in prices, then optimal-search rules have the
reservation-price property.

Proposition 4: If for all $(\mu, \rho) \in \Gamma$,

$$|p_i - p_k| \ge \big|V[h_i(\mu, \rho)] - V[h_k(\mu, \rho)]\big|, \tag{16}$$

then

$$V[h_i(\mu, \rho)] + c \ge p_i \ge p_k \tag{17}$$

implies

$$V[h_k(\mu, \rho)] + c \ge p_k. \tag{18}$$




Note that the statement that (17) implies (18) is precisely the reservation-price property.

Proof: Suppose that (16) holds. Then, using (17),

$$|p_i - p_k| = p_i - p_k \ge \big|V[h_i(\mu, \rho)] - V[h_k(\mu, \rho)]\big| \ge V[h_i(\mu, \rho)] - V[h_k(\mu, \rho)].$$

Thus

$$V[h_k(\mu, \rho)] + c - p_k \ge V[h_i(\mu, \rho)] + c - p_i \ge 0.$$

This is a useful criterion. It is used to prove

Theorem II: Optimal rules for searchers with Dirichlet priors have the
reservation-price property.

In view of proposition 1 and equation (9), this is about the most general
result that could be hoped for in this context. The proof which follows
is purely formal and conveys little insight. It is not, however, hard to see
why the theorem should be true. Proposition 4 implies that search rules
will have the reservation-price property whenever observing a price does
not convey information about the relative likelihood, as measured by
$\lambda(\mu, \rho)$, of observing other prices in the future. The rule for updating
the Dirichlet is completely neutral in this sense. For all distinct $i$, $j$, and $k$,
observing $p_i$ has no effect on $\lambda_j(\mu, \rho)/\lambda_k(\mu, \rho)$.
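The neutrality of the Dirichlet updating rule is easy to confirm numerically. In this small sketch the counts are hypothetical and exact fractions are used so that the two ratios can be compared without rounding.

```python
from fractions import Fraction


def predictive(counts):
    """Dirichlet predictive probabilities lambda_i = N_i / S(N)."""
    total = sum(counts)
    return [Fraction(n, total) for n in counts]


before = predictive([3, 2, 5])
after = predictive([4, 2, 5])          # the first price observed once more

ratio_before = before[1] / before[2]   # relative likelihood of prices 2 and 3
ratio_after = after[1] / after[2]
```

Observing the first price raises its own predictive probability and scales the others down by a common factor, so the ratio for the two unobserved prices is unchanged.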


Proof: In view of propositions 1 and 4, it will suffice to show that

$$\lambda_i(\mu, \rho) = \mu_i \tag{19}$$

implies that, for all positive integers $s$,

$$|p_i - p_k| \ge \big|V[h_i^s(\mu, \rho)] - V[h_k^s(\mu, \rho)]\big|, \tag{20}$$

where

$$h_i^s(\mu, \rho) = \left(\frac{\mu_1}{1 + s\rho}, \ldots, \frac{\mu_i + s\rho}{1 + s\rho}, \ldots, \frac{\mu_n}{1 + s\rho}, \frac{\rho}{1 + s\rho}\right) \tag{21}$$

is $(\mu, \rho)$ updated according to (5) after $s$ observations of $p_i$. Note that (16)
is just (20) with $s = 1$. Theorem I implies that it will suffice to show, for
all $t$,¹³

$$|p_i - p_k| \ge |V_t(h_i^s) - V_t(h_k^s)|, \tag{22}$$

which can be done by induction. Suppose, as we shall throughout the
proof, that

$$p_i > p_k. \tag{23}$$

Then from (12) and (21),

$$V_0(h_i^s) = \frac{1}{1 + s\rho} \sum_j \mu_j p_j + \frac{s\rho}{1 + s\rho}\, p_i.$$

¹³ The arguments $(\mu, \rho)$ will be suppressed where this can be done without confusion.





Thus,

$$V_0(h_i^s) - V_0(h_k^s) = \frac{s\rho}{1 + s\rho}\,(p_i - p_k) \le (p_i - p_k),$$

so that (22) holds for $t = 0$. The inductive step, that if (22) holds for
$t = T - 1$ then it holds for $t = T$, is proved in a series of lemmas below.
The first is of independent interest. It states that if $p_i \ge p_k$, the man who
has observed $p_i$ $s$ times expects to incur more total costs, other things
equal, than the man who has observed the lower $p_k$ $s$ times.

Lemma 1: $p_i \ge p_k$ implies $V_T(h_i^s) \ge V_T(h_k^s)$.

Proof: I have already shown that

$$V_0(h_i^s) - V_0(h_k^s) = \frac{s\rho}{1 + s\rho}\,(p_i - p_k) > 0.$$

Suppose the lemma true for $t = T - 1$. Then, letting

$$h_j h_i^s = h_j[h_i^s(\mu, \rho)] = h_i^s h_j,$$

$$\begin{aligned}
V_T(h_i^s) - V_T(h_k^s) ={}& \frac{1}{1 + s\rho} \sum_j \mu_j \big\{\min[p_j, V_{T-1}(h_j h_i^s) + c] - \min[p_j, V_{T-1}(h_j h_k^s) + c]\big\} \\
&+ \frac{s\rho}{1 + s\rho} \big\{\min[p_i, V_{T-1}(h_i^{s+1}) + c] - \min[p_k, V_{T-1}(h_k^{s+1}) + c]\big\}.
\end{aligned} \tag{24}$$

The terms in braces which multiply $1/(1 + s\rho)$ are of the form $J(A, B, C)
= \min(A, B) - \min(A, C)$, where $B \ge C$ by the induction hypotheses.
Thus, they are all nonnegative. Consider the term multiplying $s\rho/(1 + s\rho)$.
It is of the form

$$J(A, B, C, D) = \min(A, B) - \min(C, D), \tag{25}$$

where $A = p_i$, $B = V_{T-1}(h_i^{s+1}) + c$, $C = p_k$, and $D = V_{T-1}(h_k^{s+1}) + c$.
Thus, $A \ge C$ and $B \ge D$ by the induction hypotheses. These two
inequalities imply that (25) is nonnegative. Since (24) is a weighted average
of nonnegative quantities, it is nonnegative.

Lemma 2: If (22) holds for $t = T - 1$, then

$$\min[p_j, V_{T-1}(h_j h_i^s) + c] - \min[p_j, V_{T-1}(h_j h_k^s) + c] \le p_i - p_k. \tag{26}$$

Proof: The left-hand side of (26) has the form

$$J(A, B, C) = \min(A, B) - \min(A, C),$$

where $A = p_j$, $B = V_{T-1}(h_j h_i^s) + c$, and $C = V_{T-1}(h_j h_k^s) + c$. Together
(22) and lemma 1 imply

$$0 \le B - C \le p_i - p_k. \tag{27}$$

If $B \le A$, then $C \le A$ and $J = B - C$. If $B > A$, either $C \le A$ and
$J = A - C \le B - C$ or $C > A$ and $J = A - A = 0 \le B - C$. Thus,
$J \le (B - C) \le p_i - p_k$.

Lemma 3: If (22) holds for $t = T - 1$,

$$\min[p_i, V_{T-1}(h_i^{s+1}) + c] - \min[p_k, V_{T-1}(h_k^{s+1}) + c] \le p_i - p_k. \tag{28}$$

Proof: The left-hand side of (28) is of the same general form as (25),
with $A = p_i$, $B = V_{T-1}(h_i^{s+1}) + c$, $C = p_k$, and $D = V_{T-1}(h_k^{s+1}) + c$.
Also, by hypothesis, $A \ge C$, and lemma 1 implies $B \ge D$. We want to
show that $J \le A - C$. There are four cases to consider:

i) $A \le B$ and $C \le D$, so that $J = A - C$.

ii) $A \le B$ and $C > D$. This case may be disregarded, since, if it
obtains, $B - D \ge A - D > A - C$ or $V_{T-1}(h_i^{s+1}) - V_{T-1}(h_k^{s+1}) >
p_i - p_k$, which contradicts the induction hypothesis (22).

iii) $A > B$ and $C > D$, so that $J = (B - D) \le A - C$ by (22).

iv) $A > B$ and $C \le D$, so that $J = (B - C) < A - C$, since $A > B$.
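The case analyses in lemmas 2 and 3 concern only elementary properties of the minimum, so they can be spot-checked by brute force. The sketch below tests both claims on a small integer grid; the grid itself is an arbitrary, hypothetical choice, and a passing check illustrates the inequalities without replacing the proofs.

```python
from itertools import product


def j3(a, b, c):
    """J(A, B, C) = min(A, B) - min(A, C), as in lemma 2."""
    return min(a, b) - min(a, c)


def j4(a, b, c, d):
    """J(A, B, C, D) = min(A, B) - min(C, D), as in (25)."""
    return min(a, b) - min(c, d)


grid = range(6)

# Lemma 2: if B >= C then 0 <= J(A, B, C) <= B - C.
lemma2_ok = all(
    0 <= j3(a, b, c) <= b - c
    for a, b, c in product(grid, repeat=3)
    if b >= c
)

# Lemma 3: if A >= C, B >= D and B - D <= A - C, then J(A, B, C, D) <= A - C.
lemma3_ok = all(
    j4(a, b, c, d) <= a - c
    for a, b, c, d in product(grid, repeat=4)
    if a >= c and b >= d and b - d <= a - c
)
```

The filter `b - d <= a - c` plays the role of the induction hypothesis (22); without it, case ii of lemma 3 would supply counterexamples.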

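Lemma 4: If (22) holds for $t = T - 1$, it holds for $t = T$.

Proof: Use lemmas 2 and 3 to calculate

$$\begin{aligned}
|V_T(h_i^s) - V_T(h_k^s)| &= V_T(h_i^s) - V_T(h_k^s) \\
&= \frac{1}{1 + s\rho} \sum_j \mu_j \big\{\min[p_j, V_{T-1}(h_j h_i^s) + c] - \min[p_j, V_{T-1}(h_j h_k^s) + c]\big\} \\
&\quad + \frac{s\rho}{1 + s\rho} \big\{\min[p_i, V_{T-1}(h_i^{s+1}) + c] - \min[p_k, V_{T-1}(h_k^{s+1}) + c]\big\} \\
&\le \frac{1}{1 + s\rho} \sum_j \mu_j (p_i - p_k) + \frac{s\rho}{1 + s\rho}\,(p_i - p_k) \\
&= (p_i - p_k) = |p_i - p_k|.
\end{aligned}$$

This completes the proof of Theorem II.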
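Inequality (22) can also be examined numerically for Dirichlet beliefs by evaluating the finite-horizon values directly. The following sketch assumes hypothetical prices, search cost, and a small set of starting urns; the helper names `bump`, `value`, and `satisfies_22` are illustrative and not from the paper.

```python
from functools import lru_cache
from itertools import product

PRICES = (1.0, 2.0, 4.0)  # hypothetical prices
COST = 0.05               # hypothetical search cost


def bump(counts, i, s=1):
    """Counts after s further observations of price i."""
    return counts[:i] + (counts[i] + s,) + counts[i + 1:]


@lru_cache(maxsize=None)
def value(counts, horizon):
    """V_T from (12a)-(12b) with Dirichlet beliefs, lambda_i = N_i / S(N)."""
    total = sum(counts)
    if horizon == 0:
        return sum(n * p for n, p in zip(counts, PRICES)) / total
    return sum(
        (n / total) * min(p, value(bump(counts, i), horizon - 1) + COST)
        for i, (n, p) in enumerate(zip(counts, PRICES))
    )


def satisfies_22(counts, i, k, s, horizon):
    """Check |p_i - p_k| >= |V_t(h_i^s) - V_t(h_k^s)| at one state."""
    gap = abs(value(bump(counts, i, s), horizon) - value(bump(counts, k, s), horizon))
    return gap <= abs(PRICES[i] - PRICES[k]) + 1e-12


all_hold = all(
    satisfies_22(counts, i, k, s, t)
    for counts in product((1, 2, 3), repeat=3)
    for i, k in product(range(3), repeat=2)
    for s in (1, 2)
    for t in range(5)
)
```

On this grid the difference in continuation values never exceeds the difference in prices, which by proposition 4 is what delivers the reservation-price property.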

Theorem I implies that the reservation price of those searchers who
remain in the market must eventually increase to $p_n$.¹⁴

¹⁴ In previous versions of this paper, I conjectured that if the prior were Dirichlet, the
reservation price would increase monotonically. I am grateful to Emmett Keeler for
showing me how to construct a counterexample to this conjecture.





VII. Effects of Increasing Uncertainty

The Introduction lists some results on the effects of increased price
dispersion when the price distribution is known. It is natural to ask whether
these results (that increased price dispersion lowers total expected costs
and increases search activity) hold when the distribution is unknown.
Since the searcher's knowledge of the price distribution is defined by the
parameters $(\mu, \rho)$, the question is what effect changes in $\mu$ and $\rho$ which
represent increased uncertainty have on $V(\mu, \rho)$. To answer this, it is
necessary to decide what changes in $(\mu, \rho)$ correspond to increased
uncertainty. There are two obvious candidates. $\lambda(\mu, \rho)$ is the expected
price distribution in the sense that the searcher is just willing to bet that
the next price will be $p_i$ at odds $\lambda_i(\mu, \rho)$ to $[1 - \lambda_i(\mu, \rho)]$. However, $\rho$
represents the precision of the searcher's knowledge of the price
distribution he faces ($\rho = 0$ is subjective certainty). Thus, $\lambda(\mu, \rho)$ represents
what the searcher believes while $\rho$ represents how firmly he believes it.
Increased uncertainty could correspond either to an increase in the
dispersion of $\lambda(\mu, \rho)$, holding $\rho$ constant, or to an increase in $\rho$, holding
$\lambda(\mu, \rho)$ constant. With what I hope is a pardonable abuse of language, I
shall call the first case "increasing perceived uncertainty" and the second
case "increasing confidence." This terminology is justified by the fact that
increasing perceived uncertainty is the natural analogue of the increases
in uncertainty studied when price distributions are assumed known. Both
cases are discussed below.

The effects of increased perceived uncertainty, at least for the Dirichlet
case, are straightforward.

Theorem III: If the searcher's prior is a Dirichlet [$\lambda(\mu, \rho) = \mu$], then
if $\hat\mu$ is riskier than $\mu$, $V(\hat\mu, \rho) \le V(\mu, \rho)$.

Proof: Suppose $\hat\mu$ is riskier than $\mu$. It will suffice to prove the theorem
if $\hat\mu$ and $\mu$ differ by a single mean preserving spread (Rothschild and
Stiglitz 1970), so we shall consider four prices $p_1 < p_2 < p_3 < p_4$ such
that $\hat\mu_i = \mu_i$ for $i \notin \{1, 2, 3, 4\}$ and

$$\hat\mu_1 - \mu_1 = \mu_2 - \hat\mu_2 = A \ge 0 \tag{29}$$

and

$$\mu_3 - \hat\mu_3 = \hat\mu_4 - \mu_4 = B \ge 0, \tag{30}$$

where

$$A(p_2 - p_1) + B(p_3 - p_4) = 0. \tag{31}$$

I shall prove the theorem by showing

$$V_T(\hat\mu, \rho) \le V_T(\mu, \rho) \tag{32}$$

for all $T$ by induction. That (32) holds for $T = 0$ follows from (31) and
(12). In an obvious notation, let $\hat h_j = h_j(\hat\mu, \rho)$.




Let $R$ satisfy

$$V_T(\mu, \rho) = \sum_{j=1}^{R} \mu_j p_j + \sum_{j=R+1}^{n} \mu_j [V_{T-1}(h_j) + c].$$

Theorem II guarantees the existence of such an $R$. Note that

$$\begin{aligned}
V_T(\hat\mu, \rho) &= \sum_j \hat\mu_j \min[p_j, V_{T-1}(\hat h_j) + c] \\
&\le \sum_{j=1}^{R} \hat\mu_j p_j + \sum_{j=R+1}^{n} \hat\mu_j [V_{T-1}(\hat h_j) + c] \\
&\le \sum_{j=1}^{R} \hat\mu_j p_j + \sum_{j=R+1}^{n} \hat\mu_j [V_{T-1}(h_j) + c]
\end{aligned}$$

(the last inequality follows from the induction hypothesis). Let

$$H(\mu, \hat\mu) = \sum_{j=1}^{R} (\mu_j - \hat\mu_j) p_j + \sum_{j=R+1}^{n} (\mu_j - \hat\mu_j)[V_{T-1}(h_j) + c]. \tag{33}$$

Then it will suffice to show that $H(\mu, \hat\mu) \ge 0$.

There are five cases to consider, depending on the relationship of $p_R$ to
$p_1, p_2, p_3, p_4$.

1) $p_R \ge p_4$. In this case,

$$H(\mu, \hat\mu) = (\mu_1 - \hat\mu_1)p_1 + (\mu_2 - \hat\mu_2)p_2 + (\mu_3 - \hat\mu_3)p_3 + (\mu_4 - \hat\mu_4)p_4 = A(p_2 - p_1) + B(p_3 - p_4) = 0.$$

2) $p_4 > p_R \ge p_3$. In this case,

$$H(\mu, \hat\mu) = A(p_2 - p_1) + B\{p_3 - [V_{T-1}(h_4) + c]\} \ge A(p_2 - p_1) + B(p_3 - p_4) = 0.$$

3) $p_3 > p_R \ge p_2$. In this case,

$$H(\mu, \hat\mu) = A(p_2 - p_1) + B[V_{T-1}(h_3) - V_{T-1}(h_4)] \ge A(p_2 - p_1) + B(p_3 - p_4) = 0.$$

The second inequality follows from (22).

4) $p_2 > p_R \ge p_1$. In this case,

$$\begin{aligned}
H(\mu, \hat\mu) &= A[V_{T-1}(h_2) + c - p_1] + B[V_{T-1}(h_3) - V_{T-1}(h_4)] \\
&\ge A[V_{T-1}(h_2) - V_{T-1}(h_1)] + B[V_{T-1}(h_3) - V_{T-1}(h_4)].
\end{aligned} \tag{34}$$

It is shown below in lemma 5 that

$$\frac{V_{T-1}(h_2) - V_{T-1}(h_1)}{p_2 - p_1} \ge \frac{V_{T-1}(h_4) - V_{T-1}(h_3)}{p_4 - p_3}, \tag{35}$$

so that

$$A[V_{T-1}(h_2) - V_{T-1}(h_1)] \ge A\,\frac{p_2 - p_1}{p_4 - p_3}\,[V_{T-1}(h_4) - V_{T-1}(h_3)].$$

Combining (34) and (35), we obtain

$$H(\mu, \hat\mu) \ge \frac{V_{T-1}(h_4) - V_{T-1}(h_3)}{p_4 - p_3}\,[A(p_2 - p_1) + B(p_3 - p_4)] = 0.$$

5) $p_1 > p_R$. In this case, $H(\mu, \hat\mu)$ is just equal to the last expression in
(34), and (35) can be used to show that $H(\mu, \hat\mu) \ge 0$.

To complete the proof we only need prove

Lemma 5: Inequality (35) holds for all $T$.

Proof: If $T = 0$, both sides of (35) are equal to $\rho/(1 + \rho)$. Suppose
(35) holds for $t = T - 1$. Let $R_k$ satisfy

^r(A*) “ f*Jpj 4 

1 Mt+l 

It follows from lemma 1 that $R_1 \le R_2 \le R_3 \le R_4$, so that

Fr(Aj) - yr(ii) 5 T ^y^^'r-l(A^;) - f"r-.(Au)] (36) 

*7+1 

while 

FM - Fr(/‘3) ^ t, Uj[Vj_i{h4j) - Vr.y(h2j)] (37) 

*,+i 

The induction hypothesis implies


0 £ 



ikjj) - Vr-iif^ij) 
Pi - Pi 


^r-i(^j) ~ ^r-i(^i) ^r-i(^4) 


Fr-ijf^Aj) — ^r-i(^3j) ~| 

^4 - ^J J 

-JV-^ (38) 


P2 - Pi 


Pa -Pi 


The last inequality follows from (36) and (37).

An immediate consequence of Theorem III is an analogue of property 4,
that increased price dispersion leads to increased search.

Corollary: If the searcher's prior is Dirichlet, then increasing perceived
uncertainty lowers the reservation price.

The proof, which is obvious, is omitted.
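A numerical illustration of Theorem III may help fix ideas. The sketch below compares two Dirichlet searchers whose beliefs differ by a single mean preserving spread, evaluating the finite-horizon values from (12a)-(12b). All numbers are hypothetical: four equally spaced prices, a search cost of 0.1, $\rho = 0.25$, and a spread with $A = B = 0.2$.

```python
from functools import lru_cache

PRICES = (1.0, 2.0, 3.0, 4.0)  # hypothetical prices p_1 < ... < p_4
COST = 0.1                     # hypothetical search cost


@lru_cache(maxsize=None)
def value(counts, horizon):
    """V_T from (12a)-(12b) with Dirichlet beliefs, lambda_i = N_i / S(N)."""
    total = sum(counts)
    if horizon == 0:
        return sum(n * p for n, p in zip(counts, PRICES)) / total
    v = 0.0
    for i, (n, p) in enumerate(zip(counts, PRICES)):
        nxt = counts[:i] + (counts[i] + 1,) + counts[i + 1:]
        v += (n / total) * min(p, value(nxt, horizon - 1) + COST)
    return v


def counts_from(mu, rho):
    """Recover N = mu / rho; the N_i need not be integers."""
    return tuple(m / rho for m in mu)


rho = 0.25
base = counts_from((0.1, 0.4, 0.4, 0.1), rho)
spread = counts_from((0.3, 0.2, 0.2, 0.3), rho)   # riskier, same mean price

values = [(value(spread, t), value(base, t)) for t in range(5)]
```

With no search allowed the two expected costs coincide, since the spread preserves the mean; once search is allowed, the riskier beliefs give the lower expected cost at every horizon computed here.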

The effects of increased confidence are less clear. I suspect that if the
prior is Dirichlet, then in most cases $V(\mu, \rho)$ decreases as $\rho$ decreases.
However, I have not been able to prove that this is always true or to
construct an example where $V(\mu, \rho)$ increases when $\rho$ decreases.¹⁵

¹⁵ Earlier versions of this paper contained what I thought was an example of $V(\mu, \rho)$
increasing when $\rho$ decreased. I am grateful to Mr. Richard Remler for showing me that
this example was mistaken. Remler's senior thesis (Princeton University, 1974) contains
the details of this and other calculations of the effects of various parameter changes on
optimal-search strategies.



VIII. Conclusion


The most important results of this paper, Theorems II and III, state that,
for what proposition 1 suggests is a quite significant example, the
qualitative behavior of persons searching optimally from unknown distributions is
the same as that of persons searching optimally from known distributions.
Since it is easy to construct examples for which this is not true, it is
natural to ask just how general these results are. This problem is best
stated by focusing on a very general and abstract formulation of the
problem of optimal search. Let $K$ be a compact¹⁶ set of the real line and
let $\mathcal{M}(K)$ be the set of all probability measures on $K$. The quantity
$\mathcal{M}(K)$ is a compact separable metric space, so that we can, in the standard
way, define probability measures on $\mathcal{M}(K)$.¹⁷ Such a probability
measure, say $\mathcal{P}$, represents a searcher's a priori beliefs about the price
distribution he faces (which is some probability distribution in $\mathcal{M}(K)$). If a
price $p$ is observed, then as long as the conditional distributions (on
$\mathcal{M}(K)$ given $p$) can be calculated, Bayes's rule may be used to update his
beliefs from $\mathcal{P}$ to, say, $h_p(\mathcal{P})$. As in Sections II and IV above, the optimal
Bayesian strategy can be devised. The measure $\mathcal{P}$ completely determines
the optimal-search rule. The question at hand is, For what class of $\mathcal{P}$ do
analogues of Theorems II and III hold?

Each $\mathcal{P}$ has two rather distinct aspects. The first is $S(\mathcal{P})$, the support of
$\mathcal{P}$, which is roughly the set of price distributions which the searcher
whose beliefs are described by $\mathcal{P}$ judges to be conceivable. This paper
focuses on the case where $S(\mathcal{P})$ contains only multinomial distributions.
The second aspect of $\mathcal{P}$ is the updating rule. Theorems II and III require
that the updating rule be of a very special kind: if $p_i$ is observed, then the
(subjective) likelihood of observing $p_i$ in the future is increased while
the likelihood of observing all other prices is decreased (and by the same
proportion).

Recent work of Ferguson (1973)¹⁸ suggests that it is only the restrictions
on the updating rule and not those on $S(\mathcal{P})$ that are required for
Theorems II and III. Ferguson introduces a class of distributions whose
support is essentially all of $\mathcal{M}(K)$; I conjecture that analogues of Theorems
I, II, and III hold for these distributions. However, these probability
measures imply very special updating rules. If we ignore many
technicalities, Ferguson's work may be summarized as follows. Let $\alpha$ be a finite
measure on $K$ and let $B = (B_1, B_2, \ldots, B_n)$ be a measurable partition

¹⁶ I continue to insist that price distributions be bounded. No meaningful generality
and much complication ensues if this assumption is abandoned.

¹⁷ See Parthasarathy (1967, pp. 39 ff.) for details.

¹⁸ Blackwell and MacQueen (1973) show how Ferguson distributions can be
understood in terms of the urn model of Section III. In terms of that model, we can phrase
our question as follows: Do our results hold when the composition of the urn is altered
(upon observation of price $p_i$) in some other way than by adding another slip with $p_i$
written on it?




of $K$ (that is, the $B_i$ are measurable and $\bigcup_i B_i = K$ and $B_i \cap B_j = \emptyset$
if $i \ne j$). Consider the probability that the next observed price belongs to
$B_1, B_2, \ldots,$ or $B_n$. This is a multinomial distribution described by the numbers
$(P(B_1), \ldots, P(B_n)) \in \Delta$. Each such multinomial distribution belongs to
$\mathcal{M}(K)$ and is thus assigned probability by $\mathcal{P}$. The distribution $\mathcal{P}$ is a
Ferguson distribution with parameter $\alpha$ if the distribution of $(P(B_1),
P(B_2), \ldots, P(B_n))$ is Dirichlet with parameter $(\alpha(B_1), \alpha(B_2), \ldots, \alpha(B_n))$.
Ferguson shows (1973, pp. 213-16) that the support of $\mathcal{P}$ is the set of all
measures $\mu \in \mathcal{M}(K)$ which are absolutely continuous with respect to $\alpha$.
By choosing $\alpha$ appropriately, $S(\mathcal{P})$ can be made very large.

The rule for updating is very simple. If $\mathcal{P}$ is a Ferguson distribution
with parameter $\alpha$, after price $p$ is observed, it is updated to a Ferguson
distribution with parameter

$$h_p(\alpha) = \alpha + \delta_p, \tag{39}$$

where $\delta_p$ is the probability measure concentrated on the point $p$. This
formula is an exact analogue of the rule given in equation (10) above for
updating Dirichlet priors.

Proving analogues of Theorems I, II, and III for Ferguson distributions
whose parameters have compact support should involve only technical
problems.¹⁹ If these could be solved, it would show that the results of this
paper are quite general in the sense that they apply to searchers whose
beliefs are not restricted in any way beyond boundedness. However, they
are quite special in that they apply only to people who revise their beliefs
in a very special way. Formula (39) implies that information is strictly
local or, to put the matter more colloquially, that a miss is as good as a
mile. My future beliefs about the likelihood of observing $p$ are affected in
exactly the same way by observing $p + 10^{-10}$ as by observing $p + 10^{10}$.
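The locality of rule (39) can be seen from the predictive probabilities it implies: after $n$ quotes, the probability that the next price falls in a set $B$ is $(\alpha(B) + \text{number of past quotes in } B)/(\alpha(K) + n)$. The sketch below assumes, purely for illustration, a base measure $\alpha$ that is uniform on an interval with a given total mass; the class name and all numbers are hypothetical.

```python
class FergusonBeliefs:
    """Predictive probabilities under a Ferguson prior.

    The base measure alpha is assumed uniform on [lo, hi] with the given
    total mass; this choice is for illustration only.
    """

    def __init__(self, lo, hi, total_mass):
        self.lo, self.hi, self.total_mass = lo, hi, total_mass
        self.observed = []

    def base_mass(self, a, b):
        """alpha([a, b)) for the uniform base measure."""
        a, b = max(a, self.lo), min(b, self.hi)
        return self.total_mass * max(b - a, 0.0) / (self.hi - self.lo)

    def observe(self, price):
        """Rule (39): alpha becomes alpha + delta_p."""
        self.observed.append(price)

    def predictive(self, a, b):
        """Probability that the next quote lies in [a, b)."""
        hits = sum(1 for p in self.observed if a <= p < b)
        return (self.base_mass(a, b) + hits) / (self.total_mass + len(self.observed))


near = FergusonBeliefs(0.0, 10.0, 2.0)
near.observe(5.000001)            # a quote just above the interval [4, 5)
far = FergusonBeliefs(0.0, 10.0, 2.0)
far.observe(9.0)                  # a quote far above it

same_effect = near.predictive(4.0, 5.0) == far.predictive(4.0, 5.0)
```

Both observations lower the predictive probability of the interval $[4, 5)$ by exactly the same amount, however close or far from it they fall, which is the sense in which a miss is as good as a mile.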

I would like to know whether the conclusions of Theorems II and III
hold when information is not so completely local. At present there seems
no way to answer this question with generality. I know of no class of
priors on $M(K)$ which has both large support and tractable updating rules
other than the Ferguson distributions.

It is possible to examine the properties of priors with rather restricted support
by examining parametric families of distributions with tractable
updating rules. For example, it is a simple consequence of DeGroot’s
(1968) work that optimal-search rules from normal distributions with
unknown means and known variance have the reservation-price property
when the prior distribution is also normal.
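
The tractable updating rule for this normal case can be sketched as follows. This is the textbook conjugate update for a normal mean with known variance, not a reconstruction of DeGroot's search rule. The function and argument names are hypothetical, and precisions (reciprocals of variances) are used for convenience.

```python
def update_normal_prior(prior_mean, prior_precision, price, price_precision):
    """Posterior (mean, precision) for an unknown mean after one observed price.

    Prices are assumed normal with known precision `price_precision`
    (1 / variance); the prior on the unknown mean is normal.
    """
    posterior_precision = prior_precision + price_precision
    posterior_mean = (
        prior_precision * prior_mean + price_precision * price
    ) / posterior_precision
    return posterior_mean, posterior_precision

# Hypothetical numbers: prior centered at 10, one observed price of 4,
# prior and observation equally precise.
mean, precision = update_normal_prior(10.0, 1.0, 4.0, 1.0)
print(mean, precision)
```

Here a single observation moves the whole belief about the mean toward the observed price, so beliefs about nearby prices shift as well.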

Perhaps the strongest possible result along the lines of Theorem II¹⁸
that could be hoped for is the following:

¹⁷ It should be possible to push through proofs very similar to those in the text once
a topology is found in which proposition 2 holds.

¹⁸ I am not certain how propositions about the effects of increasing dispersion are best
stated in this context.





Conjecture. Optimal-search rules from exponential families of distributions
have the reservation-price property if the unknown parameters have
conjugate prior distributions.

Establishing this would significantly expand the applicability of the
results of this paper. Still, I think enough has been done to establish that
economists can without great loss assume that the qualitative properties
of demand functions which arise from optimal search from unknown
distributions are the same as those which arise from optimal search from
known distributions.


References 

Billingsley, P. Convergence of Probability Measures. New York: Wiley, 1968.

Blackwell, D., and MacQueen, J. B. “Ferguson Distributions via Polya Urn
Schemes.” Ann. Statis. 1 (1973): 353-55.

Breiman, L. “Stopping-Rule Problems.” In Applied Combinatorial Mathematics,
edited by E. F. Beckenbach. Berkeley: Univ. California Press, 1964.

Buckley, J. E. “Wage Dispersion in Metropolitan Areas.” Monthly Labor Rev. 92,
no. 9 (September 1969): 24-29.

Chow, Y. S.; Robbins, H.; and Siegmund, D. Great Expectations: The Theory of
Optimal Stopping. Boston: Houghton Mifflin, 1971.

David, P. A. “Fortune, Risk, and the Micro-Economics of Migration.” In Nations
and Households in Economic Growth: Essays in Honor of Moses Abramovitz, edited by
P. A. David and M. W. Reder. New York: Academic Press, 1973.

DeGroot, M. H. “Some Problems of Optimal Stopping.” J. Roy. Statis. Soc. B 30
(March 1968): 108-22.

-----. Optimal Statistical Decisions. New York: McGraw-Hill, 1970.

Diamond, P. A. “A Model of Price Adjustment.” J. Econ. Theory 3 (June 1971):
156-68.

Diamond, P. A., and Stiglitz, J. E. “Increases in Risk and in Risk Aversion.” J. Econ.
Theory (1974), in press.

Ferguson, T. S. “A Bayesian Analysis of Some Nonparametric Problems.” Ann.
Statis. 1 (1973): 209-30.

Gastwirth, J. L. “On Probabilistic Models of Consumer Search for Information.”
Unpublished paper, 1971.

Gould, J. P. “Information and Consumer Behavior: Theory of Optimal Sequential
Searching Policies.” Unpublished paper, 1972.

Gronau, R. “Information and Frictional Unemployment.” A.E.R. 61 (June
1971): 290-301.

Kasper, H. “The Asking Price of Labor and the Duration of Unemployment.”
Rev. Econ. and Statis. 49 (May 1967): 165-73.

Kohn, M. G., and Shavell, S. “Optimal Adaptive Search.” J. Econ. Theory,
in press.

McCall, J. J. “The Economics of Information and Optimal Stopping Rules.”
J. Bus. 38 (July 1965): 300-317.

-----. “Economics of Information and Job Search.” Q.J.E. 84 (February 1970):
113-26.

Nelson, P. “Information and Consumer Behavior.” J.P.E. 78 (March/April 1970):
311-29.

Parthasarathy, K. R. Probability Measures on Metric Spaces. New York: Academic
Press, 1967.

Rothschild, M. “Models of Market Organization with Imperfect Information.”
J.P.E. 81 (November/December 1973): 1283-1308.

Rothschild, M., and Stiglitz, J. E. “Increasing Risk. I. A Definition.” J. Econ.
Theory 2 (September 1970): 225-43.

Salop, S. “Systematic Job Search and Unemployment.” Rev. Econ. Studies 40
(April 1973): 191-202.

Stigler, G. J. “The Economics of Information.” J.P.E. 69 (June 1961): 213-25.

-----. “Information in the Labor Market.” J.P.E. 70, no. 5, suppl. (October
1962): 94-105.

Telser, L. G. “Searching for the Lowest Price.” A.E.R. Proc. 63 (1973): 41-49.

Whipple, D. “A Generalized Theory of Job Search.” J.P.E. 81 (September/
October 1973): 1170-88.

Yahav, J. “On Optimal Stopping.” Ann. Math. Statis. 37 (1966): 30-35.





Karl Marx on the Economic Role 
of Science 


Nathan Rosenberg 

University of Wisconsin


It is not the articles made, but how they are made, and by
what instruments, that enables us to distinguish different
economical epochs. [Marx 1906, p. 200]


This paper examines Marx’s treatment of rising resource productivity
and technological change under capitalism. Little attention has been
given to Marx’s view of the role which science plays in these processes.
It is obvious that Marx (and Engels) attach the greatest importance to
the development of modern science, but the way in which scientific
progress meshes with the rest of the Marxian system has not been fully
understood. The paper analyzes Marx’s treatment of the factors which
account for the growth of scientific knowledge as well as capitalist
society’s changing capacity to incorporate this knowledge into the
productive process.


The purpose of this paper is to examine certain aspects of Marx’s treatment
of rising resource productivity and technological change under
capitalism. Many of the most interesting aspects of Marx’s treatment of
technological change have been ignored, perhaps because of the strong
polemical orientation which readers from all shades of the political spectrum
seem to bring to their reading of Marx. As a result, much has been
written about the impact of the machine upon the worker and his family,
the phenomenon of alienation, the relationship between technological
change, real wages, employment, etc. At the same time, a great deal of
what Marx had to say concerning some 300 years of European capitalist
development has received relatively little attention. This applies to his
views dealing with the complex interrelations between science, technology,
and economic development.

It is a well-known feature of the Marxian analysis of capitalism that 
Marx views the system as bringing about unprecedented increases in 

The author is grateful to Professors Stanley Engerman and Eugene Smolensky for
critical comments on an earlier draft.




human productivity and in man’s mastery over nature. Marx and Engels
told their readers, in The Communist Manifesto, that “the bourgeoisie,
during its rule of scarce one hundred years, has created more massive and
more colossal productive forces than have all preceding generations together.
Subjection of Nature’s forces to man, machinery, application of
chemistry to industry and agriculture, steam-navigation, railways, electric
telegraphs, clearing of whole continents for cultivation, canalisation of
rivers, whole populations conjured out of the ground -- what earlier century
had even a presentiment that such productive forces slumbered in the
lap of social labour?” (Marx and Engels 1951, 1:37). No single question,
therefore, would seem to be more important to the whole Marxian analysis
of capitalist development than the question: Why is capitalism such an
immensely productive system by comparison with all earlier forms of
economic organization? The question, obviously, has been put before,
and certain portions of Marx’s answer are in fact abundantly plain. In
particular, the social and economic structure of capitalism is one which
creates enormous incentives for the generation of technological change.
Marx and Engels insist that the bourgeoisie is unique as a ruling class
because, unlike all earlier ruling classes whose economic interests were
indissolubly linked to the maintenance of the status quo, the very essence
of bourgeois rule is technological dynamism.¹ Capitalism generates unique
incentives for the introduction of new, cost-reducing technologies.

The question which I am particularly interested in examining is the
role which is played, within the Marxian framework, by science and
scientific progress in the dynamic growth of capitalism. For surely the
growth in resource productivity can never have been solely a function of
the development of capitalist institutions. It is easy to see the existence of
such institutions as a necessary condition but hardly as a sufficient condition
for such growth. Surely the technological vitality of an emergent
capitalism was closely linked up with the state of scientific knowledge and
with industry’s capacity to exploit such knowledge.

Marx’s (and Engels’s) position, briefly stated, is to affirm that science
is, indeed, a fundamental factor accounting for the growth in resource
productivity and man’s enlarged capacity to manipulate his natural
environment for the attainment of human purposes. However, the statement
requires two immediate and highly significant qualifications, which
will constitute our major concern in this paper: (1) science does not,
according to Marx, function in history as an independent variable; and
(2) science has come to play a critical role as a systematic contributor to
increasing productivity only at a very recent (from Marx’s perspective)

¹ “The bourgeoisie cannot exist without constantly revolutionising the instruments
of production, and thereby the relations of production, and with them the whole relations
of society. Conservation of the old modes of production in unaltered form, was, on the
contrary, the first condition of existence for all earlier industrial classes” (Marx and
Engels 1951, 1:36).




point in history. The ability of science to perform this role had necessarily
to await the fulfillment of certain objective conditions. What these conditions
were has not been understood adequately.

I 

Marx’s treatment of scientific progress is consistent with his broader historical
materialism. Just as the economic sphere and the requirements of
the productive process shape man’s political and social institutions, so do
they also shape his scientific activity at all stages of history. Science does
not grow or develop in response to forces internal to science or the scientific
community. It is not an autonomous sphere of human activity.
Rather, science needs to be understood as a social activity which is responsive
to economic forces. It is man’s changing needs as they become articulated
in the sphere of production which determine the direction of scientific
progress. Indeed, this is generally true of all human problem-solving
activity, of which science is a part. As Marx states in the introduction to
his Critique of Political Economy: “Mankind always takes up only such
problems as it can solve, since, looking at the matter more closely, we will
always find the problem itself arises only when the material conditions
necessary for its solution already exist or are at least in the process of
formulation” (Marx 1904, pp. 12-13).

Marx views specific scientific disciplines as developing in response to
problems arising in the sphere of production. The materialistic conception
of history and society involves the rejection of the notion that man’s intellectual
pursuits can be accorded a status independent of material concerns.
It emphasizes the necessity of systematically relating the realm of
thinking and ideas to man’s material concerns. Thus, the scientific enterprise
itself needs to be examined in that perspective. “Feuerbach speaks
in particular of the perception of natural science; he mentions secrets
which are disclosed only to the eye of the physicist and chemist; but where
would natural science be without industry and commerce? Even this ‘pure’
natural science is provided with an aim, as with its material, only through
trade and industry, through the sensuous activity of men” (Marx and
Engels 1947, p. 36). Egyptian astronomy had developed out of the compelling
need to predict the rise and fall of the Nile, upon which Egyptian
agriculture was so vitally dependent (Marx 1906, p. 564, n. 1). The increasing
(if still “sporadic”) resort to machinery in the seventeenth century
was, says Marx, “of the greatest importance, because it supplied the great
mathematicians of that time with a practical basis and stimulant to the
creation of the science of mechanics.”² The difficulties encountered with

² Marx 1906, pp. 382-83. Engels states: “Like all other sciences, mathematics arose
out of the needs of men; from the measurement of land and of the content of vessels,
from the computation of time and mechanics” (Engels 1939, p. 46; emphasis Engels’s.
Cf. Marx 1906, p. 564).




gearing as waterpower was being harnessed to larger millstones was “one
of the circumstances that led to a more accurate investigation of the laws
of friction.”³

These themes are repeated by Engels, who asserts that “from the very
beginning the origin and development of the sciences has been determined
by production.”⁴ In accounting for the rise of science during the
Renaissance, his first explanation again drew upon the requirements of
industry:


If, after the dark night of the Middle Ages was over, the sciences
suddenly arose anew with undreamt-of force, developing at a
miraculous rate, once again we owe this miracle to production.
In the first place, following the crusades, industry developed
enormously and brought to light a quantity of new mechanical
(weaving, clock-making, milling), chemical (dyeing, metallurgy,
alcohol), and physical (spectacles) facts, and this not only gave
enormous material for observation, but also itself provided
quite other means for experimenting than previously existed,
and allowed the construction of new instruments; it can be said
that really systematic experimental science now became possible
for the first time.⁵

Moreover, in a letter written in 1895, Engels stated: “If, as you say,
technique largely depends on the state of science, science depends far
more still on the state and the requirements of technique. If society has a
technical need, that helps science forward more than ten universities. The
whole of hydrostatics (Torricelli, etc.) was called forth by the necessity for
regulating the mountain streams of Italy in the sixteenth and seventeenth
centuries. We have only known about electricity since its technical applicability
was discovered” (Marx and Engels 1951, 2:457. Letter from
Engels to H. Starkenburg, January 25, 1895; emphasis Engels’s).


³ Marx 1906, p. 411. He adds: “In the same way the irregularity caused by the motive
power in mills that were put in motion by pushing and pulling a lever, led to the theory,
and the application, of the flywheel, which afterwards plays so important a part in Modern
Industry. In this way, during the manufacturing period, were developed the first scientific
and technical elements of Modern Mechanical Industry.”

⁴ Engels 1954, p. 247. Earlier in the paragraph, he had stated: “The successive development
of the separate branches of natural science should be studied. First of all, astronomy,
which, if only on account of the seasons, was absolutely indispensable for pastoral and
agricultural peoples. Astronomy can only develop with the aid of mathematics. Hence
this also had to be tackled. Further, at a certain stage of agriculture and in certain regions
(raising of water for irrigation in Egypt), and especially with the origin of towns, big
building structures and the development of handicrafts, mechanics also arose. This was
soon needed also for navigation and war. Moreover, it requires the aid of mathematics
and so promoted the latter’s development.”

⁵ Ibid., p. 248. The editor of Engels’s unfinished manuscript points out that Engels
had written in the margin of the manuscript opposite this paragraph: “Hitherto, what
has been boasted of is what production owes to science, but science owes infinitely more
to production.”





This statement is probably the most explicit and direct assertion in the
writings of Marx and Engels that factors affecting the demand for science
are overwhelmingly more important than factors affecting its supply.
Scientific knowledge is acquired when a social need for that knowledge
has been established. Science is, however, not an initiating force in the
dynamics of social change. Developments in this sphere are a response to
forces originating elsewhere. Thus, Marx and Engels appear to be presenting
a purely demand-determined explanation of the social role of
science. Scientific enterprise supplies that which industry demands, and
therefore the changing direction of the thrust of science needs to be understood
in terms of the changing requirements of industry.


II

In this section I will argue that, while the demand-oriented component of
the argument just presented is indeed a major part of the Marxian view,
there are also vital but less conspicuous elements in Marx’s argument
which have been ignored. Without these additional and more neglected
elements one cannot explain a central thesis which emerges out of Marx’s
view: namely, that it is only at a particular time in human history that
science is enlisted in a crucial way in the productive process. It is only at a
very recent point in history, Marx argues, that the marriage of science
and industry occurs. Moreover, this marriage does not coincide with the
historical emergence of capitalism. In fact, Marx is quite explicit that the
union of science and industry comes only centuries after the arrival of
modern capitalism and the emergence of sophisticated bodies of theoretical
science. If arguments based upon the existence of capitalist incentives
and demand forces generally were a sufficient explanation, the full-scale
industrial exploitation of science would have come at a much earlier stage
in Western history. But it did not. Why?

Stripped to its essentials, Marx’s answer is that the handicraft and
manufacturing stages of production lacked the technological basis which
would permit the application of scientific knowledge to the solution of
problems of industrial production.⁶ This essential technological basis

⁶ Since the subsequent discussion turns directly upon the Marxian periodization
scheme, it is important to remind the reader of the meaning which Marx attaches to
the terms “handicraft,” “manufacture,” and “modern industry.” Engels expressed
Marx’s meanings succinctly as follows: “We divide the history of industrial production
since the Middle Ages into three periods: (1) handicraft, small master craftsmen with a
few journeymen and apprentices, where each laborer produces the complete article;
(2) manufacture, where greater numbers of workmen, grouped in one large establishment,
produce the complete article on the principle of division of labor, each workman performing
only one partial operation, so that the product is complete only after having passed
successively through the hands of all; (3) modern industry, where the product is produced
by machinery driven by power, and where the work of the laborer is limited to superintending
and correcting the performances of the mechanical agent” (Engels 1910, pp.
12-13).




emerged only with modern industry. The immense and growing productivity
of nineteenth-century British industry was really, in Marx’s view,
the resultant of three converging sets of forces: (1) the unique incentive
system and capacity for accumulation provided by capitalist institutions;
(2) the availability of bodies of scientific knowledge⁷ which were directly
relevant for problem-solving activities in industry; and (3) a technology
possessing certain special characteristics. It is this last category which is
least understood and to which we therefore now turn.

Historically, capitalist relationships were introduced in an unobtrusive
way, by the mere quantitative expansion in the number of wage-laborers
employed by an individual owner of capital (Marx 1906, p. 367). The
independent handicraftsman, operating with a few journeymen and
apprentices, gradually shifted into the role of a capitalist as his relationship
with these men assumed the form of a permanent system of wage
payment and as the number of such laborers increased.⁸ The system of
manufacture, therefore, while introducing social relationships drastically
different from the handicraft system of the medieval guilds which preceded
it,⁹ initially employed the same technology.

From Marx’s mid-nineteenth-century vantage point, the system of
manufacture had actually been the dominant one throughout most of the
history of capitalism, from “roughly speaking the middle of the 16th
to the last third of the 18th century” (Marx 1906, p. 369; see also p. 787).

⁷ Actually, Marx’s use of the term “science” was sufficiently broad that it included
bodies of systematized knowledge far beyond what we ordinarily mean when we speak
today of pure or even applied science, e.g., engineering and machine building. It was
not a term which he attempted to use with precision. In Theories of Surplus Value, for
instance, he refers to science simply as “the product of mental labour” (Marx 1963, pt. 1,
p. 353).

⁸ “With regard to the mode of production itself, manufacture, in its strict meaning,
is hardly to be distinguished, in its earliest stages, from the handicraft trades of the guilds,
otherwise than by the greater number of workmen simultaneously employed by one and
the same individual capital. The workshop of the medieval master handicraftsman is
simply enlarged” (Marx 1906, p. 353. Cf. Marx and Engels 1947, pp. 12-13).

⁹ “With manufacture was given simultaneously a changed relationship between worker
and employer. In the guilds the patriarchal relationship between journeyman and
master maintained itself; in manufacture its place was taken by the monetary relation
between worker and capitalist -- a relationship which in the countryside and in small
towns retained a patriarchal tinge, but in the larger, the real manufacturing towns,
quite early lost almost all patriarchal complexion” (Marx and Engels 1947, p. 52).

¹⁰ Machinery had sometimes been employed in earlier periods, but Marx clearly
regarded these instances as exceptional. “Early in the manufacturing period the principle
of lessening the necessary labour-time in the production of commodities, was accepted
and formulated: and the use of machines, especially for certain simple first processes
that have to be conducted on a very large scale, and with the application of great force,
sprang up here and there. Thus, at an early period in paper manufacture, the tearing
up of the rags was done by paper mills; and in metal works, the pounding of the ores
was effected by stamping mills. The Roman Empire had handed down the elementary
form of all machinery in the water-wheel” (Marx 1906, p. 382). In a footnote Marx
makes the extremely interesting observation that “the whole history of the development
of machinery can be traced in the history of the corn mill” (ibid., p. 382, n. 3).





Manufacture involved a significant regrouping of workers and a redefinition
of the responsibilities of each. Whereas a medieval handicraftsman
would himself perform a succession of operations upon a product, the
manufacturing system divided up the operation into a succession of steps,
each one of which was allocated to a separate workman.¹¹

The essence of the manufacturing system, therefore, is a growing specialization
on the part of the individual worker. While this in turn has psychological
and social consequences of the greatest importance for the worker
with which Marx was very much concerned,¹² it continued to share with
the earlier handicraft system an essential feature. That is to say, although
the product now passed through a succession of hands, and although this
reorganization raised the productivity of labor, it nevertheless perpetuated
the industrial system’s reliance upon human skills and capacities.¹³
Whereas the critical skill was formerly that of the guild craftsman, it is
now the unremitting repetition of a narrowly defined activity on the part
of the detail laborer. More precisely, the productive process now pressed
against the constraints imposed by the limited strength, speed, precision,
and, indeed, the limited number of limbs, of the human animal.

So long as the worker continues to occupy strategic places in the productive
process, that process is limited by all of his human frailties. And, of
course, the individual capitalist is, in many ways, continually pressing the
worker against those limits. But the point which Marx is making here is of
much broader significance. The application of science to the productive

¹¹ “The needlemaker of the Nuremberg Guild was the cornerstone on which the
English needle manufacture was raised. But while in Nuremberg that single artificer
performed a series of perhaps 20 operations one after another, in England it was not
long before there were 20 needlemakers side by side, each performing one alone of those
20 operations, and in consequence of further experience, each of those 20 operations was
again split up, isolated, and made the exclusive function of a separate workman” (ibid.,
pp. 370-71).

¹² “While simple co-operation leaves the mode of working by the individual for the
most part unchanged, manufacture thoroughly revolutionises it, and seizes labour-power
by its very roots. It converts the labourer into a crippled monstrosity, by forcing his
detail dexterity at the expense of a world of productive capabilities and instincts; just
as in the States of La Plata they butcher a whole beast for the sake of his hide or his
tallow” (ibid., p. 396).

¹³ “For a proper understanding of the division of labour in manufacture, it is essential
that the following points be firmly grasped. First, the decomposition of a process of
production into its various successive steps coincides, here, strictly with the resolution
of a handicraft into its successive manual operations. Whether complex or simple, each
operation has to be done by hand, retains the character of a handicraft, and is therefore
dependent on the strength, skill, quickness, and sureness, of the individual workman in
handling his tools. The handicraft continues to be the basis. This narrow technical basis
excludes a really scientific analysis of any definite process of industrial production, since
it is still a condition that each detail process gone through by the product must be capable
of being done by hand and of forming, in its way, a separate handicraft. It is just because
handicraft skill continues, in this way, to be the foundation of the process of production
that each workman becomes exclusively assigned to a partial function, and that for the
rest of his life, his labour-power is turned into the organ of this detail function” (ibid.,
pp. 371-72).





process involves dealing with impersonal laws of nature and freeing itself
from all dependence upon the organic. It involves calculations concerning
the behavior of natural phenomena. It involves the exploitation of reliable
physical relationships which have been established by scientific disciplines.
It involves a degree of predictability of a purely objective sort, from which
the uncertainties and subjectivities of human behavior have been systematically
excluded. Science, in short, can only incorporate its findings in
impersonal machinery. It cannot be incorporated in human beings with
their individual volitions, idiosyncrasies, and refractory temperaments.
The manufacturing period shared with the earlier handicraft system the
essential feature that it was a tool-using economy where the tools were
subject to human manipulation and guidance. It is this element of human
control, the continued reliance upon the limited range of activities of the
human hand, and not the nature of the power source, Marx insists, which
is decisive in distinguishing a machine from a tool:

The machine proper is a mechanism that, after being set in
motion, performs with its tools the same operations that were
formerly done by the workman with similar tools. Whether the
motive power is derived from man, or from some other machine,
makes no difference in this respect. From the moment that the
tool proper is taken from man, and fitted into a mechanism, a
machine takes the place of a mere implement. The difference
strikes one at once, even in those cases where man himself continues
to be the prime mover. The number of implements that
he himself can use simultaneously, is limited by the number of
his own natural instruments of production, by the number of
his bodily organs. The number of tools that a machine can
bring into play simultaneously, is from the very first emancipated
from the organic limits that hedge in the tools of a handicraftsman.¹⁴


III

What, then, is the distinctive technological feature of modern industry?
It is that, for the first time, the design of the productive process is carried
out on a basis where the characteristics of the worker and his physical

¹⁴ Ibid., p. 406; see also p. 410. In his early work, The Poverty of Philosophy, Marx had
stated: “The machine is a unification of the instruments of labour, and by no means a
combination of different operations for the worker himself. ‘When, by the division of
labour, each particular operation has been simplified to the use of a single instrument,
the linking-up of all these instruments, set in motion by a single engine, constitutes
a machine.’ (Babbage, Traité sur l’Economie des Machines, etc., Paris, 1833). Simple tools;
accumulation of tools; composite tools; setting in motion of a composite tool by a single
hand engine, by men; setting in motion of these instruments by natural forces, machines;
system of machines having one motor; system of machines having one automatic motor:
this is the progress of machinery” (Marx, n.d., pp. 132-33. This book was first published
in 1847).





endowment are no longer central to the organization and arrangement of
capital. Rather, capital is being designed in accordance with a completely
different logic, a logic which explicitly incorporates principles of science
and engineering.¹⁵ The subjectivity of a technology adapted, out of necessity,
to the capacities (or, better, the debilities) of the worker is rejected in
favor of the objectivity of machinery which has been designed in accordance
with its own laws and the laws of science:

In Manufacture it is the workmen who, with their manual
implements, must, either singly or in groups, carry on each
particular detail process. If, on the one hand, the workman
becomes adapted to the process, on the other, the process was
previously made suitable to the workman. This subjective principle
of the division of labour no longer exists in production by
machinery. Here, the process as a whole is examined objectively,
in itself, that is to say, without regard to the question of its execution
by human hands, it is analysed into its constituent phases;
and the problem, how to execute each detail process, and bind
them all into a whole, is solved by the aid of machines, chemistry,
etc.¹⁶

The shift from the hand-operated to the machine-operated process is a
momentous one, for the simple reason that machine processes are susceptible
to continuous and indefinite improvement, whereas hand processes
are not.¹⁷ The factory system makes possible the virtual routinization of
productivity improvement. By breaking down the productive process

There is an important learning experience at the technological level before this
can be done well. "It is only after considerable development of the science of mechanics,
and accumulated practical experience, that the form of a machine becomes settled
entirely in accordance with mechanical principles, and emancipated from the traditional
form of the tool that gave rise to it" (Marx 1906, p. 418, n. 1). A typical aspect of the
innovation process, therefore, is that machines go through a substantial process of
modification after their first introduction (see ibid., p. 442).

Ibid., pp. 414-15. Later, Marx adds: "The implements of labour, in the form of
machinery, necessitate the substitution of natural forces for human force, and the conscious
application of science, instead of rule of thumb. In Manufacture, the organization of the
social labour-process is purely subjective, it is a combination of detail labourers, in its
machinery system, Modern Industry has a productive organism that is purely objective,
in which the labourer becomes a mere appendage to an already existing material condition
of production" (p. 421).

"As soon as a machine executes, without man's help, all the movements requisite
to elaborate the raw material, needing only attendance from him, we have an automatic
system of machinery, and one that is susceptible of constant improvement in its details"
(ibid., p. 416).

In a valuable article, "Karl Marx and the Industrial Revolution," Paul Sweezy
argues that many of the important differences between Marx and his classical
predecessors reduce to the fact that the classical economists "took as their model an economy
based on manufacture, which is an essentially conservative and change-resistant economic
order, while Marx, recognizing and making full allowance for the profound
transformation effected by the industrial revolution, took as his model an economy based on
modern machine industry" (Sweezy 1968, p. 115).





into objectively identifiable component parts, it creates a structure of
activities which is readily amenable to rigorous analysis. "The principle,
carried out in the factory system, of analysing the process of production
into its constituent phases, and of solving the problems thus proposed by
the application of mechanics, of chemistry, and of the whole range of the
natural sciences, becomes the determining principle everywhere." Thus,
historical development has brought technology to a point where it has
become, for the first time, an object of scientific analysis and improvement.

A characteristic feature is, that, even down into the eighteenth
century, the different trades were called "mysteries" (mystères),
into their secrets none but those duly initiated could penetrate.
Modern Industry rent the veil that concealed from men their
own social process of production, and that turned the various,
spontaneously divided branches of production into so many
riddles, not only to outsiders, but even to the initiated. The
principle which it pursued, of resolving each process into its
constituent movements, without regard to their possible execution by
the hand of man, created the new modern science of technology.
The varied, apparently unconnected, and petrified forms of the
industrial processes now resolved themselves into so many
conscious and systematic applications of natural science to the
attainment of given useful effects. Technology also discovered
the few main fundamental forms of motion, which, despite the
diversity of the instruments used, are necessarily taken by every
productive action of the human body, just as the science of
mechanics sees in the most complicated machinery nothing but
the continual repetition of the simple mechanical powers.

Modern Industry never looks upon and treats the existing
form of a process as final. The technical basis of that industry is
therefore revolutionary, while all earlier modes of production
were essentially conservative.

In its most advanced form, therefore, "modern industry makes
science a productive force distinct from labour and presses it into the
service of capital" (Marx 1906, p. 397).

Before capitalism could reach this stage of self-sustaining technological 

Marx 1906, p. 504. The manufacturing stage needs to be seen as an essential step
in the introduction of science into the productive process. The application of science
required that productive activity be broken down into a series of separately analyzable
steps. The manufacturing system, even though it continued to rely upon human skills,
accomplished precisely this when it replaced the handicraftsman with a number of detail
laborers. In this important sense it "set the stage" for the advent of modern industry.

Ibid., p. 532. Marx (1959) examines the vast possibilities for capital-saving
innovations and improvements in an advanced capitalist economy in Capital, vol. 3,
chaps. 4 and 5.




dynamism, however, another critical condition needed to be fulfilled.
Machinery cannot fully liberate the economy from the output ceiling
imposed by dependence upon human skills and capacities so long as these
things continue to be essential in the production of the machines themselves.
In the early stages of modern industry, machines were, inevitably,
produced by direct reliance upon human skills and capacities. The
manufacturing system responded to the demand for the new inventions by
creating new worker specializations. While this sufficed in the early
stages of the development of modern industry, improvements in machine
design and performance and increasing size eventually came up increasingly
against the limitations of the human machine maker.

Modern Industry was crippled in its complete development, so
long as its characteristic instrument of production, the machine,
owed its existence to personal strength and personal skill, and
depended on the muscular development, the keenness of sight,
and the cunning of hand, with which the detail workmen in
manufactures and the manual labourers in handicrafts, wielded
their dwarfish implements. Thus, apart from the dearness of the
machines made in this way, a circumstance that is ever present
to the mind of the capitalist, the expansion of industries carried
on by means of machinery, and the invasion by machinery of
fresh branches of production, were dependent on the growth of a
class of workmen, who, owing to the almost artistic nature of
their employment, could increase their numbers only gradually,
and not by leaps and bounds. But besides this, at a certain stage
of its development, Modern Industry became technologically
incompatible with the basis furnished for it by handicraft and
Manufacture. The increasing size of the prime movers, of the
transmitting mechanism, and of the machines proper, the
greater complication, multiformity and regularity of the details
of these machines, as they more and more departed from the
model of those originally made by manual labour, and acquired
a form, untrammelled except by the conditions under which they
worked, the perfecting of the automatic system, and the use,
every day more unavoidable, of a more refractory material, such
as iron instead of wood: the solution of all these problems,
which sprang up by the force of circumstances, everywhere met

"As inventions increased in number, and the demand for the newly discovered
machines grew larger, the machine-making industry split up, more and more, into
numerous independent branches, and division of labour in these manufactures was more
and more developed. Here, then, we see in Manufacture the immediate technical
foundation of Modern Industry. Manufacture produced the machinery, by means of
which Modern Industry abolished the handicraft and manufacturing systems in those
spheres of production that it first seized upon" (Marx 1906, p. 417).





with a stumbling-block in the personal restrictions which even
the collective labourer of Manufacture could not break through,
except to a limited extent. Such machines as the modern hydraulic
press, the modern powerloom, and the modern carding
engine, could never have been furnished by Manufacture.

The vital step, therefore, is the establishment of the technological
conditions which would make it possible to use machinery in the construction
of machines, thus bypassing the central constraint of the old manufacturing
system. "Modern Industry had therefore itself to take in hand the
machine, its characteristic instrument of production, and to construct
machines by machines. It was not till it did this, that it built up for itself
a fitting technical foundation, and stood on its own feet. Machinery,
simultaneously with the increasing use of it, in the first decades of this
century, appropriated, by degrees, the fabrication of machines proper."
Marx singles out, not only the new power sources which offered gigantic
quantities of energy subject to careful human regulation, but also that
indispensable addition to the equipment at the disposal of the machine
maker, the slide rest. This simple but ingenious device of Henry Maudslay
replaces, as Marx perceptively notes, not any particular tool, "but the
hand itself" (Marx 1906, p. 408). In this sense it is a strategic technological
breakthrough, fully comparable in importance to the steam engine.

The improvements in the machinery-producing sector constitute a
quantum leap in the technological arsenal at man's disposal. They make
it possible to escape the physical limitations of a tool-using culture. They
do this, ironically as Marx points out, by providing machines which
reproduce the actions of a hand-operated tool, but do so on a "cyclopean
scale."

Ibid., pp. 417-18. Marx saw the improvements in the means of communication
and transportation as particularly significant in pushing the productive process beyond
the limitations inherent in the manufacturing system. "The means of communication
and transport became gradually adapted to the modes of production of mechanical
industry, by the creation of a system of river steamers, railways, ocean steamers, and
telegraphs. But the huge masses of iron that had now to be forged, to be welded, to be
cut, to be bored, and to be shaped, demanded, on their part, cyclopean machines, for
the construction of which the methods of the manufacturing period were utterly
inadequate" (pp. 419-20).

Ibid., p. 420. Marx saw this process as culminating during his own time. "It is only
during the last 15 years (i.e., since about 1850), that a constantly increasing portion of
these machine tools have been made in England by machinery, and that not by the same
manufacturers who make the machines" (p. 408).

"If we now fix our attention on that portion of the machinery employed in the
construction of machines, which constitutes the operating tool, we find the manual
implements reappearing, but on a cyclopean scale. The operating part of the boring
machine is an immense drill driven by a steam-engine; without this machine, on the
other hand, the cylinders of large steam-engines and of hydraulic presses could not be
made. The mechanical lathe is only a cyclopean reproduction of the ordinary footlathe,
the planing machine, an iron carpenter, that works on iron with the same tools that the
human carpenter employs on wood; the instrument that, on the London wharves, cuts




IV 




Thus, I would interpret the Marxian position to be that it is the changing
requirements of industry and the altering perception of economic needs
which provide the stimulus to the pursuit of specific forms of scientific
knowledge. But I would also conclude that the Marxian position cannot
be adequately described as a demand-induced approach without doing a
severe injustice to the subtlety of Marx's historical analysis. For the
ability to apply science to the productive sphere turns upon industry's
changing capacity to utilize such knowledge, a capacity which Marx explicitly
recognizes has been subjected to great changes over the course of
recent history. Indeed, Marx himself, as I have tried to establish, devoted
considerable effort to the elucidation of the factors which have shaped
society's altering capacity to absorb the fruits of scientific knowledge.
Nor did Marx argue that the historical sequence in which scientific
disciplines actually developed was also directly determined by economic
needs. For example, in discussing the relative pace of development in
industry and agriculture, he states that productivity growth in agriculture
had, historically, to await the development of certain scientific disciplines,
and therefore came later, whereas industry progressed more rapidly than
agriculture at least in large part because the scientific knowledge upon


the veneers, is a gigantic razor, the tool of the shearing machine, which shears iron as
easily as a tailor's scissors cut cloth, is a monster pair of scissors, and the steam hammer
works with an ordinary hammer head, but of such a weight that not Thor himself could
wield it. These steam hammers are an invention of Nasmyth, and there is one that weighs
over 6 tons and strikes with a vertical fall of 7 feet, on an anvil weighing 36 tons. It is
mere child's play for it to crush a block of granite into powder, yet it is not less capable
of driving, with a succession of light taps, a nail into a piece of soft wood" (ibid., p. 421;
see also pp. 492-93).

At one point Marx presents what one might be tempted to call a Toynbeean
"challenge-response" mechanism to account for the emergence of high productivity
societies. It is not true, he says, "that the most fruitful soil is the most fitted for the growth
of the capitalist mode of production. This mode is based on the dominion of man over
nature. Where nature is too lavish, she 'keeps him in hand, like a child in leading-strings.'
She does not impose upon him any necessity to develop himself. It is not the tropics with
their luxuriant vegetation, but the temperate zone, that is the mother country of capital.
It is not the mere fertility of the soil, but the differentiation of the soil, the variety of its
natural products, the changes of the seasons, which form the physical basis for the social
division of labour, and which, by changes in the natural surroundings, spur man on to
the multiplication of his wants, his capabilities, his means and modes of labour. It is the
necessity of bringing a natural force under the control of society, of economising, of
appropriating or subduing it on a large scale by the work of man's hand, that first plays
the decisive part in the history of industry" (ibid., pp. 563-64).

In this light, there is no necessary conflict between Marx's materialist conception
of history and his treatment of science as a productive force under advanced capitalism.
I therefore disagree with the following statement of Bober: "Marx intends to offer a
materialistic conception of history. Yet he frequently stresses the power of science as a
component of modern technique and production. The incorporation of science in the
foundation of his theory is no more defensible than the inclusion of all other nonmaterial
phenomena" (Bober 1965, p. 21).




which industry relied had developed earlier. "Mechanics, the really
scientific basis of large-scale industry, had reached a certain degree of
perfection during the eighteenth century. The development of chemistry,
geology and physiology, the sciences that directly form the specific basis of
agriculture rather than of industry, does not take place till the nineteenth
century and especially the later decades."

This strongly suggests at least some degree of independence and autonomy
on the part of science in shaping the sequence of industrial change,
in spite of the fact that, as we saw earlier, Marx and Engels usually
emphasize the cause-effect relationships which run from industry to
science. If the growth in agricultural productivity is dependent upon
progress in specific subdisciplines of science, and if the existence of
profitable commercial opportunities in agriculture cannot "induce" the
production of the requisite knowledge, then factors internal to the realm
of science must be conceded to play a role independent of economic needs.
Moreover, it is especially curious to find that Engels is content to
state, as quoted earlier, that "from the very beginning the origin and
development of the sciences has been determined by production" (Engels
1954, p. 247). For Engels himself, in the Dialectics of Nature, had also
presented a classification scheme for the sciences which emphasized a
hierarchy of increasing complexity based upon the forms of motion of
the matter being analyzed. Increasing complexity is identified with the
movement from the inorganic to the organic, from mechanics to physics
to chemistry to biology. Engels even goes so far as to speak of an

Marx 1968, pt. 2, p. 110. In The German Ideology Marx and Engels stated that "the
science of mechanics perfected by Newton was altogether the most popular science in
France and England in the eighteenth century" (Marx and Engels 1947, p. 56).

"Hegel's division (the original one) into mechanics, chemics, and organics, fully
adequate for the time. Mechanics: the movement of masses. Chemics: molecular (for
physics is also included in this and, indeed, both, physics as well as chemistry, belong
to the same order) motion and atomic motion. Organics: the motion of bodies in which
the two are inseparable. For the organism is certainly the higher unity which within itself
unites mechanics, physics, and chemistry into a whole where the trinity can no longer be separated.
In the organism, mechanical motion is effected directly by physical and chemical change,
in the form of nutrition, respiration, secretion, etc., just as much as pure muscular
movement" (Engels 1954, pp. 331-32, emphasis Engels's). For Engels's entire treatment
of the subject, see ibid., pp. 322-408. In his book, Herr Eugen Dühring's Revolution in
Science, Engels draws a sharp distinction between the sciences concerned with inanimate
nature and those concerned with living organisms. The former group of sciences
(mathematics, astronomy, mechanics, physics, chemistry) are susceptible to mathematical
treatment "to a greater or less degree." No such precision is possible in the sciences
concerned with living organisms. "In this field there is such a multitude of reciprocal
relations and causalities that not only does the solution of each question give rise to a
host of other questions, but each separate problem can usually only be solved piecemeal,
through a series of investigations which often requires centuries to complete, and even
then the need for a systematic presentation of the interrelations makes it necessary again
and again to surround the final and ultimate truths with a luxuriant growth of
hypotheses" (Engels 1939, pp. 97-99).





"inherent sequence," which he clearly believes has structured the
historical sequence in which nature's secrets have been progressively
uncovered. But, if one accepts this intuitively plausible view, then surely
there is much more to "the origin and development of the sciences" than
can be accounted for by the specific demands being generated in the
productive sphere. Surely the historical fact that the biological sciences
came to the assistance of agriculture long after the mechanical sciences
were being utilized by industry is a sequence originating, not in economic
needs, but in the differing degrees of complexity of these scientific
disciplines. Engels's formulations particularly seem to overemphasize the
importance of demand-induced incentives to the neglect of supply side
considerations, even though he is obviously sensitive to these supply
variables in other contexts.

In Engels's defense one must recall, of course, the unfinished, indeed
often merely fragmentary condition of his Dialectics of Nature. It is
entirely possible that, had he the opportunity, he would have resolved these
apparent inconsistencies. But it is expecting far too much to look to either
Marx or Engels for the resolution of these deep and thorny problems.
We are still, today, a long way from being able to incorporate the history
of science in an orderly manner into our understanding of the economic
development of the Western world.


Conclusion

There are several possible meanings which can be attached to the statement
that "the origin and development of the sciences has been determined
by production."

1. Science depends upon industry for financial support.

2. The expectation of high financial returns is what motivates individuals
(and society) to pursue a particular scientific problem.

3. The needs of industry serve as a powerful agent in calling attention
to certain problems (Pasteur's studies of fermentation and silkworm
epidemics).

"Classification of the sciences, each of which analyzes a single form of motion, or a
series of forms of motion that belong together and pass into one another, is therefore the
classification, the arrangement, of these forms of motion themselves according to their
inherent sequence, and herein lies its importance" (Engels 1954, p. 330; see also
Zvonkine 1963, pp. 59-74).

See Engels 1954, "Preface."

The most ambitious attempt to fill this void is the fascinating but seriously flawed
four-volume work by the late J. D. Bernal, Science in History (1971). His Science and Industry
in the Nineteenth Century (London, 1953) is more restricted in scope and far more
consistently persuasive. Nevertheless, Science in History displays an immense erudition, and all
but the most remarkably well-informed readers will learn much from, and be greatly
stimulated by, its contents.




4. The normal pursuit of productive activities throws up physical
evidence of great importance to certain disciplines (metallurgy and
chemistry, canal building and geology). As a result, industrial activities
have, as a byproduct of their operation, provided the flow of raw
observations upon which sciences have built and generalized.

5. The history of individual sciences, including an account of their
varying rates of progress at different periods in history, can be adequately
provided by an understanding of the changing economic needs of
society.

I believe that Marx and Engels subscribed to propositions 1-4 without
qualification. I believe they often sounded as if they subscribed to the fifth
proposition. However, I think the preceding discussion has established
that they subscribed to the fifth proposition only subject to certain
qualifications, qualifications which strike me as being, collectively, more
interesting than the original proposition.

References 

Bernal, J. D. Science and Industry in the Nineteenth Century. London: Routledge &
Kegan Paul, 1953.

------. Science in History. 4 vols. Cambridge, Mass.: MIT Press, 1971.

Bober, M. M. Karl Marx's Interpretation of History. New York: Norton, 1965.

Engels, Frederick. Socialism: Utopian and Scientific. Chicago: Kerr, 1910.

------. Herr Eugen Dühring's Revolution in Science. New York: International, 1939.

------. The Dialectics of Nature. Moscow: Foreign Languages, 1954.

Marx, Karl. A Contribution to the Critique of Political Economy. Chicago: Kerr, 1904.

------. Capital. Vol. 1. Chicago: Kerr, 1906.

------. Capital. Vol. 3. Moscow: Foreign Languages, 1959.

------. Theories of Surplus Value. 3 pts. Moscow: Progress, 1963.

------. The Poverty of Philosophy. Moscow: Foreign Languages, n.d.

Marx, Karl, and Engels, Frederick. The German Ideology. New York: International,
1947.

------. Selected Works. 2 vols. Moscow: Foreign Languages, 1951.

Sweezy, Paul. "Karl Marx and the Industrial Revolution." In Events, Ideology and
Economic Theory, edited by Robert Eagly. Detroit: Wayne State Univ. Press,
1968.

Zvonkine, A. "Technology and the Laws of its Development." In The Technological
Order, edited by Carl F. Stover. Detroit: Wayne State Univ. Press, 1963.



Advertising as Information 


Phillip Nelson 

State University of New York at Binghamton and the University of Chicago


This haughty youth, He speaks the truth
Whenever he finds it pays,

And in this case, it all took place
Exactly as he says!

Exactly, exactly, exactly, exactly as he says!

[Gilbert and Sullivan, The Mikado]


This paper tries to show how the major features of the behavior of
advertising can be explained by advertising's information function. For
search qualities advertising provides direct information about the
characteristics of a brand. For experience qualities the most important
information conveyed by advertising is simply that the brand advertises. This
contrast in advertising by these qualities leads to significant differences
in its behavior.


How does advertising provide information to the consumer? The producer
in his advertising is not interested directly in providing information for
consumers. He is interested in selling more of his product. Subject to a
few constraints, the advertising message says anything the seller of a brand
wishes. A mechanism is required to make the selling job of advertising
generate information to the consumer.

The first section of this paper shows how such a mechanism exists and
how its operation varies by types of market. The middle three sections of
the paper try to put the ideas of the first section to the test. The first of
these empirical sections uses the model to predict the way in which
advertising/sales ratios will differ by industry. The second section predicts how
media choice will vary by industry; the third explores the impact of
alternative sources of information on advertising volume. The last section
of this paper examines the consequences of relaxing two assumptions made
in the rest of my analysis: that consumers use optimal decision rules and
that there are no legal constraints on advertising.

Helpful criticisms of an earlier draft of this article were received from Jerome
Komisar, Lester Telser, Robert Van Handel, and the participants of a seminar at the
University of Rochester and the Conference on the Economics of Information at the
University of Chicago. Financial assistance was received from the Research Foundation
of the State University of New York.

Information is generated by advertising because of consumer power in
the product market. The nature of that power will vary significantly by the
nature of the product being advertised. In "Information and Consumer
Behavior" (Nelson 1970), I make a fundamental distinction between
qualities of a brand that the consumer can determine by inspection prior
to purchase of the brand ("search qualities") and qualities that are not
determined prior to purchase ("experience qualities"). An example of a
search quality is the style of a dress; an example of an experience quality
is the taste of a brand of canned tuna fish. The impact of the market
power of consumers on advertising will operate quite differently for search
and experience qualities.

If the advertised properties of the product differ from the actual
properties, the consumer will know about that difference prior to purchase in
the case of search qualities. This reduces considerably, but not entirely,
incentives for misleading advertising. Misleading advertising will still
exist. First, consumers pay transportation costs before they spot any
disparity between actual and advertised search qualities. By exaggerating
the utility of the actual search good by any amount up to the transportation
costs of consumers, advertising can result in greater sales to the firm
than a less exaggerated advertisement. Second, there can be some search
quality of a brand which cannot be successfully conveyed by advertising,
for example, the "charm" of an apartment. Under these circumstances it
might pay the advertiser to exaggerate some characteristic that can be
specified by the advertisement. But there are costs to the advertiser of
misleading advertising: he suffers a decline in his credibility for future
advertisements and pays the costs of processing nonbuying customers.
Because of these costs and the relative unimportance of the sources of
misleading advertising, consumers can have some confidence that the
advertising of search qualities bears a close relation to the truth. The
advertising of search qualities provides information to the consumer, even
though he attaches a probability less than one to the truthfulness of these
advertisements.
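
The transportation-cost bound on exaggeration described above can be illustrated with a minimal sketch. The decision rules here are assumptions introduced only for illustration and are not taken from the analysis itself: the consumer travels to a seller when the advertised utility is at least that of a known alternative, and, having already made the trip, buys unless the alternative would still be better after paying for a second trip. All function names and numbers are hypothetical.

```python
def visits(advertised_utility: float, alternative_utility: float) -> bool:
    """Assumed rule: the consumer travels if the advertisement beats the alternative."""
    return advertised_utility >= alternative_utility


def buys_on_arrival(actual_utility: float,
                    alternative_utility: float,
                    transport_cost: float) -> bool:
    """Assumed rule: once at the seller, switching to the alternative costs another trip."""
    return actual_utility >= alternative_utility - transport_cost


def sale(actual_utility: float,
         exaggeration: float,
         alternative_utility: float,
         transport_cost: float) -> bool:
    """A sale needs both a visit (driven by the ad) and a purchase (driven by the facts)."""
    advertised = actual_utility + exaggeration
    return (visits(advertised, alternative_utility)
            and buys_on_arrival(actual_utility, alternative_utility, transport_cost))


if __name__ == "__main__":
    # Alternative is worth 10; one trip costs 2 (hypothetical numbers).
    print(sale(9.0, 0.0, 10.0, 2.0))   # honest ad: no visit, so no sale
    print(sale(9.0, 1.5, 10.0, 2.0))   # exaggeration below the trip cost: sale
    print(sale(7.0, 3.0, 10.0, 2.0))   # exaggeration above the trip cost: visit, no sale
```

Under these assumed rules, an exaggeration no larger than the trip cost can win a sale that an honest advertisement would lose, while a larger exaggeration only brings in a customer who then declines to buy, which is the nonbuying customer the advertiser must pay to process.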

In the case of experience qualities the consumer's power over advertising
is much less potent than his power over search qualities. The major
control that consumers have over the market for experience qualities is
whether they repeat the purchase of a brand or not (Nelson 1970). This
power is sufficient to authenticate any statement that has either of the



following characteristics: (1) consumers' belief in the truth of that statement
does not increase the profit from initial sales (as opposed to repeat
sales either directly or indirectly through the recommendations of relatives
and friends) or (2) it is possible for the producer at no cost to himself to
make the advertised statement true rather than false. If either (1) or (2)
holds, the advertiser maximizes his profits by telling the truth if the
consumer believes he is doing so.¹

One of these two requirements will be satisfied by virtually all statements
that specify the function of a brand, for example, "Pepto-Bismol is a
remedy for upset stomachs." This statement turns customers with athlete's
foot away as well as attracting customers with bellyaches. Of course, it
is conceivable that there are more of the latter (relative to the available
number of brands) than the former, so that there is a net gain of initial
sales from this statement. But this is where condition (2) comes into play.
If this were the case, it would pay the manufacturer of Pepto-Bismol to
produce at the same costs something that resembled a stomach remedy
rather than something that could be used for athlete's foot. It is
conceivable that at that cost the firm could only produce a second-rate stomach
remedy and could produce a first-rate foot balm. But if everybody is
buying Pepto-Bismol for upset stomachs rather than for athlete's foot, any
virtues of Pepto-Bismol for athlete's foot would be irrelevant in
determining Pepto-Bismol sales.

Even when the advertisement of an experience quality correctly relates
brand to function (by far the typical case), the consumer has quite
incomplete information. He would like to be able to rank stomach remedies
by their utility to him. Advertising provides no direct information that
will help him do that job. (By direct information I mean information
contained in the advertising statement.) After Pepto-Bismol has been
correctly identified as a stomach remedy, the statement that Pepto-
¹ This proposition can be seen most clearly in the case of constant average production
costs ($AC$). In that case, profit can be written

$$P_f = (P - AC)I + (P - AC)R - A, \tag{10}$$

where $P_f$ = profits, $P$ = price, $I$ = initial sales, $R$ = repeat sales, $A$ = advertising
costs, and $P(R)$ = probability of a repeat purchase given an initial purchase. Advertising
costs are assumed constant, since they should be invariant with respect to the truth or
falsity of the advertising message. Then false advertising will not increase $P_f$:

$$P_f = (P - AC)I + (P - AC)I \, P(R) - A. \tag{11}$$

If $(P - AC)I$ is constant, profits increase with the probability of a repeat purchase,
and this probability increases with the truthfulness of an advertisement. $(P - AC)I$ will
be a constant over the relevant decision variable whenever either condition (1) or (2)
holds. If condition (1) holds, $(P - AC)I$ is a constant varying the truthfulness of the
statement but holding the character of the product constant. If condition (2) holds,
$(P - AC)I$ is a constant holding the statement constant but varying the character of the
product.





Bismol is most soothing is without information content. Its producers have
an incentive to say so even if it were the least soothing of stomach remedies.

The minuscule amount of direct information from advertising for
experience qualities gives the consumer an incentive to extract any
conceivable indirect information that would help. Such indirect information
is available from advertising. The consumer can learn that the brand
advertises. I contend that this is the useful information that the consumer
absorbs from the endorsements of announcers, actors, and others who are
paid for their encomiums. These and other advertisements for experience
goods have no informational content. Their total informational role,
beyond the relation of brand to function, is simply contained in their
existence. The consumer believes that the more a brand advertises, the
more likely it is to be a better buy. In consequence, the more advertisements
of a brand the consumer encounters, the more likely he is to try
the brand.

Why are advertised brands better buys? The answer to that question
is not self-evident. If advertising were distributed at random among
brands for a product, it would indeed be newsworthy that a brand
advertised, but that news would decrease the probability of a consumer’s trying
the advertised brand. Advertised brands would cost more by the cost of
the advertising and hence would be worse buys.

It is my contention, however, that advertising is not distributed at
random among brands of a product. Heavily advertised brands are likely
to provide a lower P* (price per unit of utility of the brand)² to the
average consumer than less heavily advertised brands of the same product.
(To give unambiguous meaning to that measure, I assume that consumers
all have the same income and utility functions.)

First of all, firms vary in their efficiency in producing the utility that
consumers seek. Some firms produce brands that yield more utility to the
consumer for a dollar of production cost than do other brands. In general,
a firm that has lower costs relative to the utility of its brand than other
firms will find that it pays to expand its output by both increasing
advertising expenditures and decreasing P*. This behavior of firms by efficiency
generates a negative association between advertising and P* by brands.

Two crucial propositions are contained in the previous paragraph.
The first proposition (that to sell more the firm will offer a lower P*)
obviously holds for the usual demand curve, which is negatively sloped.
More critically, this proposition also holds for the relationship between
demand and price when advertising expenditures adjust optimally to

² This definition of P* is an attempt to obtain a utility-adjusted price. For simple
additive utility functions, does the job. For other utility functions, different definitions
of P* might be required. For purposes of my analysis this variation in the definition of P*
is not important as long as a utility adjustment of price is possible.





price. Even though this demand curve can have a positively sloped
portion (Demsetz 1959), Schmalensee’s (1972) analysis implies that even
this demand curve will be declining where it counts: in the neighborhood
of equilibrium.³ The proposition that there is a reverse association
between quantity produced and P* can be put to the test. If this proposition
were true, it would pay a brand to advertise its rank in its product class
more, the higher that rank. Do leading firms advertise their significance
more than nonleading firms advertise their lack of leadership? To answer
that question, I looked at three issues of Life magazine for March 1972
(Life 1972). I found six brands claiming that they were the largest brands
for their respective product classes. I found none advertising a lower rank.
There are far fewer brands that are first in their product class than are not
first (even confining ourselves to the sample of brands that advertise in
three issues of Life). If brands advertised their rank in their product class
independently of what that rank happened to be, the probability of an
advertisement which advertised rank advertising that the rank is first
would be far less than .5. In consequence the probability of our observed
results occurring by chance is less than one sixty-fourth.⁴
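
The arithmetic behind the one sixty-fourth bound can be sketched as follows. This is only an illustration under two assumptions that the text implies but does not spell out: the six rank advertisements are treated as independent, and .5 is used as an upper bound on the chance that any one of them would claim first place. The function name is hypothetical.

```python
from fractions import Fraction

def prob_all_claim_first(n_ads: int, p_first: Fraction) -> Fraction:
    """Chance that n_ads independent rank advertisements all claim first place."""
    return p_first ** n_ads

# Six brands advertised a rank, and all six claimed to be the largest.
# With p_first bounded above by 1/2, the joint probability is bounded by (1/2)^6.
upper_bound = prob_all_claim_first(6, Fraction(1, 2))
print(upper_bound)  # 1/64
```

Because the true probability for each advertisement is said to be far less than .5, the actual chance probability would be smaller still.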

The second proposition (that the firm that wishes to sell more will
advertise more) also has a strong a priori defense. Advertising and −ΔP*
are the two inputs yielding higher quantity demanded. From production
theory one anticipates that if the “price” of the two inputs were constant,
an increase in the output is usually most profitably achieved by an increase
in both inputs. But in this case the “price” of the inputs does not remain
constant as one increases quantity. Quantity is the “price” of −ΔP*; that
is, the loss to the firm in lowering price to increase quantity demanded is
proporUonal to that quantity In consequence, one would expect adver- 

³ Using Demsetz’s (1959) terminology, the average revenue curve allowing advertising
to adjust optimally to a change in price is the mutatis mutandis average revenue curve
(MAR). For MAR to be rising or constant, MMR ≥ MAR, where MMR = marginal
mutatis mutandis revenue curve. For firms to have positive or zero profits, MAR ≥ AC,
where AC = average production and advertising costs and MC = marginal production
and advertising costs; but MMR = MC. Hence, MC ≥ AC, which implies that AC is
rising or constant. But Schmalensee (1972) has shown that in equilibrium AC must be
decreasing. Hence, MAR must be decreasing in equilibrium.

⁴ This test would be biased if brands that were not number one advertised themselves
as major producers more than those brands which are number one. But this bias is
produced by the very proposition I am testing, a proposition that would explain why
number one brands choose more explicit rank advertising than others. A more serious
problem with this approach is that the data do not support another implication of
rational consumer behavior: that firms that advertise the most would advertise that fact.
I think the primary reason that the quantity of advertisement is not advertised is that
advertising has been so frequently attacked that it is in disrepute. These attacks make it
difficult for consumers to understand the ultimate rationalization of their own behavior.
(This problem is discussed in more detail in the “Deceptive Advertising” section of this
paper.) In consequence, brands do not advertise their advertising rank, though they do
advertise that they have advertised, e.g., “As Advertised in Life.”




tising expenditures to increase because both output increases and the 
price of the other factor of production increases. The evidence is
overwhelming that advertising increases as sales increase.

There are other reasons for a consumer to respond positively to
advertising. What is a high-utility brand for some consumers will be a
low-utility brand for others. Advertising has the seemingly magical property
that those whose tastes are best served by a given brand are those most
likely to see an advertisement for that brand. Advertisers choose the
media in which they advertise in part to maximize the repeat-purchase
probability for their brand. In consequence the producer distributes his
advertisements among media so that his message is seen by those who are
most likely to repeat purchase his brand. An esoteric, high-price soup gets
advertised in the New Yorker, while Campbell’s soup displays its wares in
Good Housekeeping.

A third factor operates to increase the reputability of advertised brands.
The full analysis of this factor is consigned to the Appendix, but the
essence of the argument is simple enough. Advertising increases the
probability of a consumer’s remembering the name of a brand. Those brands
with the highest probability of repeat purchase have the greatest payoff
to improved consumer memory. In consequence, brands which provide
the highest utility have the greatest incentive to advertise.

A feature of the previous discussion must be emphasized. Nowhere in
that analysis was it necessary to assume that consumers respond positively
to advertising because the brands that advertised the more heavily were
better buys. The self-interest of consumers to respond to advertising only
if it increased their utility produces an additional guarantee that highly
advertised products will provide higher utility to the consumer. Suppose
there were some products for which firm self-interest dictated a negative
relationship between advertising and utility per dollar. Consumer
self-interest would make the consumers respond negatively to advertising for
these products. In consequence, firms would not advertise such products.

In this analysis of the indirect information contained in advertising, the
focus has been on experience qualities. The same analysis can be used to
show that advertising increases the reputability of brands which are
dominantly search goods. However, reputability will play a much smaller role
in the advertising for search qualities, because the consumer can obtain
so much direct information about these qualities from advertisements and
direct inspection. The possession of this direct information reduces the
payoff to both consumers and advertisers of advertisements’ increasing
the reputability of a brand. In my subsequent analysis I shall assume that
this difference between the character of advertising for search and
experience qualities is so large that the advertising for experience qualities
is dominantly indirect information and the advertising for search qualities
is dominantly direct information.




Marginal Revenue of Advertising to the Producer

To maximize profits a producer will advertise to the point where his
marginal revenue of advertising is equal to his marginal cost of
advertising. One expects differences in the behavior of this marginal revenue for
search and experience qualities. This difference in turn will produce a
differential effect on the quantity of advertising purchased by producers
of search and experience qualities.

To demonstrate this effect one needs a theory to generate the marginal
revenue of advertising. For search qualities, advertising will increase sales
when and only when it gives the consumer information that he did not
have before. I will assume that all the information that advertising
conveys about a search brand is contained in any given message. The revenue
(R) generated by advertising will, therefore, equal

$$R = QPNG_1, \tag{1}$$

where N = the number of potential customers, that is, those who would
buy the brand if confronted with the appropriate signal (in this case, one
or more advertising messages); Q = average quantity purchased per
customer; P = price; and $G_1$ = the proportion of potential customers
with one or more advertising messages. Of the four components of R only
$G_1$ varies by advertising intensity. The behavior of $G_1$ has been examined
by Stigler (1961) and Gould (1970), but there is a problem with their
analysis that makes reexamination necessary.

As Stigler recognizes, there are three relevant processes at work.
Consumers acquire advertising information; they leave and enter the market,
that is, they are mobile; they forget advertising information. Stigler asserts
that the latter two processes produce identical results in the advertising
market, except for possible differences in their intensities. I think he is
wrong. In the mobility case, the most reasonable simple assumption is
that the probability of leaving the market is independent of the number of
advertising messages a consumer knows. In the forgetting case, however,
the simplest reasonable assumption is that the number of messages
forgotten about a brand is directly proportional to the number of messages
known about a brand.

I expect, then, mobility and forgetting to be quite different probability
processes. Optimally, one should combine the two processes in the same
model. Such a model is easy to construct, but I have not been able to
solve the equations that the model generates. Instead I have developed
two different models. In one there is mobility, but no forgetting; in the
other there is forgetting, but no mobility. Because it is simpler and
probably more important, I concentrate on the forgetting model. This is
simply a matter of economizing on space. My results do not change when
I use the mobility model.⁵

⁵ My analysis of the mobility model is available upon request.




In my forgetting model I assume that the rate at which consumers
confront an advertising message (c) is a constant for all consumers. I also
assume that the forgetting rate is directly proportional to the number of
advertising messages a person knows. The differential matrix of this
stochastic process is

$$F = \begin{pmatrix}
-c & c & & & \\
a & -(c + a) & c & & \\
 & 2a & -(c + 2a) & c & \\
 & & 3a & -(c + 3a) & c \\
 & & & \ddots & \ddots
\end{pmatrix}, \tag{2}$$

where the state space is (0, 1, ..., n) advertising messages about a brand
and a is the forgetting rate when one knows one advertising message.

I shall try to abstract out of this stochastic process the relationship
between the present value of revenue and advertising intensity. This
present value of revenue will change over time as the process unfolds.
Of these present values both the simplest and the most useful is the one
calculated after the stochastic process has settled down to a steady state.
Then the present value will merely be the present value of a constant
stream of returns. It is this steady-state condition that I will examine.
As is well known (e.g., Karlin 1966), the steady-state distribution (A)
can be obtained by solving

$$AF = 0. \tag{3}$$



In words, the net movement into any steady state must be zero or it is not
a steady state. The solution is

$$A_i = \frac{e^{-d} d^{i}}{i!}, \qquad i = (0, 1, \ldots, n), \tag{4}$$

where d = c/a. In particular the probability of having one or more
advertising messages is

$$1 - A_0 = 1 - e^{-d}. \tag{5}$$

Equation (5) is $G_1$ in equation (1) and hence is relevant in determining
the revenue from advertising in the case of search.
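
A small numerical check may make the steady-state argument concrete. The sketch below assumes the process described in the text (messages arrive at rate c, and a consumer who knows i messages forgets at rate i·a), builds the rate matrix on a truncated state space, and confirms that Poisson weights with mean d = c/a satisfy the balance condition of equation (3). The parameter values and function names are illustrative, not taken from the paper.

```python
import math

def generator_row(i: int, c: float, a: float, n: int) -> list:
    """Row i of the transition-rate matrix on states 0..n (truncated at n)."""
    row = [0.0] * (n + 1)
    if i > 0:
        row[i - 1] = i * a      # forget one of the i known messages
    if i < n:
        row[i + 1] = c          # confront one more advertising message
    row[i] = -sum(row)          # each row of a rate matrix sums to zero
    return row

def poisson_weight(i: int, d: float) -> float:
    """Candidate steady-state probability of knowing exactly i messages."""
    return math.exp(-d) * d ** i / math.factorial(i)

c, a, n = 3.0, 1.5, 60          # illustrative rates and truncation point
d = c / a
F = [generator_row(i, c, a, n) for i in range(n + 1)]
A = [poisson_weight(i, d) for i in range(n + 1)]

# Balance condition: the row vector A times the matrix F should vanish.
residual = [sum(A[i] * F[i][j] for i in range(n + 1)) for j in range(n + 1)]
max_residual = max(abs(x) for x in residual)

# Proportion of consumers holding one or more messages, as in equation (5).
G1 = 1 - A[0]
print(max_residual, G1)
```

The residual is zero up to floating-point error, and the share of consumers with at least one message equals 1 − e^{−d}.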

However, $G_1$ will not be relevant in determining the revenue from
advertising for experience qualities. I have assumed that the information
the consumer obtains from advertising for experience qualities is that the
brand is advertised and is, hence, more likely to be a better buy. In the
face of experience qualities, a consumer’s optimal strategy is to try a fixed
number of brands, say r, and then continue to use the best of the set
(Nelson 1970). A consumer to whom advertising for experience qualities
provides favorable information will limit his sampling to those brands for




which he has received the most advertising messages. Say he has currently
received m messages for the rth most advertised brand in his own sample
of messages. Assume further that in case of a tie in the number of
advertising messages the consumer receives, the consumer prefers the brand
whose advertising messages he has most recently received.⁶ Then a
consumer will try a brand if he receives m messages about the brand, and
any messages beyond m are redundant.

For experience qualities one needs a more general form of equation
(1). The revenue generated by advertising is

$$R = QPNG_m, \tag{6}$$

where $G_m$ = the proportion of potential customers with m or more
advertising messages, where m is the critical number of advertising messages
required for a consumer to sample a brand. (To put the revenue in
present-value terms, all one need do is multiply the right side of eq. [6] by
1/w, where w equals the discount rate.) For the forgetting model, $G_m$ is

$$G_m = 1 - \sum_{i=0}^{m-1} A_i = 1 - \sum_{i=0}^{m-1} \frac{e^{-d} d^{i}}{i!}. \tag{7}$$


The amount of advertising can be measured by the rate at which
customers acquire advertising information, or c. The marginal revenue
to the producer of advertising is, therefore,

$$MR = QPN \frac{dG_m}{dc}. \tag{8}$$


Assume that QPN are the same for advertising for search and
experience qualities.⁷ The MR will differ between search and experience
qualities because $dG_m/dc$ differs by m. (I will assume, for simplicity, that
all consumers are faced with the same m in the advertising for any given
brand, though this assumption is contrary to fact.)

$$\frac{dG_m}{dc} = \frac{e^{-d} d^{m-1}}{a\,(m - 1)!}. \tag{9}$$
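
For readers who want the intermediate step, equation (9) can be recovered from equation (7) as sketched below. The derivation assumes the Poisson form of the steady-state probabilities, $A_i = e^{-d}d^{i}/i!$ with d = c/a (the form implied by equation [5]); the sum telescopes, leaving only its last term.

```latex
\begin{align*}
\frac{\partial G_m}{\partial d}
  &= -\sum_{i=0}^{m-1}\frac{\partial}{\partial d}\!\left(\frac{e^{-d}d^{i}}{i!}\right)
   = \sum_{i=0}^{m-1}\frac{e^{-d}d^{i}}{i!}
     - \sum_{i=1}^{m-1}\frac{e^{-d}d^{i-1}}{(i-1)!}
   = \frac{e^{-d}d^{m-1}}{(m-1)!}, \\[4pt]
\frac{dG_m}{dc}
  &= \frac{\partial G_m}{\partial d}\cdot\frac{\partial d}{\partial c}
   = \frac{e^{-d}d^{m-1}}{a\,(m-1)!},
  \qquad\text{since } d = \frac{c}{a}.
\end{align*}
```

Setting m = 1 gives $dG_1/dc = e^{-d}/a$, the search-goods case.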


⁶ A more realistic assumption would be that in a case of a tie at m messages, the
probability of the consumer’s sampling the brand is the inverse of the number of tied
brands. But this more realistic assumption increases considerably the complexity of the
presentation without adding any new insight into the process.

⁷ This assumption is contrary to fact, but its conflict with reality should not produce
serious problems in this particular case. In Nelson (1970) I showed that there should be
more monopoly power in the market for experience goods than in the market for
search goods. Assume the same total market size, and there is no reason to assume that
this varies systematically between search and experience goods. Then average firm size
should be larger for experience goods than for search goods and there should be fewer
firms. If marginal production and advertising costs were constant with respect to size,
these two differences should cancel one another out, i.e., total advertising expenditures
for the industry would not vary through variation in the size of firms within the industry.




For search qualities m = 1; for experience qualities m ≥ 1. It can be
easily shown that

$$\frac{dG_m}{dc} > \frac{dG_1}{dc},$$

where m > 1 and $G_m'' < 0$.⁸ In other words, over the declining portion of
$dG_m/dc$, $dG_m/dc$ is always greater for experience qualities than for search
qualities (or equal when m = 1). Since only the declining portion of the
marginal revenue curve is relevant for firm decision making when
marginal costs are constant or increasing, firms advertising experience
qualities will advertise more than firms advertising search qualities when they
both advertise.
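
The inequality can also be checked numerically. The sketch below assumes the Poisson steady state with d = c/a, under which the derivative of $G_m$ with respect to c is e^{−d} d^{m−1} / (a (m−1)!) and is declining exactly when d > m − 1. The grid of advertising rates, the forgetting rate, and the function names are illustrative choices rather than anything specified in the paper.

```python
import math

def dG_dc(m: int, c: float, a: float) -> float:
    """Slope of G_m in c under the Poisson steady state, with d = c/a."""
    d = c / a
    return math.exp(-d) * d ** (m - 1) / (a * math.factorial(m - 1))

def on_declining_portion(m: int, c: float, a: float) -> bool:
    """The slope dG_m/dc is falling in c exactly when d exceeds m - 1."""
    return c / a > m - 1

a = 1.0                                   # illustrative forgetting rate
grid = [0.25 * k for k in range(1, 81)]   # advertising rates c from 0.25 to 20

# Collect any point on the declining portion where the experience-goods slope
# (m > 1) fails to exceed the search-goods slope (m = 1).
violations = [
    (m, c)
    for m in range(2, 7)
    for c in grid
    if on_declining_portion(m, c, a) and not dG_dc(m, c, a) > dG_dc(1, c, a)
]
print(violations)
```

No violations appear on this grid, which is what the ratio of the two slopes, d^{m−1}/(m−1)!, would lead one to expect whenever d > m − 1.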

Can the marginal cost of production and advertising always be above
the marginal revenue curve of advertising for experience qualities? This is
possible for any given brand of a product (which will then not advertise),
but it is impossible for the market in general. If for all brands the marginal
cost curve is higher than the marginal revenue curve for an m as low as
two, then the market will make the minimum m one. The marginal
revenue curve for search and experience will then be the same only in
this polar case. Of course, the industry’s total advertising expenditures are
independent of the number of firms as long as marginal costs of production
and advertising are constant.

How does one test the proposition that there will be more advertising
of experience than of search goods? In “Information and Consumer
Behavior” (Nelson 1970), it is shown that goods could be classified
successfully by whether the quality variation was ascertained predominantly by
search or by experience, and the respective goods were called “search
goods” and “experience goods.” To take account of some ambiguity in
the classification procedure in Nelson (1970), two alternative
classifications were used. They are again used here, together with a third
classification (Classification II with cameras moved from the search to the
experience category). This third classification is motivated by the
discussion in Nelson (1970, p. 320 n.). The commodities in these
classifications are presented in table 1. The following mean advertising ratios were


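⁸ $$G_m'' = \frac{e^{-d} d^{m-2}}{a^{2}\,(m - 2)!}\left(1 - \frac{d}{m - 1}\right).$$

Since d > 0 and m ≥ 2, $G_m'' < 0$ implies

$$\frac{d}{m - 1} > 1.$$

But $\frac{d}{m - i} > \frac{d}{m - 1}$ for m − 1 ≥ i > 1. Hence, $\frac{d}{m - 1} > 1$ implies that

$$\frac{d^{m-1}}{(m - 1)!} > 1.$$

Hence, $G_m' > G_1'$ when $G_m'' < 0$.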





TABLE 1

Advertising/Sales Ratios in Consumer Category by Information Category*

Good and Information Category             Advertising/Sales Ratio, 1957

Experience durable:
  Books                                       2.702
  Paints†                                     1.*50
  Tires                                       1.385
  Appliances                                  3.296
  Motorcycles and bicycles                    1.078
  Motor vehicles                              0.907
  Motor vehicle parts and accessories         0.700
  Professional and scientific instruments‡    2.086
  Clocks and watches                          5.629
  Communications equipment                    2.034
    Average                                   2.177

Experience nondurable:
  [illegible]                                 6.872
  Wine                                        4.395
  Liquor                                      2.408
  Dairy products                              1.885
  Grain mill products                         1.695
  Cereals                                     4.845
  Bakery                                      2.803
  Sugar                                       0.280
  Confectionery                               3.543
  Miscellaneous foods                         4.073
  Cigars                                      2.370
  Other tobacco                               5.429
  Drugs                                      10.280
  Soaps                                       7.938
  Perfume                                    14.723
  Petroleum refining                          0.507
  Meats                                       0.610
  Periodicals                                 0.304
    Average                                   4.085

    Average total experience                  3.427

Search goods:
  Knit goods                                  1.075
  Carpets                                     2.052
  Hats                                        2.124
  Millinery                                   0.326
  Men’s clothing                              0.928
  Women’s clothing                            1.263
  Miscellaneous apparel                       1.269
  Furniture                                   1.451
  Footwear                                    1.326
  Leather goods                               1.204
  Jewelry§                                    2.202
  Costume jewelry§                            2.498
    Average                                   1.395

*Goods classified by Classification III; data from Telser (1964).
†Search good in Classification I.
‡Search good in Classifications I and II.
§Experience good in Classification I.


obtained:

Classification I:    $\bar{X}_e$ = .0347, $\bar{X}_s$ = .0123, D = .0234, t = 3.67;
Classification II:   $\bar{X}_e$ = .0348, $\bar{X}_s$ = .0137, D = .0137, t = 3.16;
Classification III:  $\bar{X}_e$ = .0343, $\bar{X}_s$ = .0132, D = .0211, t = 3.40;

where $\bar{X}$ = mean advertising/sales ratio, e = experience goods, s =
search goods, D = difference of the means, t = test result of the
hypothesis that D = 0. For all these classification procedures the differences are
large, and hence are economically as well as statistically significant.
The chief problem with this test and subsequent tests to be made is the
possible ambiguity in the classification of goods into the two information
categories, search and experience. Alternative classification procedures









do not appear to have a large effect. Of course, random errors in
classification bias results toward zero observed differences in the means, so they
do not vitiate the procedure. The big problem in classification is an
unconscious (or conscious?) tendency for the classifier to produce the
results he is seeking. Hopefully, the present classification, which was
developed prior to the present tests, will be found to be valid.⁹

In consequence our results support the hypothesis that producers of
experience goods advertise more than producers of search goods. This
result is important because it in turn supports our fundamental behavioral
proposition that advertising of experience qualities increases sales
through increasing the reputability of the seller, while advertising of
search qualities increases sales by providing the consumer with “hard”
information about the seller’s products.

There is another important feature of these results. They are precisely
the opposite of what would be anticipated given an obvious alternative
theory of the way in which advertising for experience qualities operates.
As I discussed in the first section, there is, indeed, some “hard”
information conveyed by advertising for experience goods. A consumer is able
to relate brand to function. For search goods, advertisements can provide
far more information than the relation of brand to function.

If advertising were solely concerned with distributing “hard”
information, there should be more advertising measured in dollar terms for search
goods than experience goods, simply because there is more “hard”
information that can be conveyed about search qualities. Think, for example,
of an advertisement saying simply, “Bayer is an aspirin.” Then think of
an advertisement saying, “Jonathan Logan is a dress,” and showing a
picture of a Jonathan Logan dress. The latter advertisement takes up
more space and, hence, is more expensive. As we have seen, however, the
data produce results precisely contrary to this prediction. This strongly
suggests that the information content of advertising for experience goods
goes far beyond the relation of brand to function, which is another reason for
believing that advertising for experience goods provides information about
the reputability of brands.

⁹ Obviously, the simple test used in the text makes no attempt to control for other
variables. Instead I have tried to present as many simple tests as I can in the belief that
the many biases to which any one test is subjected (of which excluded variables is only
one example) will not operate consistently in all my tests. One example of a possible bias
is the close relationship between search goods and style goods. (This relationship is not
accidental but engendered by the search, in contrast to the experience, process.)
It would appear, however, that the bias induced by the association of search and style
runs against the test results reported in the text. The evidence we have (e.g., Telser 1964)
suggests a positive association between advertising in an industry and the number of new
brands in the industry. The same logic which generates that association would make
style changes, a characteristic of style goods, generate additional advertising. That
search goods tend to be style goods could hardly explain the low levels of advertising in
search goods.





This last paragraph does not imply that the relation of brand to function
is a nonexistent activity of advertising for experience goods. The
implication is only that the effects of this activity are swamped by the effects of
advertising to increase the reputability of experience brands. Why?
Simply because most advertising of experience goods occurs with such
intensity that most people already know the relation of brand to function.

This explanation is itself open to test. The smaller the advertising
intensity of an experience good, the more likely that the advertising
conveys new information to the consumer of the relation of brand to function.
We have just seen that when advertising for experience goods is providing
that kind of information the optimal page size of an advertisement ought
to be smaller than the optimal page size for search goods, since the latter
advertisements have more information to convey.

When, however, advertising for experience goods is designed primarily
to increase the reputability of a brand, two forces will tend to increase
the page size of advertisements for experience goods relative to this page
size for search goods. First, even with the same amount of “hard
information” and even when the consumer sees both, a larger advertisement
should increase sales for experience goods more than a smaller
advertisement. The larger advertisement enhances the reputation of the advertiser
more than does the smaller advertisement. (When consumers select brands
by volume of advertising, they will choose in terms of the dollar volume of
advertising they see rather than simply the number of advertising
messages.) Under these same conditions, the smaller and the larger
advertisement for search goods would produce the same increase in sales.

Second, the probability of a consumer seeing an advertisement varies
more with page size for experience goods than search goods. As we shall
see in the next section, the marginal return to the consumer is small for
advertisements whose sole additional information is increases in the
reputability of a brand. In consequence, such advertisements must depend
much more on catching a disinterested consumer’s eye than
advertisements that consumers seek out. Large page size tends to accomplish that
objective (by way of contrast, classified advertisements have both large
marginal returns to the consumer and exceedingly small page sizes).

One can, therefore, predict that as the volume of advertising increases
by brand, the difference between the average page size of advertisements
for experience and search goods increases. I tested this proposition by
examining the national brand advertisements in the New Yorker for all of
1965. (This magazine was chosen because it is one of the few that has a
lot of advertisements of both search and experience goods.) I measured
the volume of advertising by the total pages purchased in the New Yorker
by brand. I then divided this volume of advertising into four categories.
As table 2 shows, there is a perfect rank correlation between the
differences in page size of experience and search goods and the volume of






advertising. The probability of this perfect ordering being produced by
chance is one twenty-fourth. In consequence these results are significant
at the 5 percent level.
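
The one twenty-fourth figure follows from counting orderings, as the sketch below illustrates. It assumes that under chance every ordering of the four volume categories is equally likely and that only the single ordering matching the predicted direction counts as a success; the variable names are illustrative.

```python
from fractions import Fraction
from itertools import permutations

categories = 4  # the four advertising-volume categories

# Every possible ranking of the page-size differences across the categories.
orderings = list(permutations(range(categories)))

# Only one ranking rises in step with advertising volume, as predicted.
perfect = [p for p in orderings if list(p) == sorted(p)]

prob = Fraction(len(perfect), len(orderings))
print(prob, float(prob) < 0.05)  # 1/24 True
```

Since 1/24 is about .042, the result clears the 5 percent threshold.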

These data yield other striking results. The average page size for the
smaller-volume advertisers is smaller for experience goods than for search
goods, and this result is reversed for large-volume advertisers. In the class
of brands that advertised two pages or less in the New Yorker in 1965, the
difference in average page size between search goods and experience goods
was .087. The standard deviation of that difference was .037. Hence this
difference is significant at the 5 percent level.¹¹ For big-volume advertisers
(10 or more pages in the New Yorker in 1965), the difference in average
page size between search and experience goods was −.218; the standard
deviation of that difference was .059. This difference is significant at the
.01 percent level.
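
The size of these two differences relative to their standard deviations can be sketched as below. This is an illustration only: it reads the reported figures as .087 and .037 for small-volume advertisers and −.218 and .059 for big-volume advertisers, and it uses a two-sided normal approximation, which is an assumption of this sketch and not necessarily the test applied in the paper. The function name is hypothetical.

```python
import math

def two_sided_normal_p(z: float) -> float:
    """Two-sided tail probability of a standard normal variate at |z|."""
    return math.erfc(abs(z) / math.sqrt(2))

# Difference in average page size (search minus experience) over its
# standard deviation, for the two volume classes discussed in the text.
z_small = 0.087 / 0.037    # brands with two pages or less
z_large = -0.218 / 0.059   # brands with 10 or more pages

p_small = two_sided_normal_p(z_small)
p_large = two_sided_normal_p(z_large)
print(round(z_small, 2), round(z_large, 2))
print(p_small < 0.05, p_large < 0.001)
```

The sign reversal between the two ratios is the point of interest: search advertisements are larger among small-volume advertisers, and experience advertisements are larger among big-volume advertisers.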

These results explain a phenomenon inexplicable with the simple
information models previously used in the analysis of advertising. It is
traditional in advertising to have a particular brand sponsor a network
television show. This results in three or more commercials during that
period for the same brand. Often the information content of these
commercials does not differ. Since the audience is quite stable over the course
of the program, this advertising behavior makes no sense if the goal of the
firm is to maximize the number of people who have one or more bits of
information about the brand. Obviously a random distribution of the
brand’s commercials does a better job than this bunching procedure. But
this bunching makes sense in terms of the goal of most television
advertising: to increase the reputability of a brand of an experience good.


Marginal Revenue of Advertising to the Consumer 

In the usual analysis of advertising, the consumer plays a rather passive
role. Our analysis, on the contrary, visualizes consumer decision in the
quest for information as a central determinant of the behavior of
advertising. One crucial variable governing consumer decisions is the marginal
revenue to the consumer in confronting an advertisement. This marginal
revenue has to be greater than the time cost to the consumer in order for
him to examine an advertisement. This places a constraint on both the
amount and type of advertisement a brand will use.

Our model suggests that there should be a systematic difference between

¹⁰ I ran this test both including and excluding advertisements for books. The reason
for this strategy was that book advertisements dominate the small-volume advertisement
category for experience goods. The perfect rank correlation was produced by both
procedures.

¹¹ If books are included in experience goods, the difference is .247 and its standard
error .028. The results are significant at the .01 percent level.



744 


JOURNAL OF POLITICAL ECONOMY 


the margjinai revenue to the consumer for experience goods and the 
marginal revenue for search goods The marginal revenue of information 
to the consumer for a good is the expected improvement in his utility as a 
result of using this information in his sampling of brands of this good 
This expected improvement can be analyzed as the product (m the mathe¬ 
matical sense) of two components (1) the standard deviation of the utility 
distribution of brands (i e , the distribution of P*) for that good, (2) the 
expected improvement in standard deviation units of utility that can be 
obtained by using that information The first factor depends on tht nature 
of the market for the good The second factor depends on the nature of the 
information available about the good Since the focus is on information 
in this analysis, it is not surprising that the second factor is the agent of 
the elected difference between the marginal revenue of advertising to 
the for search and cxpcnencc goods, respectively 

But the first factor cannot be ignored. The nature of the market is also
a function of the nature of the information (Nelson 1970). Indeed, the
standard deviation of the utility distribution of experience and search
goods might well be systematically different. However, the nature of that
difference is not at all clear. Two forces operate in opposite directions on
the relative magnitude of this standard deviation of search and experience
goods. On the one hand, competitive pressure reduces the standard
deviation of utilities about which consumers agree more for search goods
than for experience goods (Nelson 1970). On the other hand, the greater
sample size that consumers use for search goods will increase the standard
deviation in utility of characteristics about which consumers disagree for
search goods relative to experience goods.¹²

It is impossible to say, a priori, which of the two forces analyzed will
dominate in determining the relative variance of the utility of search and
experience goods for a given consumer. It would be surprising, however,
if this mixed effect could outweigh the impact of the difference in the
information characteristics of search and experience goods. In analyzing
this difference I will assume that the payoff to the same information would
be the same for search and experience goods, that is, the utility
distributions by brand for a given customer are the same for search and
experience goods.

¹² A brand that specializes in satisfying an extreme taste will become the
most-preferred brand of those with that taste at the expense of becoming
one of the least-preferred brands of those with average taste and tastes at
other extremes. The smaller the sample size of brands (assuming that it is
greater than one), the smaller the payoff to such specialization. Sample
sizes as small as two screen out least-preferred alternatives quite
effectively, but do not effectively discriminate between most-preferred
brands and brands nearby on a consumer’s utility scale. Any increase in
sample sizes beyond two increases the payoff to being the most-preferred
brand for some people considerably, and only changes the cost of being the
least-preferred brand for other people mildly. Hence, greater sample sizes
of brands encourage greater variance among brands in the characteristics
about which consumers disagree. In a simple model of this process, a
sample size of two produces a brand distribution concentrated at the median
of the distribution of consumers by their most-preferred quality; an
infinite sample size produces a brand distribution that corresponds exactly
with that distribution of consumers. (An analysis of this model is
available upon request.)

The big difference in the character of the advertisements of search and
experience goods is that advertisements for experience goods are
dominantly “soft” or indirect information. Consumers use “soft” information
for experience goods because they have no option. The primary information
content of advertisements for experience goods is the information
that the brand advertises. For search goods, on the other hand, the
consumer does have the option of using “hard” or “soft” information. But
one of the basic assumptions of our model is that consumers respond to
the direct information contained in the advertising message (“hard”
information) for search goods. This implies that for search goods this
“hard” information has a greater value to the consumer than “soft”
information. I predict, therefore, that the marginal revenue of
advertisements to the consumer will be greater for search goods than for
experience goods.

To test this proposition I look at the distribution by media of advertising
for experience and search goods. Consumers will be willing to look at
advertisements as long as the marginal revenue to them of so doing is
greater than their marginal cost. The marginal cost to the consumer is
dominantly a time cost. This time cost will vary by the alternative uses of
the time used in watching the advertisement.

Many advertising media take advantage of severe restrictions on the
alternative time uses of the consumer. Television and radio intersperse
advertisements with programs the consumer wants to hear. By listening to
the commercials the consumer sacrifices only those activities he can do
during the short time period of the commercial. The cost to the consumer
of looking at billboards and transit posters is only the activities he can
do while confined to a moving vehicle. Only advertisements in newspapers
and magazines demand of the consumer the sacrifice of the best of his
possible alternative time uses. For newspapers and magazines he can
arrange the time saving from not looking at the advertisements any way he
chooses.

Newspapers and magazines have the added property that the consumer
can look at the advertisements as many times as he wishes. We would
expect some positive relationship between the marginal revenue to the
consumer of an advertisement and the number of times he will want to
look at that advertisement. Hence, we would expect advertisements with
a high marginal revenue to the consumer to concentrate in newspapers
and magazines relative to the other advertising media; we would expect
just the contrary behavior for advertisements with low marginal revenue
to the consumer. We, therefore, predict that there will be more
advertisements for search goods in newspapers and magazines relative to the
other advertising media compared to the same behavior for experience goods.

TABLE 3

Ratio of Television Network to Magazine Advertising by Goods, 1966

Goods                        Ratio*

Experience:
  Automobiles                1.93
  Foods                      2.35
  Toiletries                 2.46
  Tobacco                    2.61
  Drugs                      2.58
Search:
  Apparel                    1.59
  Household furniture        0.84

* Includes only those goods tabulated for both. Alcohol not included
because of prohibition on network advertising of hard liquor (U.S. Bureau
of the Census 1967).


TABLE 4

Ratio of Local to National Advertising by Local Media, 1966

Media                        Ratio*

Newspapers                   4.1
Radio spot                   2.2
Television spot              0.5
Outdoor                      0.6
Transit                      1.0

* U.S. Bureau of the Census (1967).


I test this proposition by comparing the distribution of advertising for
magazines against television network advertising for different kinds of
consumer goods. The results overwhelmingly support the hypothesis.
Look at the ratio of advertising expenditures for television to these
expenditures for magazines for 1966 as reported in table 3. Every one of the
experience goods has a higher ratio than the search goods. The geometric
mean of this ratio for experience goods is 2.45, while for search goods the
geometric mean is 1.16. This difference is statistically significant at the
1 percent level, t = 4.2.
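The comparison of geometric means can be retraced with a short sketch. It assumes the ratios exactly as they are printed in table 3; the function and variable names are illustrative and do not come from the article. With the printed search-goods ratios the geometric mean rounds to the reported 1.16. The printed experience-goods ratios are already rounded, so they need not reproduce the reported experience-goods mean to the last digit.

```python
import math

# Television-network-to-magazine advertising ratios as printed in table 3.
experience = {"Automobiles": 1.93, "Foods": 2.35, "Toiletries": 2.46,
              "Tobacco": 2.61, "Drugs": 2.58}
search = {"Apparel": 1.59, "Household furniture": 0.84}


def geometric_mean(values):
    """Return the geometric mean: exp of the average of the logs."""
    values = list(values)
    return math.exp(sum(math.log(v) for v in values) / len(values))


gm_experience = geometric_mean(experience.values())
gm_search = geometric_mean(search.values())

# The text's claim that every experience good has a higher ratio than
# every search good amounts to min(experience) > max(search).
fully_separated = min(experience.values()) > max(search.values())

print(f"experience: {gm_experience:.2f}  search: {gm_search:.2f}  "
      f"fully separated: {fully_separated}")
```

A geometric mean suits ratio data because it treats a doubling and a halving of the television/magazine ratio symmetrically, which an arithmetic mean does not.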

We can have more than usual confidence in this test, because we can
test the test. Using quite different (but highly persuasive) evidence,
Ferguson (1963) has established that local advertisements have a higher
marginal revenue to the consumer than national advertisements. (Ferguson’s
result is predicted by our theory. Local advertising provides for
the most part information that can be checked prior to purchase: price
and where to find things.) If our tests were indeed a test for variation in
the marginal revenue to the consumer, we should find the ratio of local to
national advertisements higher for newspapers than for all other local
media. Look at table 4 for the ratio of local to national advertising by
local media. While there are too few observations to hope for statistical
significance, the differences in the order of magnitude involved are huge.
This evidence is highly suggestive.


Alternative Sources of Information

Thus far our analysis has concentrated on consumer behavior when faced
with two alternatives: to sample (whether by experience or search) at
random or with the aid of advertising. Advertising is not the only aid to
sampling that the consumer can employ. The most obvious source of
assistance that consumers have is the recommendation of relatives and
friends and consumer magazines. It seems reasonable to suppose that for
experience goods consumers believe these recommendations constitute
better information than advertising.¹³ Certainly these recommendations
must sometimes provide better information or they would never be used.
The consumer is surrounded by a sea of advertising whose information is
obtained by the consumer at virtually no cost to himself. The consumer,
then, always has the alternative of using advertising as his guide. Instead
he sometimes uses the guidance of relatives and friends. Hence, that
information must sometimes be better information. The more consumers
use relatives and friends, the less they respond to advertising.

I showed in “Information and Consumer Behavior” that guidance was
used more for goods for which there is a low frequency of purchase than
goods for which there is a high frequency of purchase, both for experience
and search goods. (For simplicity I will henceforth call low frequency of
purchase goods “durables” and high frequency of purchase goods
“nondurables,” though the essence of their behavioral difference is
contained in the frequency of purchase.) This leads to the reverse
prediction for advertising: greater advertising for nondurables than for
durables for both experience and search.

However, this relationship should be far stronger for experience goods
than for search goods. First, the level of guidance is far less for search
goods than for experience goods. In consequence, guidance should have less
impact on the demand for advertising for search goods. Second, advertising
information is better for search goods than it is for experience goods,
whereas the information of relatives and friends is worse for search goods
than it is for experience goods. Hence, advertising information will
compete more effectively with the information of guides in the case of
search goods.

¹³ For search goods the case is not clear at all. A consumer might well
regard a picture of a dress contained in an advertisement as better
information than a friend telling her that the dress is pretty.

¹⁴ Durables and nondurables will differ by another important characteristic
as well. The value of advertising information to the consumer for a given
number of advertising messages will be greater for durables than
nondurables. However, this consequence will produce ambiguous effects on
advertising expenditures. The lower value of advertising information to the
consumer for nondurables will reduce the number of advertising messages
that consumers receive for nondurables, but the low value of advertising
information to the consumer will also increase the expenses to the producer
for every message that the consumer does receive.

I find a significantly higher average advertising/sales ratio for
nondurable than durable experience goods no matter which of the alternative
classification procedures I use for experience goods:

Classification I:    durable mean = 2.293, nondurable mean = 4.085, t = 1.92;

Classification II:   durable mean = 2.187, nondurable mean = 4.085, t = 1.97;

Classification III:  durable mean = 2.177, nondurable mean = 4.085, t = 2.028;

where the durable mean is the arithmetic mean advertising/sales ratio for
durable experience goods and the nondurable mean is the arithmetic mean
advertising/sales ratio for nondurable experience goods; t-values are
computed for tests of the difference between two means and are significant
at the 5 percent level with a one-tail test procedure. For search goods the
sample size is too small for any significance test, but the cursory
evidence points to the reverse relationship: higher advertising/sales ratio
for durables than for nondurables. With British data, Doyle (1968) also
finds an inverse relationship between durability and advertising/sales
ratios. But since Doyle did not make any distinction between search and
experience goods, he did not observe that that relationship was confined to
experience goods.

Doyle (1968) observes one more relationship that can be explained by
advertising’s role as information. He finds a significantly negative
relationship between advertising/sales ratios by product and the unit price
of that product, where the unit is defined roughly as the quantity that is
ordinarily purchased at one time, for example, one car, one pound of
coffee, etc. A low unit price will mean some combination of the following:
high frequency of purchase or low total expenditure on the commodity.
I have already shown why there should be a close relationship between
high frequency of purchase and advertising expenditures. Because the
classification is so broad, however, the durability measure does not catch
all of the effect of frequency of purchase on advertising. So even when
durability is explicitly considered in the multiple regression, part of the
relationship between unit price and advertising will be attributable to the
relationship between price and frequency of purchase.

The relationship between unit price and total expenditures on a product
will also yield a negative association between unit price and advertising.
The greater the total expenditures of consumers for a product, the greater
the anticipated variance of utility to consumers of brands of that product.
In consequence, when consumers spend a lot for a commodity they will
tend to use better (but more expensive) information than advertising:
to wit, the guidance of relatives and friends or consumer magazines.
Doyle’s evidence, then, provides additional support for my explanation
of the impact of durability on the advertising/sales ratio. Of course, this
is hardly conclusive evidence. Only the cumulative impact of additional
studies can produce evidence that could be even remotely so characterized.

¹⁵ It can be demonstrated that the correlation of consumer preferences is
lower for search goods than for experience goods.


Deceptive Advertising 

In the preceding pages I have given scant attention to deceptive
advertising, though, obviously, deceptive advertising exists. We have seen
that some deception will occur even in the case of search qualities, where
the consumer has the most market power over the content of advertisements.
But the amount of deceptiveness in advertising can be easily exaggerated
if one simply looks at the incentives of advertisers to deceive without
considering the incentives of consumers not to be deceived. The
circumstances under which advertisers have the greatest incentives to
deceive if consumers believed them are precisely the circumstances under
which consumers would be least inclined to believe advertising. Deception
requires not only a misleading or untrue statement, but somebody ready to be
misled by that statement.

One possible source of deceptive advertising is consumer confusion.
As long as consumers followed the decision rule (believe an advertisement
for experience qualities when it tells about the functions of a brand; do
not believe the advertisement when it tells how well a brand performs
that function), the consumer will rarely be deceived. But there is no
guarantee, of course, that consumers will always use that decision rule.

There is another important source of deceptive advertising: the law.
Whenever a law on advertising practices is moderately enforced, deceptive
advertising is sure to occur. Take, for example, the law prohibiting the
mislabeling of the fabric content of clothing. If that law is sufficiently
enforced, consumers will believe that a clothing label is usually correct.
This will provide an incentive for some manufacturer to mislabel, unless
the law is enforced so vigorously that nobody gains from breaking it,
a nonoptimal level of law enforcement (Becker 1968; Stigler 1970). In the
absence of the law no one could trust any clothing label that it was not in
the self-interest of the producer to specify correctly. Hence these clothing
labels, though incorrect, would not deceive many people.

The law increases deceptive advertising in another way. Consumers
are unlikely to be legal experts. Some, therefore, are likely to believe that
certain forms of deceptive advertising are prevented by law when, in fact,
they are not. It is not clear that broadening the definition of fraudulent
advertising would solve this problem. The more the law protects against
fraud, the more people think the law protects against fraud.
Misinterpretation of the law’s domain will exist, no matter how extensive
that domain.

Why, then, would the police power of the state ever be invoked against
“deceptive” advertising, since it is quite possible that these laws increase
rather than decrease deception in advertising? These laws can accomplish
something. They can, at a cost, make more information available to the
consumer. Consumer market power reduces deception by consumer
distrust of any statement about which it is in the self-interest of producers
to deceive. As we have seen, for experience qualities this narrows
considerably the information available to consumers from advertising. Some
people might deem it important that other information be made available
from advertising, that consumers, for example, be able to determine the
fabric content of their clothing. Laws can achieve this objective at the
price of both enforcement costs and costs to the consumer of the
elimination of possible memorable sources of indirect information.

These laws will be relevant almost exclusively to the advertising of
experience qualities, since, even without laws, advertising for search
qualities provides fairly complete information about the properties of the
brand being advertised. Deceptive advertising will be concentrated
where the laws are concentrated: almost exclusively in the advertising
of experience qualities. There is evidence that supports this contention.
Though I find it unsatisfactory, let us use for the moment the Federal
Trade Commission’s criterion of what constitutes deceptive advertising.
Whatever its shortcomings, it has developed independently of the desire
to generate data that would either support or reject hypotheses of
economists. For the first 6 months of 1965 the Federal Trade Commission
found 58 advertisements deceptive about the quality of the product (as
distinguished from its price).¹⁶ All were advertisements about experience
qualities (U.S. Federal Trade Commission 1970).

Clearly, then, there is some deceptive advertising. The only empirical
question is one of the magnitude of this phenomenon. My analysis up to
now has given little weight to deception in advertising. What changes in
this analysis are required if deception is an important rather than an
unimportant part of advertising?

¹⁶ The Federal Trade Commission did find a number of deceptive
advertisements related to price. Ordinarily the price of an article is known
prior to purchase. In all cases of deceptive advertising about price,
however, the deceptive advertising was about price characteristics that
would not be discovered prior to purchase. When payment is postponed, the
consumer need not know the price before the purchase. This offers a firm an
opportunity to place terms in the contract about which a large proportion
of consumers would not be aware prior to purchase. The Federal Trade
Commission judged three of these cases in 1965 (U.S. Federal Trade
Commission 1970). The other category of deceptive price advertising found
by the Federal Trade Commission was the advertisement of a deceptively high
regular price (six of these cases in the first 6 months of 1965). In
purchasing the good, the consumer usually knows the purchase price but not
the regular price. In consequence this deception is not something that
would be revealed to the consumer prior to purchase. (The basis for the
deception is that consumers often use regular price quotations to economize
on search.)

Surprisingly, this phenomenon would have little impact on the major
conclusion of my analysis. Look again at my demonstration that brands
with the lowest P*’s (price/utility) have the greatest incentives to
advertise. It does not require intelligent consumer response to advertising,
though it provides a basis for such intelligent response. Consumers who
actually believe paid-for endorsements are the victims of the most benign
form of deception. They are deceived into doing what they should do
anyway. In consequence there should not be much difference between
the behavior of those who respond to advertising because they are
intelligent, and the behavior of the deceived. Under these circumstances it
is both exceedingly difficult and not very important to put the deception
hypothesis to the test.

This discussion of the deception hypothesis helps explain why
economists have been so long deceived about the character of advertising. It
does not pay consumers to make very thoughtful decisions about
advertising. They can respond to advertising for the most ridiculous
explicit reasons and still do what they would have done if they made the
most careful judgments about their behavior.

Whatever their explicit reasons, the consumers’ ultimate reason for
responding to advertising is their self-interest in so doing. That is, it is
no mere coincidence that thoughtful and unthoughtful judgments lead to the
same behavior. If it were not in consumer self-interest to respond to
advertising, then consumers’ sloppy thinking about advertising would cost
enough that they would reform their ways.

The learning required of consumers in this case is not very complicated.
Advertised endorsements have been used for a long time. During this
period the consumer, his parents, and his grandparents have consumed
countless quantities of advertised and nonadvertised brands. If consumers
were losing out by trying advertised products, they would have had an
enormous number of opportunities to discover this fact.

Many economists have felt that other consumers think quite imprecisely
about advertising, and well they might. But this superficial observation
has led economists, but not consumers, astray. Economists have failed to
see that consumers’ response to advertising persists because of the
underlying information role of advertising.


Summary and Conclusion

This paper has attempted to show the way in which advertising as
information operates. I have contended that there is a basic difference in
the character of the information conveyed by advertising for search and
experience goods. For search goods advertising’s information is direct.
For experience goods the information conveyed is dominantly indirect:
simply that the brand advertises. I have shown that this difference in the
character of information leads to greater advertising expenditures for
experience goods than search goods and greater marginal revenues to the
consumer for search goods than experience goods. These implications were
found satisfied by the data.

This analysis has not examined the totality of advertising’s operation.
I have not looked at the relationship between advertising and frequency
of purchase of the advertised good, in contrast to brand. For example,
food advertisements often contain recipes which provide information for
the use of the good in question. Though this may be an important part
of advertising’s activity, it is irrelevant with respect to the set of
implications examined in this article.

There is another possible function of advertising which I have not
analyzed because I do not know how: advertising’s impact on a consumer’s
utility function, holding information constant. The change-in-taste idea
cannot be effectively tested because no real theory about taste changes has
been developed. Fortunately, one does not need such an idea to explain
the major features of advertising behavior.

It should be perfectly obvious to anyone who has read this article that
an enormous amount of work on advertising still remains to be done. On
the theoretical side I have not provided a general equilibrium solution
for the amount of advertising expenditures (in particular I have not
solved for m in eq. [7]); on the empirical side I have not explained a
substantial portion of the variance of advertising/sales ratios by industry.
The focus of this paper has been neither to develop a complete theory of
advertising nor to present a complete multiple regression analysis of
advertising’s empirical behavior. Rather, the emphasis in this paper has
been to develop a theory of advertising that is both reasonable and
productive. The productivity of this theory does not depend upon its
completeness (as partial equilibrium analysis in general attests). A theory
is productive if it can generate implications that are consistent with
real-world behavior. The theory developed in this article satisfies that
requirement. That this theory makes sense out of a wide range of empirical
phenomena suggests that it has caught the essence of advertising behavior,
that further theoretical (or empirical) work would not vitiate our results.¹⁷

¹⁷ For example, it has been suggested that the equilibrium that I envision
cannot be stable, that all consumers would tend to shift to the
better-advertised brand until that brand had all the customers. This
contrary-to-fact result can easily be prevented by allowing consumer tastes
and income to vary. Nonadvertised brands, then, would specialize in
minority consumer taste. (While my analysis has for the most part assumed
homogeneous consumers for simplicity, its conclusions do not depend upon
this assumption.)





Appendix

Producers of equal efficiency have a choice of making a low-utility brand at
high unit profits or a high-utility brand at lower unit profits. It is
reasonable to assume that the sales from customers who have no information
about the brands’ utilities will be the same for the two brands. But
consumers will tend to repeat the purchase of higher-utility brands. In
consequence, the expected sales of the high-utility brands will be greater.
In equilibrium, the total profits of equally efficient firms will be the
same whether they make a high- or a low-utility product if firms of this
level of efficiency continue to produce both. This implies that the profit
generated from the average individual making a purchase without information
will be the same for both the high- and low-utility brands. (Since both
profits and number of customers with no information are assumed to be the
same for both brands, the ratio of the two must be the same.)

Assume, for the moment, that advertising merely increased the probability of
a person’s making an initial purchase of a brand without any impact on the
conditional probability of a repeat purchase given an initial purchase.
Then high- and low-utility brands would have equal incentives to advertise,
for the marginal probability of a repeat purchase would increase in the
same proportion as the probability of an initial purchase:

        P(R) = P(I) P(R|I),                                        (A1)

where I = initial purchase, R = repeat purchase. With P(R|I) constant, P(R)
increases proportionately with P(I). This implies that the average new
customer generated by advertising would yield the same revenue to the firm
as did the average old customer with zero information for both high- and
low-utility brands. Unless costs behaved in a very peculiar manner, this
would imply that the profits generated by advertising were the same for the
two brands, since the average old customer with zero information produces
equal profits for the two brands.

Why should this not be the state of affairs if advertising were simply
informational? After purchasing (and using) a brand, the consumer does not
require advertising as a clue to the hidden qualities of a good. He can
assess these hidden qualities directly. Why, then, should advertising have
any impact on the conditional probability of a repeat purchase?

A simple mechanism that produces this impact is the process of consumer
memory. The average consumer uses a lot of different products. It costs him
something to remember the brands of these products that he tries. The cost
is smaller the more familiar the name of the brand. Advertising makes brand
names familiar. We would expect, therefore, the consumer to remember a
higher proportion of advertised brands that he has tried than unadvertised
brands. Indeed, there is evidence that this phenomenon does, in fact,
operate. The ratio of the recalled sales of advertised brands to recalled
total sales of a product is higher than their actual market share (Sudman
1962).

This process should have an impact on the conditional probability of a repeat
purchase. A necessary condition for a repeat purchase made deliberately is
that the consumer remember the name of the brand of which he wishes to make
a repeat purchase. But if he remembers the name of the brand, he will not
repeat his purchase by mistake. For sufficiently low-utility brands, a
consumer who remembers the name of the brand will have a lower conditional
probability of a repeat purchase than if he did not remember the name of
the brand. For sufficiently high-utility brands, the reverse will be the
case.

For a high-utility brand, advertising will produce a greater percentage
increase in the marginal probability of a repeat purchase than its impact on
the probability of initial purchase, since both terms on the right in
equation (A1) increase. Since we assume the impact of an equivalent dose of
advertising on the probability of initial purchase to be the same for low-
and high-utility brands, it follows that the producers of high-utility
brands have a greater incentive to advertise than the producers of
low-utility brands.
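The argument built on equation (A1) can be illustrated numerically. The sketch below is only an illustration under stated assumptions: every probability in it is hypothetical and chosen for convenience, and the function names are not from the article. It shows that P(R) rises in the same proportion as P(I) when P(R|I) is held constant, and rises by more (or less) than P(I) when advertising also raises (or lowers) P(R|I).

```python
def repeat_probability(p_initial, p_repeat_given_initial):
    """Equation (A1): P(R) = P(I) * P(R|I)."""
    return p_initial * p_repeat_given_initial


def pct_increase(before, after):
    """Proportional increase from `before` to `after`."""
    return (after - before) / before


# All probabilities are hypothetical, chosen only for illustration.
P_I_BEFORE, P_I_AFTER = 0.10, 0.12  # advertising raises P(I) by 20 percent


def repeat_gain(cond_before, cond_after):
    """Proportional increase in P(R) when advertising moves P(I) and P(R|I)."""
    before = repeat_probability(P_I_BEFORE, cond_before)
    after = repeat_probability(P_I_AFTER, cond_after)
    return pct_increase(before, after)


initial_gain = pct_increase(P_I_BEFORE, P_I_AFTER)

# Case 1: P(R|I) constant, so P(R) rises in proportion to P(I).
constant_gain = repeat_gain(0.40, 0.40)

# Case 2: a remembered brand name raises P(R|I) for a high-utility brand
# and lowers it for a low-utility brand.
high_utility_gain = repeat_gain(0.40, 0.45)
low_utility_gain = repeat_gain(0.40, 0.35)

print(f"P(I): {initial_gain:.3f}  constant P(R|I): {constant_gain:.3f}")
print(f"high-utility: {high_utility_gain:.3f}  low-utility: {low_utility_gain:.3f}")
```

Both hypothetical brands start from the same P(R|I) only to isolate the effect of its change; the ordering of the gains does not depend on that choice.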


References 

Becker, Gary. “Crime and Punishment: An Economic Approach.” J.P.E. 76,
no. 2 (March 1968): 169-217.

Demsetz, Harold. “The Nature of Equilibrium in Monopolistic Competition.”
J.P.E. 67, no. 1 (February 1959): 22-30.

Dorfman, Robert, and Steiner, Peter. “Optimal Advertising and Optimal
Quality.” A.E.R. 44, no. 5 (December 1954): 826-36.

Doyle, P. “Advertising Expenditure and Consumer Demand.” Oxford Econ.
Papers 20, no. 3 (November 1968): 394-414.

Ferguson, James. The Advertising Rate Structure in the Daily Newspaper
Industry. Englewood Cliffs, N.J.: Prentice-Hall, 1963.

Gould, John. “Diffusion Processes and Optimal Advertising Policy.” In
Microeconomic Foundations of Employment and Inflation Theory, edited by
E. Phelps et al. New York: Norton, 1970.

Karlin, Samuel. A First Course in Stochastic Processes. New York: Academic
Press, 1966.

Life, vol. 72 (1972).

Nelson, Phillip. “Information and Consumer Behavior.” J.P.E. 78, no. 2
(March/April 1970): 311-29.

Schmalensee, Richard. “A Note on Monopolistic Competition and Excess
Capacity.” J.P.E. 80, no. 3, pt. 1 (May/June 1972): 586-91.

Stigler, George. “The Economics of Information.” J.P.E. 69, no. 3 (June
1961): 213-25.

Stigler, George. “The Optimal Enforcement of Laws.” J.P.E. 78, no. 3 (May
1970): 526-36.

Sudman, Seymour. “On the Accuracy of Recording of Consumer Panels.”
Unpublished Ph.D. dissertation, Graduate School Bus., Univ. Chicago, 1962.

Telser, Lester. “Advertising and Competition.” J.P.E. 72, no. 6 (December
1964): 537-62.

U.S., Bureau of the Census. Statistical Abstract of the United States, 1967.
Washington: Government Printing Office, 1967.

U.S., Federal Trade Commission. Decisions, Jan. 1, 1965-June 30, 1965.
Washington: Government Printing Office, 1970.



Taxation and National Defense 


Earl A. Thompson

UCLA 


Contrary to traditional models of national expenditures and taxation,
national defense is not a consumer or a producer good. Rather it is a
good produced to protect the ownership of other goods from foreign
takeover. Using a model of international distributional equilibrium
containing national defense and aggression decisions, a Pareto-optimal
national tax structure is derived. The entire U.S. tax structure is then
shown to be roughly equivalent to the optimal structure. This empirical
result indicates that the U.S. political system has somehow been able to
create a roughly efficient tax structure in spite of the systematic
misguidance of economists.


In the traditional theory of national taxation, the government provides
only collective consumer or producer goods. Because governmental
provision of such goods does not alter any of the familiar necessary
conditions for the efficient utilization of the resources remaining for private
use, efficient taxation when the private-goods sector is perfectly
competitive is achievable only with lump-sum taxes (such as head taxes, land
taxes, or equal, ad valorem taxes on all consumer benefits). Any tax other
than a lump-sum tax yields violations of the familiar necessary conditions
for achieving Pareto optimality given the resources remaining for private
use.

In the real world, lump-sum taxes have been consistently rejected
despite the obiter dictum of the traditional theory of taxation.

This paper begins with a development of a theory of efficient national
taxation in a world in which each government provides collective defense
of the property of its citizens. While the traditional model is frequently
alleged to apply to collective defense, it is an error to describe collective
defense as a collective consumer or producer good. A tank is neither a
consumer nor a producer good. In the descriptively more accurate model
of collective defense developed in Section I below, the private
accumulation of certain kinds of capital creates an extra defense burden for the
country protecting that capital. Efficient taxes are then not lump-sum
taxes; efficient taxes discriminate against the accumulation of capital that
is coveted by potential foreign aggressors.

This research was supported by a Lilly Foundation grant for the study of property
rights at UCLA. The author benefited from comments on previous drafts made by
Armen Alchian, Arnold Harberger, Louis Makowski, Bob Rooney, and a referee of this
Journal.

Section II shows that the optimal capital tax is achieved by the use of
a simple income tax complemented by (1) realistic depreciation
allowances, (2) tax write-offs for charity and abnormal, noncosmetic, medical
expenses, and (3) a theoretically specified, positive (a) percentage
depletion allowance to natural resource owners, (b) degree of progression
in the income tax rates, (c) minimum income exemption for each
individual and his dependents on his income tax, (d) corporate-profits tax,
(e) excise tax on each consumer durable, and (f) tax break on capital
gains.

In contrast, in the traditional economic theories of government
expenditures and taxation all of these special features of the income tax, and
the income tax itself, are inconsistent with a Pareto optimum.¹

Section II also presents rough-and-ready quantitative estimates of the
six optimal rates in consideration (3) above. The results are compared with
actual U.S. rates and indicate a striking degree of overall efficiency of the
U.S. tax structure. Our recommendations for policy changes have
relatively minor effects on economic incentives.

The results raise the question of what in the U.S. political system has
permitted the evolution of such an efficient tax structure in spite of the
fact that the only existing tax theories deprecate every major feature of
the structure.


I. The Basic Model 

The Environment 

We shall employ a capital model with discrete time, an infinite horizon,
no joint production, and perfect knowledge about the environment.²

¹ Samuelson (1964) has attempted to provide some kind of rationalization of realistic
depreciation allowances in an apparently traditional environment, but we shall find a
simple economic error in his exercise. Also, some authors have assumed that an income
tax does not generate a “double taxation of savings” and an inefficiently discriminating
tax on future (relative to present) consumption in the traditional model. Since the
double tax is only on the interest from savings in the traditional model and there may
be a zero interest rate, these authors can be defended, but only in a special case.

² The central results in this paper have also been established for a mathematically
less familiar model containing continuous time and Marshallian joint production,
although dropping this assumption requires a substitute assumption, as noted at the
end of this section of the paper. In the case of a finite horizon, several of our
tax-equivalence theorems hold in only an approximate sense. The perfect-information
assumption can be substantially relaxed and is made largely to simplify the discussion.

We assume that each of the $N^a$ individuals in country a has a utility
function defined over all feasible sequences of his consumption benefits,

$$U^i(B_1^i, B_2^i, \ldots), \qquad i = 1, 2, \ldots, N^a,$$

where $U^i(\cdot)$ is a monotone increasing, differentiable, strictly quasi
concave function.³ Aggregate consumption benefit over these individuals
during the tth period is given by the differentiable, quasi concave
production function

$$\sum_{i=1}^{N^a} B_t^i = C^t(K_{1Ct}, \ldots, K_{MCt}), \qquad t = 1, 2, \ldots, \tag{1}$$

where $K_{kCt}$ represents capital of the kth kind ($k = 1, \ldots, M$) devoted to
the production of consumption benefits at time t. Aggregate capital in
country a in each future period is the result of devoting capital in the
preceding period to its production or acquisition so that

$$K_{k,t+1} = I^{kt}(K_{1kt}, \ldots, K_{Mkt}), \qquad t = 1, 2, \ldots, \tag{2}$$

where $I^{kt}(\cdot)$ is differentiable and quasi concave. Foreign aggression is
treated as a form of investment, the capital obtained from a current act
of aggression being unavailable for consumption until the following
period.

National defense effort at time t is

$$D_t = G^t(K_{1Dt}, \ldots, K_{MDt}), \tag{3}$$

where $G^t(\cdot)$ is also differentiable and quasi concave. The aggregate
capital stock of kind k in any period is the sum of the amounts of capital
used in the above activities plus the amount taken by foreign aggressors,
$K_{kAt}$. That is,

$$K_{kt} = K_{kCt} + \sum_{y=1}^{M} K_{kyt} + K_{kDt} + K_{kAt}. \tag{4}$$

³ The “strict quasi concavity” of a function here means that if $f(x) = f(x')$, $x \neq x'$,
then

$$f[ax + (1 - a)x'] - f(x) > \varepsilon \sup_i |x_i - x_i'|$$

for some $\varepsilon > 0$, where $1 > a > 0$ and $x = (x_1, \ldots, x_n)$. This may also be termed
“asymptotically strict quasi concavity.” It assures the absence of infinite quantities in
maximizing f over all x subject only to linear equalities in x with positive coefficients.
This type of quasi concavity, as well as the standard quasi concavity assumptions stated
below [$f(x)$ is quasi concave if $f(x) = f(x')$, $x \neq x'$ implies that
$f[ax + (1 - a)x'] - f(x) \geq 0$], also are used to insure the existence of a competitive
equilibrium. The differentiability assumptions in this paper, together with the absence
of traditional nonnegativity assumptions, are made to facilitate the mathematical
argument, since it is obvious that technical nonsubstitutabilities do not disturb the
optimality of incentives in our efficient tax system.




Relations (1)-(4) insure the absence of joint production and collective
goods in that they state that no particular unit of a capital good serves
several functions simultaneously, such as producing consumption goods
and producing itself in the following period.

Similar relations hold for each country.

We assume that the distribution of capital between countries is an
equilibrium distribution, meaning that each of the countries has rationally
decided (rational in the Paretian sense) which property to claim and
defend in each period and that the decisions are mutually consistent given
the world’s aggregate stocks of capital. The equilibrium initial capital
stock of country a is given by

$$K_{k1} = \bar{K}_{k1}, \qquad k = 1, 2, \ldots, M. \tag{5}$$


We assume that each country knows the rational strategies of the others
so that no actual aggression occurs in determining this equilibrium
distribution of capital between countries. In equilibrium, the country
possessing a unit of capital in a given period is the country that has made
a prior commitment to impose on any other country attempting to
acquire the capital damages which are at least as great as the value of the
capital to that country.⁴ Since the only way to subvert the prior
commitment of another country is to take control over the entire country, all
foreign aggression is all or nothing. So if the net return from aggression is
ever positive, it is greatest for $K_{kAt} = K_{kt}$. Hence, we can write the profit
to aggression against country a for a particular aggressor as

$$\pi_{At} = A(K_{1t}, \ldots, K_{Mt}) - C(D_t), \tag{6}$$

where $A(\cdot)$ is the aggressor’s evaluation function of the assets he acquires
and $C(D_t)$ is his corresponding cost of the aggression, which we assume
to be a monotonic increasing function. The rational foreign aggressor
acquires no $K_{kAt}$ if and only if the profit to the aggression is never positive.

Equilibrium also implies that each country always retains its rationally
produced capital. This is true because if a subsequent unit of capital were
not successfully defended, it would have been better for the country to
consume the capital which was devoted to its production. This follows
from the monotonicity of utility functions and the consumability of
capital expressed in relations (2)-(5). In equilibrium, then, a makes $D_t$
just high enough in each period that for each potential aggressor, $\pi_{At} \leq 0$
for all $K_{kAt} > 0$. That is, it makes $D_t$ just high enough that the solution
value of $K_{kAt}$ is equal to zero for all potential aggressors. From (6), this
level of $D_t$ obviously depends upon $K_{1t}, \ldots, K_{Mt}$, which determines the
return to foreign aggression. Hence, we set $D_t$ equal to $D^t(K_{1t}, \ldots, K_{Mt})$,
the defense requirement of the country at time t, that is, the level of defense
required to dissuade all potential aggressors. Also, since there are no
foreign aggression activities in equilibrium, the only output of the
“government” is $D_t$. Hence, equations (3) and (4) are written

$$D^t(K_{1t}, \ldots, K_{Mt}) = G^t(K_{1Dt}, \ldots, K_{MDt}), \tag{3'}$$

$$K_{kt} = K_{kCt} + \sum_{y} K_{kyt} + K_{kDt}. \tag{4'}$$

⁴ It may be of interest to note that such commitments are generally impossible to
make in a pure democracy, as the voters can always vote against a war by voting down
war appropriations. Such a government is sure to lose, bit by bit if not all at once, all
of its transferable capital to a nondemocratic foreign aggressor that can make
commitments to fight wars over property at war costs to both parties which are greater than the
value of the property at stake. (The reason such commitments are rational is that once
they are made, the democratic country rationally surrenders its capital so that fighting
the war is unnecessary.) However, in a constitutional democracy, where certain
government policies are not subject (except at great cost) to future voter disapproval, the
constitution, by giving proper incentives to the government leaders and by allowing
them to command war resources without voter approval, may effect the necessary war
commitment. Constitutionally granted wartime finance policies such as the draft, debt
financing and government currency creation, and price controls are thus a necessary
part of our wartime financial structure. An important implication of this necessary,
confiscatory, wartime financing is that there is an insufficient accumulation of
war-relevant capital during peacetime. This is the economic basis of the classical “national
defense argument” for peacetime subsidization of certain domestic industries (see
Thompson [1966] for an elaboration and empirical application). In the formal model
above, such subsidies appear as government purchases of capital used to produce national
defense.

Conditions for Pareto Optimality 

Maximizing $U^1(B_1^1, B_2^1, \ldots)$ subject to $U^j(B_1^j, B_2^j, \ldots) = U^{j*}$,
$j \geq 2$, and equations (1), (2), (3'), (4'), and (5), we find that necessary
for a Pareto optimum in our environment is that the allocation of resources
(i.e., $K_{kCt}$, $K_{kyt}$, $K_{kDt}$, and $B_t^i$ for all t, k, y, and i) satisfies, in addition to the
constraint equations above, the following marginal equalities:

$$\frac{C_k^t}{C_y^t} = \frac{I_k^{zt}}{I_y^{zt}} = \frac{G_k^t}{G_y^t} \qquad \text{for all } t, k, y, \text{ and } z, \tag{7}$$

$$\frac{\partial U^i/\partial B_t^i}{\partial U^i/\partial B_{t+1}^i} = \frac{C_y^{t+1} I_k^{yt}}{C_k^t}\left(1 - \frac{D_y^{t+1}}{G_y^{t+1}}\right) \qquad \text{for all } t, k, y, \text{ and } i, \tag{8}$$

where subscripts on function symbols indicate partial derivatives of the
function with respect to capital of the type specified by the subscript.
Equation (7) states the familiar condition that in an optimum, different
kinds of capital are allocated between sectors so that their relative marginal
productivities are equal whatever they produce. Equation (8) states that
in an optimum, the marginal rate of time preference of $B_t$ over $B_{t+1}$ is
less than the familiar marginal rate of time productivity by a percentage
equal to the increase in defense requirement caused by the capital which
is produced to create the extra $B_{t+1}$ relative to the defense productivity
of this capital.
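
The wedge that equation (8) places between time preference and time productivity can be illustrated numerically. The following is a minimal sketch, assuming (8) is read as the marginal rate of time productivity scaled by one minus the ratio of the marginal defense requirement to the marginal defense productivity; the function names and all numbers are hypothetical and are not taken from the paper.

```python
def time_productivity(c_next_y, i_k, c_now_k):
    """Familiar marginal rate of time productivity: C_y^{t+1} * I_k^{yt} / C_k^t."""
    return c_next_y * i_k / c_now_k


def optimal_time_preference(c_next_y, i_k, c_now_k, d_y, g_y):
    """Right-hand side of (8) under the stated reading:
    time productivity scaled down by (1 - D_y^{t+1} / G_y^{t+1})."""
    if g_y <= 0:
        raise ValueError("marginal defense productivity must be positive")
    return time_productivity(c_next_y, i_k, c_now_k) * (1.0 - d_y / g_y)


# Hypothetical marginal products, chosen only for illustration.
mrt = time_productivity(c_next_y=1.0, i_k=1.1, c_now_k=1.0)
mrs = optimal_time_preference(1.0, 1.1, 1.0, d_y=0.05, g_y=0.5)

# Percentage shortfall of time preference below time productivity;
# under this reading it equals D_y / G_y.
wedge = 1.0 - mrs / mrt
```

With these illustrative values the shortfall equals the ratio of the two defense terms, which is the "percentage" the text describes.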





Competitive Equilibrium

We now give each individual in country a an initial endowment of capital
$(K_{11}^i, \ldots, K_{M1}^i)$ such that

$$\sum_i K_{k1}^i = K_{k1} \qquad \text{for all } k. \tag{9}$$

We also give each of these individuals a set of quasi concave production
functions for each period which read

$$C^{it} = C^{it}(K_{1Ct}^i, \ldots, K_{MCt}^i) \quad \text{and} \quad I^{ikt} = I^{ikt}(K_{1kt}^i, \ldots, K_{Mkt}^i) \quad \text{for every } k. \tag{10}$$

The aggregate functions described in (1) and (2) must then be derived by
maximizing aggregate output for given aggregates of inputs devoted to
the production of the output. That is,

$$C^t(K_{1Ct}, \ldots, K_{MCt}) = \max \sum_i C^{it}(K_{1Ct}^i, \ldots, K_{MCt}^i) \quad \text{subject to} \quad \sum_i K_{kCt}^i = K_{kCt},$$

and

$$I^{yt}(K_{1yt}, \ldots, K_{Myt}) = \max \sum_i I^{iyt}(K_{1yt}^i, \ldots, K_{Myt}^i) \quad \text{subject to} \quad \sum_i K_{kyt}^i = K_{kyt}, \qquad \text{all } k, y, \text{ and } t. \tag{11}$$

We assume that each individual may economically participate somewhat
in the production of each output (which will imply nonincreasing returns
to scale in the individual functions), so that the above maximizations
obviously occur when and only when

$$C_k^{it} = C_k^{jt} \; (= C_k^t), \qquad I_k^{iyt} = I_k^{jyt} \; (= I_k^{yt}) \qquad \text{for all } k, y, t, j, \text{ and } i. \tag{12}$$

Hence, if the equilibrium in the economy satisfies (12), it generates the
aggregate production functions in (1) and (2) given the constraints in (11).

We now introduce prices. Our prices are all initial-period,
unit-of-account prices; that is, they describe the amount of wealth one must
currently surrender in order to obtain delivery of a good at a specified
date. To obtain such prices from prices that would rule in actual
transactions in future periods, a suitable discount must be applied to the price in
the future to reflect the value of early payment in the form of initial
wealth. The price of capital of type k delivered in period t is written $P_{kt}$,
and the price of consumption goods delivered in period t is written $P_t$.
An individual is also taxed an amount whose present cost is given by $T^i$.




Each individual is assumed to choose $K_{kCt}^i$, $K_{kyt}^i$, and $B_t^i$ so as to
maximize $U^i(B_1^i, B_2^i, \ldots)$ subject to the production functions in (10) and
his budget,

$$T^i + \sum_t P_t B_t^i = \sum_k P_{k1} K_{k1}^i + \sum_t P_t C^{it} + \sum_t \sum_y P_{y,t+1} I^{iyt} - \sum_t \sum_k P_{kt}\Bigl(K_{kCt}^i + \sum_y K_{kyt}^i\Bigr). \tag{13}$$

The solutions represent a competitive equilibrium when prices are set⁵ so
that, for all t and k,

E Z/ = E E + E = ^*1. 

Ill y 

and (14) 

E (^‘kci.t + E ^lyi*,) = E 7“' 

I y I 


The Case of Lump-Sum Taxation 


When taxes are lump sum so that they do not vary with the individual’s
behavior, the individual utility-maximizing choices are seen to satisfy the
following marginal equalities:

$$\frac{C_k^{it}}{C_y^{it}} = \frac{I_k^{izt}}{I_y^{izt}} = \frac{P_{kt}}{P_{yt}}$$

and

$$\frac{\partial U^i/\partial B_t^i}{\partial U^i/\partial B_{t+1}^i} = \frac{C_y^{i,t+1} I_k^{iyt}}{C_k^{it}}, \qquad \text{all } k, y, z, t, \text{ and } i.$$

These conditions are inconsistent with the condition for Pareto
optimality in (8) except when $D_y^{t+1} = 0$ for every y and t, which is the
implausible special case in which the returns to aggression by the marginal
foreign aggressors are never affected by the size of the victim’s capital
stock.


The Pareto Optimality of a Competitive Equilibrium with 
Certain Capital Taxes 

We now assume that

$$T^i = \sum_t \sum_k a_{kt} P_{kt}\Bigl(K_{kCt}^i + \sum_y K_{kyt}^i\Bigr) + T^{0i}, \tag{15}$$


⁵ We assume that

hm inf (I I - ZEI - T') 

I-OP Vo / 

exists to assure finite solutions. Our prices are therefore “Malinvaud prices” (Malinvaud
1953).




where $a_{kt}$ is a constant present tax rate on capital of type k at date t and
$T^{0i}$ is a lump-sum tax or subsidy to the individual set so that $\sum_i T^i$
satisfies the government’s budget condition,

$$\sum_i T^i = \sum_t \sum_k P_{kt}(1 + a_{kt}) K_{kDt}. \tag{16}$$

Equation (16) reflects the fact that the capital tax is levied on sellers of
capital rather than buyers and that prices are the net prices to sellers. We
assume that the government minimizes costs using fixed-factor prices so
that

$$\frac{G_k^t}{G_y^t} = \frac{P_{kt}(1 + a_{kt})}{P_{yt}(1 + a_{yt})}. \tag{17}$$

Maximizing $U^i(\cdot)$ subject to (10), (13), and (15) yields the following
marginal conditions:

$$C_k^{it} = \frac{P_{kt}(1 + a_{kt})}{P_t} \quad \text{and} \quad I_k^{iyt} = \frac{P_{kt}(1 + a_{kt})}{P_{y,t+1}}, \tag{18}$$

$$\frac{C_k^{it}}{C_y^{it}} = \frac{I_k^{izt}}{I_y^{izt}} = \frac{G_k^t}{G_y^t} = \frac{P_{kt}(1 + a_{kt})}{P_{yt}(1 + a_{yt})}, \tag{19}$$

and

$$\frac{\partial U^i/\partial B_t^i}{\partial U^i/\partial B_{t+1}^i} = \frac{1}{1 + a_{y,t+1}} \cdot \frac{C_y^{i,t+1} I_k^{iyt}}{C_k^{it}} = \frac{P_t}{P_{t+1}}, \qquad \text{all } k, y, z, t, \text{ and } i. \tag{20}$$

Equation (18) satisfies the conditions in (12) so that (11) holds. We now
need only set

$$a_{y,t+1} = \frac{D_y^{t+1}}{G_y^{t+1} - D_y^{t+1}} \tag{21}$$

in order for (19) and (20), together with (9), (11), (14), (3'), and (4'), to
represent the same equation set as (7) and (8) together with (1), (2),
(3'), (4'), and (5); that is, in order for any competitive equilibrium with such
capital taxes to be a Pareto optimum.⁶
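
The tax rate in (21) can be checked in a few lines. The derivation below is a sketch under an assumed reading of the two conditions it compares: (8) as time productivity scaled by $1 - D_y^{t+1}/G_y^{t+1}$, and (20) as the same time productivity divided by $1 + a_{y,t+1}$. Both readings are reconstructions from the surrounding definitions rather than quotations.

```latex
% Sketch: the rate in (21) equates the taxed private condition (20)
% with the Pareto condition (8), under the readings stated above.
\begin{align*}
\frac{1}{1 + a_{y,t+1}}
  &= 1 - \frac{D_y^{t+1}}{G_y^{t+1}}
   = \frac{G_y^{t+1} - D_y^{t+1}}{G_y^{t+1}} \\
\Longrightarrow\quad
1 + a_{y,t+1} &= \frac{G_y^{t+1}}{G_y^{t+1} - D_y^{t+1}} \\
\Longrightarrow\quad
a_{y,t+1} &= \frac{D_y^{t+1}}{G_y^{t+1} - D_y^{t+1}} .
\end{align*}
```

As a purely illustrative case, a marginal defense requirement equal to one-tenth of the marginal defense productivity, $D_y^{t+1}/G_y^{t+1} = 0.1$, would give $a_{y,t+1} = 1/9$, or about 11 percent.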


⁶ Since we have not precluded decreasing marginal costs of defense with respect to
the protected capital, there may be several allocations satisfying (7) and (8) for given
utility levels of $N^a - 1$ individuals. And some of these allocations may be Pareto
nonoptimal. In such cases, however, we assume that the government picks tax rates ($a_{kt}$
and $T^{0i}$) which correspond to their shadow values in a global Pareto optimum. That
such taxes succeed in inducing global Pareto optima despite the possible economies of
scale in protecting capital is easily proved. First note that our quasi concavity,
nonincreasing returns, and bounded-wealth assumptions insure that all solutions to (19)
and (20), given (9), (11), (14), (3'), and (4'), are privately optimal. It follows that
we need only pick the levels of $a_{kt}$ and $T^{0i}$ that correspond to their shadow equivalents
in any specified Pareto optimum in order to have the privately selected allocations of
the equilibrium coincide with the Pareto optimum. For if this were not the case, then
one allocation satisfying (19) and (20), given (9), (11), (14), (3'), and (4'), would
make someone worse off than would another such allocation.




Note that no particular tax rate on capital in the initial period is
implied by optimal capital taxes. This is reasonable because such capital has
already been produced so that taxing it is equivalent to applying a
lump-sum tax. Nevertheless, we often adopt below the harmless procedure
of applying the optimal tax rate on future capital to capital in the initial
period. Note also that the optimal capital tax is equivalently a tax on the
value of the capital output.

It is easy to show, using (18), allowing the optimal capital tax rate to
apply in period 1, and assuming linear homogeneity of $D^t(\cdot)$ and $G^t(\cdot)$
with respect to their respective arguments, that $T^{0i} = 0$, so that optimal
capital taxes alone are just sufficient to finance government expenditures.
While we do not maintain these homogeneity assumptions in the paper,
the result indicates, to the extent that the homogeneities are roughly
plausible, a relatively minor role for lump-sum taxation or subsidization in
a world employing an optimal capital tax.


Specification of the Marginal Aggressors' Marginal Profit Functions

Since the marginal aggressors’ profits are kept at zero by the potential
victim’s defense effort, we have, differentiating (6),

$$\frac{d\pi_{At}}{dK_{kt}} = A_{kt} - \frac{dC(D^t)}{dD^t} D_k^t = 0. \tag{25}$$

Hence,

$$D_k^t = \frac{A_{kt}}{dC(D^t)/dD^t} > 0 \qquad \text{if } A_{kt} > 0. \tag{26}$$

We assume that for some subset of $(k) = (1, 2, \ldots, M)$, written $a(k) =
(1, 2, \ldots, M_a)$, $A_{kt} > 0$. We call any kind of capital in this subset a
part of the country’s “coveted capital.” For the rest of the capital stock,
$A_{kt} = 0$.

Equations (21) and (25) tell us that if $A_{k,t+1} = 0$, $a_{k,t+1} = 0$. That is,
if a particular kind of capital is not part of the country’s coveted capital,
the optimal tax on the capital is zero. Now we assume that for all x and y
in a(k),

$$\frac{A_{xt}}{A_{yt}} = \frac{P_{xt}}{P_{yt}}. \tag{27}$$

That is, the relative marginal values which foreign aggressors place on
different kinds of the country’s coveted capital are equal to the
corresponding relative values to the defending country. There are several
reasons that this is not a strictly justifiable assumption. It does, however,
serve to maintain reasonable orders of magnitude. A jet plane is a lot
more valuable than a light bulb, to the aggressor as well as the defender.
From (26) and (27),

$$\frac{P_{xt}}{P_{yt}} = \frac{D_x^t}{D_y^t} \qquad \text{for all } x, y \in a(k). \tag{28}$$

Hence, from (28), (19), and (21), optimal capital taxes are
nondiscriminatory; that is,

$$a_{xt} = a_{yt} \qquad \text{for all } x, y \in a(k). \tag{29}$$
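
The step to (29) can be spelled out as follows. This is a sketch, assuming (21) is read as $a = D/(G - D)$ for each kind of coveted capital and that the ratio of marginal defense productivities equals the ratio of tax-inclusive prices, as in the government's cost-minimization condition (17); the intermediate steps are a reconstruction, not the paper's own derivation.

```latex
% Sketch: from (17), (21), and (28) to (29), for x, y in a(k).
% From (21): 1 + a_{xt} = G_x^t / (G_x^t - D_x^t), and likewise for y.
\begin{align*}
\frac{G_x^t}{G_y^t}
  &= \frac{P_{xt}(1 + a_{xt})}{P_{yt}(1 + a_{yt})}
   = \frac{P_{xt}}{P_{yt}} \cdot \frac{G_x^t}{G_y^t}
     \cdot \frac{G_y^t - D_y^t}{G_x^t - D_x^t}
  && \text{by (17) and (21)} \\
\Longrightarrow\quad
\frac{G_x^t - D_x^t}{G_y^t - D_y^t}
  &= \frac{P_{xt}}{P_{yt}} = \frac{D_x^t}{D_y^t}
  && \text{by (28)} \\
\Longrightarrow\quad
\frac{G_x^t - D_x^t}{D_x^t}
  &= \frac{G_y^t - D_y^t}{D_y^t}
\quad\Longleftrightarrow\quad a_{xt} = a_{yt}.
\end{align*}
```

The last line uses the fact that, under this reading of (21), $(G - D)/D$ is the reciprocal of the tax rate, so equal reciprocals imply equal rates across all kinds of coveted capital.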


Problems in Implementation

We have as yet produced no model specific enough to indicate which
types of capital comprise a country’s coveted capital stock. Also, since it
is politically very costly to tax the value of capital in every period when
there are not transactions in the capital during every period, a problem
arises as to how one can create, if possible, a tax system which levies only
on transactions but which is still equivalent in effects to the idealized
system of optimal capital taxation described above. These problems of
implementation are the subject of Section II.


A Possible Generalization 

Admitting Marshallian joint production of consumption and investment
goods would open up the possibility that some units of produced capital
would be optimally surrendered in the future to a potential foreign
aggressor. To have determinate units of such capital, we would, in effect,
have to introduce different defense costs for different units of equally
valuable capital, thus violating (28) and giving rise to discriminatory
taxes. While these cases of “surrenderable capital” are excluded from our
formal analysis below, an informal discussion is provided at the end of
Section II.


II. Achieving an Optimal Capital Tax

The Transaction Structure and Income Taxes

We now allow our economy to have an explicit transaction structure, that is, a
particular set of trades between individuals which achieves the optimal
competitive equilibrium described above. Suppose each producer sells
his entire output, purchasing all of his inputs all over again for his
production in the following period. Then there would be no difficulty in
implementing the optimal capital tax. A simple income tax, a tax on all
producer sales, with tax exemptions granted for sales of the outputs of
noncoveted capital, would obviously be sufficient to produce an equivalent
to the optimal capital tax. However, this supposition is far from realistic;
in our model, producers may retain some of their capital output for their
own future use. To acquire an equivalence between an income tax and
the optimal capital tax then requires amendments to the simple income
tax besides exemptions for sales of outputs from noncoveted capital. The
income tax on the kth kind of capital of individual i is given by

$$T_k^i = \sum_t b_{kt}\bigl(P_t C_k^{it} K_{kt}^{im} - X_{kt}^i\bigr), \tag{30}$$

where $b_{kt}$ is the income tax rate, $X_{kt}^i$ represents deductions from the tax
base, and $K_{kt}^{im}$ represents the capital that i uses to produce goods for the
market in period t. By definition,

$$K_{kt}^i = K_{kt}^{in} + K_{kt}^{im}, \tag{31}$$

where $K_{kt}^{in}$ is the capital that individual i uses to produce goods which are
not sold in the market. The optimal income tax exists when, for each k
and i, $b_{kt}$ and $X_{kt}^i$ take on values that make (30) equivalent to (15) and
(21), or,

$$\sum_t b_{kt}\bigl(P_t C_k^{it} K_{kt}^{im} - X_{kt}^i\bigr) = \sum_t a_{kt} P_{kt} K_{kt}^i + T^{0i} \qquad \text{for all } K_{kt}^i, \tag{32}$$

or

$$\frac{dT_k^i}{dK_{kt}^i} = a_{kt} P_{kt}. \tag{33}$$

Solutions to (32) and (33) are written $(b_{kt}^*, X_{kt}^{i*})$.

Note that income taxes have an independence property expressed as

$$\sum_k T_k^i = T^i. \tag{34}$$


Noncoveted Capital 

Capital may generate benefits which apply to only certain, specified
individuals within a country; such capital is “people specific.” One’s
psychic capital (for example, acquired ability to appreciate nature and
various activities, stock of pleasant memories, acquired ability to entertain
himself) is part of the country’s people-specific capital stock. The rest
of a country’s “people-specific capital” is “friendship capital,” where a
specified individual (e.g., a husband) can command the services (e.g.,
good cooking) of another (e.g., a wife) because the other either feels a
sense of gratitude or indebtedness to the specified individual or has
confidence that the specified individual will reciprocate in the future without
an explicit agreement, the cost of which would preclude such favors
from having a positive net value to the recipient. People-specific capital
is of no value to individuals outside the country and is not part of a
country’s coveted capital stock.⁷ Since the returns on people-specific
capital do not generate market transactions, so that $K_{kt}^{im} = 0$ for
people-specific capital, an income tax operates in an optimal fashion by not taxing
people-specific capital.

While returns from friendship capital frequently come in the form of a
monetary payment in the cases of gift and charity income, such income
is, appropriately, substantially disregarded for tax purposes in the
United States.⁸ However, large gifts (including inheritances) frequently
only partially represent a payment for friendship services, with another
part representing a payment for services that a foreign aggressor could
acquire. Large gifts and inheritances should therefore be taxed to some
degree, with the tax rate increasing with the size of the gift or inheritance
and never exceeding the income tax rate. We do observe such taxes,
although we have no way of estimating optimal rate schedules to compare
to the actual schedules.

When an aggressor takes over a country, he does not benefit from human
capital to the extent that such capital is necessary for the “subsistence” of
the individual. An individual is below “subsistence” if he would sooner
die in rebellion than pay the taxes of the successful foreign aggressor.
One’s subsistence is comprised of normal support for his family and any
abnormal, noncosmetic, family medical expenses. Hence, one’s human
capital below that required for his family’s subsistence is not part of his
country’s coveted capital stock. As a result, an optimal income tax has an
exemption on incomes from human capital required for normal
family subsistence of market goods and for abnormal, noncosmetic
medical expenses. The observed U.S. income tax exemption of about
$750 per person appears to approximate the normal subsistence level
fairly well, while the observed write-off for abnormal, noncosmetic
medical expenses corresponds nicely to the treatment of these expenses
under an optimal income tax.

It is likely that a modern foreign aggressor would support those citizens
of an acquired country who were unable to produce their own subsistence.
Assuming this to be the case, any increase in the assistance requirement of
below-subsistence individuals in a defending country has the same external
social product as any other decrease in the coveted capital stock of equal
value. The decrease in the coveted capital stock in this case appears as an
increase in expenditures on charity. Hence, a receipt of charity is not only
a return on noncoveted capital, it is also a reward to individuals who
produce less than their subsistence for reducing their country’s stock of
coveted capital. So expenditures on charity should be treated for income
tax purposes as any other expenditure which reduces the capital stock of
the country. In fact, charitable expenditures in the United States are
treated substantially the same as business expenditures for income tax
purposes.⁹

⁷ It should be pointed out that our assertion that people-specific capital is noncoveted
rests on an assumption that a successful foreign aggressor cannot substantially switch
this kind of capital into the production of benefits which are not people specific. Thus,
it is assumed that the human capital that an individual devotes to producing benefits
for himself (i.e., producing “leisure”) cannot be substantially converted to the production
of goods for the aggressor. Our foreign aggressors therefore do not make slaves out of
their victims; they merely tax them to subsistence. This implication appears to be fairly
realistic. The behavioral basis of it appears to be that an individual acquires certain
work-leisure habits, which cannot be substantially broken at any reasonable cost by the
foreign aggressor.

⁸ A gift which does not represent a payment for a particular service but a mere
redistribution of wealth based upon directly interdependent utilities, while not contained
in our formal model, is clearly a mere transfer and should not be taxed.

Certain consumer durables such as furniture, portraits, trophies, and
certain antiques are also part of the country’s noncoveted capital stock.
While there are no taxes on these goods as $K_{kt}^{im} = 0$, consumer durables
which are part of the coveted capital stock are discussed below.

Cash, that is, paper currency, may exist in the economy with explicit
transactions as an intermediate asset which allows for the achievement of
the no-transactions-cost economy described in Section I. However, with
costless currency creation, a successful foreign aggressor can also
costlessly create any feasible level of real cash balances by altering the rate of
growth of the currency supply. Hence, the level of real cash balances used
by the defending country is irrelevant to the aggressors and not part of a
country’s coveted capital stock.

The definitions of capital which Marshall (1925), Knight (1935), and
Smith (1937) inferred from discussions of men of affairs excluded
people-specific capital, subsistence-producing capital, certain personal consumer
durables, and paper money. Using our theory, we can rationalize the
exclusion of these forms of capital from the concept of capital used by men
of affairs by arguing that these men are only discussing coveted capital
because it is the only capital which should be taxed. The only remaining
types of private-good capital in the Smith-Marshall-Knight taxonomies,
the types comprising the capital stock as seen by men of affairs, are
natural resources, produced producer durables, human capital above
subsistence requirements, and the remaining consumer durables. These
forms will be assumed to comprise the country’s coveted capital stock,
and the achievement of an optimal tax on each of these forms of capital
is examined separately below.

⁹ Note, however, that a standard-type competitive model such as that
developed above applies only to cases of private-good charity: cases in
which each individual enjoys his giving charity rather than enjoying the
beneficiary’s receipt of the charity (i.e., in which there are no external
economies in the giving of charity) and in which an individual can
costlessly avoid the knowledge of another's suffering (i.e., in which there
are no external diseconomies in the creation of charity-inducing
attributes). Including both of these excluded realistic cases would
probably have little total effect on the tax treatment of charity because
their separate effects are substantially canceling.




Producer Durables Which Are Originally Produced for Sale

No tax on the gross income from capital which is not sold in every period
is equivalent to the optimal tax. The derivative of taxes with respect to
Kj„ is positive in (33), and, in view of (31), is zero in (30) for fixed Kj^ if
X‘ia = 0. So, for optimal taxation, ^ 0. To achieve the effects of an
optimal capital tax, we wish to consider a tax on k's gross income reduced
by an estimate of its depreciation.

Depreciation, however, is generally estimable only for a good which is
originally produced for sale, meaning that whenever a good other than
itself is used to produce it, it is marketed. Without loss of generality, we
assume that when a producer good that is originally produced for sale
(labelled e for equipment) is not used to produce current marketed output,
it is used to produce only itself in the following period. That is,

Letting lj ^-. represent realistic depreciation of the original purchased
capital in terms of current consumer goods, (30) becomes

Tl‘ = Z b„[P,Ci{K‘, - K-,) - - n'K,)] 


Using (18) this becomes 

(35) 

t 


Hence, to obtain an optimal net income tax on e in time t, in view of (32) 
and (35), we need only pick a which satisfies 


b 


el 




p,c: 


rl + l 


(36) 


Using (18), this optimal income tax rate becomes 


b 


0 

ft 


1 + < 


n' -1 


(37) 


From (20) and (18) we can see that when there is a stationary solution
such that a_t^0 = a_{t+1}^0 and C_t^e = C_{t+1}^e,

    b_{et} = \frac{a_t^0 (1 + \rho_t)}{(1 + \rho_t)(1 + a_t^0) - 1},    (38)


where 1 + p, = (5f//55,)/df//dJ?,4,), the marginal rate of time 
preference. 

Thus, the use of realistic, physical depreciation allowances converts
the inefficient income tax into an efficient one under an appropriate
income tax rate.¹⁰


¹⁰ In the real world, depreciation allowances are typically granted on a
fixed schedule for a particular kind of capital good regardless of how the
good actually wears out. But, also in the real world, depreciation
typically takes the form of Marshallian joint production of marketed output
and future capital. Under such depreciation, the optimal tax on a new
investment under stationary conditions is a single tax on the present value
of the capital in each future period resulting from the investment, while
an income tax without a depreciation allowance is a tax only on the initial
capital value. It is easy to see by an argument similar to that used above
that a realistic depreciation allowance converts the income tax into a tax
on the present value of the future capital values implied by an investment
and thus leads to an optimal choice of investments in a world in which
actual depreciation takes the form of Marshallian joint production of
marketed output and future capital for the firm.

There are real-world cases of physical depreciation of producer durables
which have been produced for sale in which the depreciation does not take
the form of Marshallian joint production. This appears for originally
produced timber, wine, and various agricultural products. Here, we observe
in the United States the variable depreciation allowances of the kind which
our formal model describes; these are frequently called “cost depletion
allowances.”

The only other existing rationalization of the realistic depreciation
write-off this author has seen is by Samuelson (1964). Samuelson shows
that a write-off on income taxes of all forms of interest expense will create
a tax structure with no effect on prices only if the tax on income includes a
write-down for realistic depreciation. However, no tax system in existence
subsidizes interest expenses and taxes net receipts so as to have offsetting
revenue and incentive effects. Such a system would be a lot of trouble for
no purpose. The tax system Samuelson claims to represent is a United
States-like system, a system containing a tax on interest income as well as
a corresponding write-off of interest expenses. But in such a system there
is no net tax or subsidy on borrowing or lending, nor should there be in our
optimality model because borrowing represents a mere redistribution of
purchasing power rather than the creation of any real asset. Samuelson
erred by failing to allow the after-tax contractual rate of interest to rise
to reflect the equal shifts up in the interest-demand price for loans and the
interest-supply price of loans resulting, respectively, from the tax
write-off of interest expenses and the tax on interest income. Once gross
market rates of interest are raised to reflect the tax on interest income,
the reduction of the gross market rate by applying the tax write-off on
interest expenses (or income) to obtain the after-tax borrowing (or lending)
rate relevant for discounting only serves to pull the discount rate back
down to the original real interest rate for the original allocation of real
resources. And with no reduction in the discount rate in Samuelson’s model,
no positive income tax rate satisfies Samuelson’s price invariance
condition, whether or not there are depreciation allowances.


Natural Resources

Natural resources (i.e., minerals and oil and gas) pose a different problem
in the lack of transactions to correspond to each act of production. Like
producer goods which are purchased upon their original creation, natural
resources are accumulated by the owner without any corresponding
transaction. But unlike such producer goods, there can be no transaction





corresponding to original creation for a natural resource. Because tax
collectors cannot be assumed to know the value of the natural resource at
any given date in the past, they cannot be assumed to know the change
in its value over time. However, once a natural resource is utilized by
converting it into some other good by an act of withdrawal from nature,
there is, we assume, always a sale of the withdrawn resource. Hence, the
obvious method of achieving the effects of a tax on the accumulation of
natural resources is to apply taxes at withdrawal in a way which
subsidizes early withdrawal. The ordinary income tax does not do what it
may appear to do (tax early withdrawal) because the profit from
withdrawal increases with the rate of interest so that delaying withdrawal
merely increases future taxes by the rate of interest and has no tax-saving
or tax-increasing effect in a world with a constant income tax rate.
Similarly, a subsidy to net withdrawal income would not encourage
early withdrawal. We can specify a tax or subsidy on transactions which
will encourage early withdrawal only after specifying some special,
technological features of the natural resource industry.
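
The timing neutrality of a constant-rate income tax can be checked with a small numerical sketch. The function name and the figures used (an initial withdrawal profit of 100, a 10 percent interest rate, a 25 percent tax rate) are illustrative assumptions, not values taken from the argument above. If the profit from withdrawal grows at the rate of interest, the present value of the tax is the same at every withdrawal date; a negative rate (a subsidy) behaves the same way.

```python
# Illustrative sketch only: the names and numbers here are assumptions.

def present_value_of_tax(delay_years, profit_now=100.0, interest=0.10, tax_rate=0.25):
    """Present value of an income tax paid on withdrawal profit after a delay.

    Assumes the profit from withdrawal grows at the rate of interest,
    which is the condition the text describes.
    """
    growth = (1.0 + interest) ** delay_years
    tax_at_withdrawal = tax_rate * profit_now * growth
    return tax_at_withdrawal / growth  # discount back to today


if __name__ == "__main__":
    for delay in (0, 1, 5, 20):
        # The same present value appears for every delay.
        print(delay, round(present_value_of_tax(delay), 6))
```

Because the growth factor and the discount factor cancel, delaying withdrawal neither saves nor adds tax in present-value terms, which is why a special tax or subsidy at withdrawal is needed to influence timing.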

In particular, (a) in producing a natural resource for next year, the 
only input one uses is the same resource in the current year, and (b) the 
amount of the natural resource produced is identical to the amount of 
the resource devoted to its production. That is, 

s K‘^, (39) 

where n is the natural resource. From this and (18), in an optimum,

+ O = ^-..1 = P.C'n (40) 

Similarly, the same physical units of natural resources become consumable
once they are withdrawn so that, letting Q_t^i be i’s withdrawal of natural
resources in time t, P_{Qt} Q_t^i represents the consumption value of the
withdrawn resource. It follows that

Ppi ~ Pii«(^ + ~ Pfir ~ Pm + i 

is the direct withdrawal cost per unit of the resource, assuming linear
homogeneous withdrawal functions.¹¹ Thus,

p.c; = - (Pc, - p.,^i) = p.,^, 

¹¹ Thus, an implicit production function for a withdrawer of natural
resources can be written

= min Ki_,^ K', 

where Q is the withdrawn resource. If we select “derivatives” of this function to be such
that (6) and (7) are satisfied, as we are free to do, this specification is not inconsistent
with our general model. The cost of Q is the “withdrawal cost” described above, while

“ P.C- 





as in (40) above. Finally, there is the observation that the spot price of
withdrawn natural resources has not substantially changed over time
(see Barnett and Morse 1963). Thus we can write Pq, = P, and
Q{ =• C} KIq, for a withdrawer of a natural resource.

With a constant spot price of the withdrawn natural resource, an obvious
encouragement to natural resource exploitation (when there is a
positive marginal rate of time preference) is a subsidy which is a fixed
percentage of the revenue from the sales of the withdrawn natural
resource. For the producer would rather have a given subsidy this year than
next. Thus we set

= XP^ (41) 

The \lambda is the “percentage depletion rate.” Hence,

Tl‘ = E b„[P,Cl{K\, - JCiL) - XP.iK - IC)] (42) 


Using (39) and (40), 

Ti‘ = E 

f 

Assuming l>„, = 4, and using (40) and (19), 

= 6,(p.2 - 

I 

Tt = K{Pn2 - AF,)AiT + 


(43) 


or 


(44) 


(45) 


Hence, for an optimal tax, letting the first lump-sum term in (45) be
absorbed in T®* and using (32),




<+i + 


A'p, P, 


(I + p,) p. 


t:) 


(46) 


¹² The traditional, Harberger (1955) analysis of such percentage depletion allowances
assumes that manufacturing and oil “investments” should be taxed equally if they
generate the same streams of cash income. It fails to recognize that if the oil “investment”
is not undertaken, there is still an accumulation of oil reserves, which is a true social
investment. Therefore, it is necessary to net the disinvestment of oil reserves out of
Harberger’s oil “investment” before taxes should be equated on his equal investments.
The Harberger study should also be corrected for the fact that a depletion allowance is
capitalized in the value of the land, thus serving to increase the costs as well as the returns
to current oil “investments.” Making these adjustments in Harberger’s analysis and
making intertemporal investment possibilities and taxes explicit leads to our own analysis.





Assuming that the income tax rate is that optimal rate applied to
equipment, and again assuming that a_{t+1}^0 = a_t^0, we find, substituting
(38) into (46), that

A* = (47) 

P, 

Hence, the optimal percentage depletion allowance is the ratio of the
market value of the conserved natural resource to the value of the product
obtained by currently exploiting the resource.

The most reliable data on natural resources we have cover the oil
industry. Here, the ratio of mineral right value to output value has been
relatively constant at about 23 percent (see Rooney [1965]). This mineral
right value is obtained by adding amortized oil lease payments to
royalties. The current U.S. percentage depletion allowance is 22 percent.

There is thus a close correspondence in this industry between the optimal
and actual percentage depletion allowances. While precise data are
not available, there is also a correspondence between the low 5 percent
depletion allowances given to producers of gravel, peat, pumice, shale,
and stone and the obviously low value of mineral rights to these natural
resources relative to the prices of the withdrawn resources to consumers.
And in the extreme case in which mineral rights are essentially free (such
as for soil, dirt, moss, minerals from sea water, and air) there is a zero
percentage depletion allowance.

We now estimate the optimal allowance for the minerals industry in
the aggregate. Note first that because of the constancy over time of the
spot price of withdrawn natural resources, spot withdrawal costs must fall
over time so as to make the spot price of a natural resource rise at the
productive rate of interest. In particular, the percentage reduction in spot
withdrawal costs over time times withdrawal costs relative to the spot
price of a particular natural resource at withdrawal must equal the rate
of interest. Therefore, the price of a natural resource at withdrawal
relative to the withdrawn resource equals the capitalized rate of cost
reduction divided by one plus this capitalized rate. Since the rate of
decrease of withdrawal costs in the minerals industry is about 2 percent


¹³ Oil producers are also allowed to write off most of their capital expenses (their
“intangible drilling expenses”) as current expenditures rather than depreciating them
over the life of the producing wells. This write-off can be justified by the “national
defense” argument mentioned in fn. 4, that is, the efficient price controls that are applied
during a war imply an insufficient amount of domestic capacity for wartime in the
absence of special subsidies. Based upon the frequency of past wars and the extent of
wartime price controls on oil, a conservative estimate of the optimal subsidy to domestic
drilling is 20 percent of the peacetime price of crude oil (see Thompson [1968]). The
value of the write-off of intangible drilling expenses can be estimated at around 17 percent
of the normal price of crude oil (see Rooney [1965]). The extension of the formal model
to admit the possibility of future, successful, defensive wars for a given country is
straightforward.





per annum (Barnett and Morse 1963), and we are using a productive
interest rate of 10 percent, the price of natural resources at withdrawal
relative to withdrawn resource price over all valuable minerals is estimated
to be 16.67 percent. In fact, the bulk of the statutory depletion allowances
fall between 14 and 22 percent (with effective rates slightly lower because
of a limitation of the allowance to 50 percent of the net income of the
taxpayer).
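
The 16.67 percent estimate can be reproduced from the two steps just described. In this minimal sketch the function and variable names are illustrative assumptions; the inputs are the 2 percent rate of cost reduction and the 10 percent interest rate quoted in the text. The capitalized rate of cost reduction is the cost-decline rate divided by the interest rate, and the in-ground price relative to the withdrawn-resource price is that capitalized rate divided by one plus itself.

```python
# Illustrative sketch: names are assumptions; inputs are the rates quoted in the text.

def in_ground_price_share(cost_decline_rate, interest_rate):
    """Price of the resource at withdrawal relative to the withdrawn-resource price."""
    capitalized_rate = cost_decline_rate / interest_rate
    return capitalized_rate / (1.0 + capitalized_rate)


share = in_ground_price_share(cost_decline_rate=0.02, interest_rate=0.10)
print(f"{100 * share:.2f} percent")  # 16.67 percent
```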

The observed spot price of natural resources relative to withdrawn
resources may be constant over time despite its increase for a given natural
resource because the quality of the resources exploited may decrease over
time. For example, the ratio of oil royalties to the value of the withdrawn
oil, a well-known empirical constant, could never have remained constant
had not the more easily withdrawable oil deposits been tapped at the
earlier dates.


Consumer Durables 

Consumer durables, like producer durables, are sold when they are
originally created but, unlike producer durables, do not create future
benefits for others. There is therefore no “income” from consumer
durables to tax. Hence, an excise tax on the production of consumer durable
goods is in order. Using again an interest rate of 10 percent, equation (37),
and an optimal income tax rate of approximately 25 percent, we have an
optimal capital tax rate of 2.5 percent. Therefore, consumer goods lasting
5 years and depreciating in a sum-of-years-digits fashion should be taxed
at an initial excise tax rate of

    0.025 \left( 1 + \frac{10}{15(1.1)} + \frac{6}{15(1.1)^2} + \frac{3}{15(1.1)^3} + \frac{1}{15(1.1)^4} \right)

= 5.35 percent,

and consumer goods lasting 15 years should be taxed at 10.29 percent
using these assumptions. Until very recently, U.S. federal excise taxes on
consumer durables ranged between 5 percent and 10 percent with the
lower rates generally applying to the relatively short-lived goods.
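
The excise-rate arithmetic can be sketched as the present value of a 2.5 percent annual capital tax on the good's remaining value. The reading used here is an assumption: sum-of-years-digits depreciation, the tax levied at the start of each year, and discounting at 10 percent. The function names are illustrative. On this reading the rates come out at roughly 5.3 percent for a 5-year good and roughly 10.6 percent for a 15-year good, close to but not identical with the figures quoted above, so the original calculation may have used a slightly different timing or rounding convention.

```python
# Illustrative sketch: the timing convention and all names are assumptions.

def remaining_value_fractions(life_years):
    """Fraction of original value left at the start of each year of life
    under sum-of-years-digits depreciation (year 0 is the new good)."""
    digits_sum = life_years * (life_years + 1) // 2
    fractions = []
    remaining = digits_sum
    for year in range(life_years):
        fractions.append(remaining / digits_sum)
        remaining -= life_years - year  # depreciation "digit" for this year
    return fractions


def initial_excise_rate(life_years, capital_tax_rate=0.025, interest=0.10):
    """Present value, per unit of purchase price, of the annual capital tax."""
    return capital_tax_rate * sum(
        fraction / (1.0 + interest) ** year
        for year, fraction in enumerate(remaining_value_fractions(life_years))
    )


if __name__ == "__main__":
    for life in (5, 15):
        print(life, f"{100 * initial_excise_rate(life):.2f} percent")
```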

For the various excise tax rates, see Commerce Clearing House (1957, Federal Taxes,
Subtitle D). For relative depreciation rates, see Prentice-Hall (1971, par. 15,000). As
excise tax rates have recently been volatile, there is no simple method of evaluating the
post-1964 excise-tax structure. Some federal excise taxes fall substantially on producer
durables, the most notable of which are “business” machines (such as typewriters and
computers) and cars and trucks. The business machine case is fairly easy to understand
once it is recognized that the sellers of the more expensive machines normally avoid
capital taxation by renting their outputs. The less expensive machines are frequently
used by consumers, so that an excise tax is in order. The same applies to cars and trucks.
However, businessmen should be allowed to expense their purchases of this already-taxed
equipment.





An important consumer durable that is not federally taxed in the
United States is an individual’s home. And apartment building
depreciation write-offs are so generous that, in view of the ease of
transferring these buildings, there are also no substantial federal taxes on
these consumer durables. But local property taxes seem to compensate for
these apparent inefficiencies as effective property tax rates typically are
about 2 percent per year, which is close to the federal rate on the other
durables treated above. My guess is that the cumbersome local property tax
and the provision of free education to minors is somehow required by the
federal government before a locality can exercise local police power or
float tax-exempt bonds. In this view, the locality is merely an adjunct of
the federal government. The chief reason for this suspicion is that it is
not plausible that freely competing localities would offer free education
to a potentially mobile, heterogeneous populace or would use the property
tax as a means for financing it. Yet the proper education of minors is an
activity whose federal subsidization is easy to support, not as a collective
good but as a good falling within our model which would be privately
underproduced without governmental subsidy because the private decision
makers (parents) are not entitled to the increase in future productivity
which their decisions (education for their minor children) create.¹⁵ We
shall employ still another implication of the lack of appropriate parental
rewards in the following section. While it is fairly obvious that the
inefficient parental reward structure serves to rationalize special laws
against polygamy, prostitution, divorce, child labor, the minimum wage
for teenagers, and social security (which removes a parent’s inefficient
incentive to instill feelings of guilt in his children in order to elicit their
financial support after his retirement), the traditional, inappropriate
theory of public finance has buried the important effects of an
inappropriate parental reward structure on the efficiency of the tax system
in a sea of imagined inefficiencies.

Although local property taxes in the United States are also observed to
fall on the plants of small businessmen, these capital taxes are roughly
offset by the national investment credit. And while larger, corporate
enterprises are normally able to bargain tax breaks out of small localities


¹⁵ Parents may capture part of the increase in productivity that their decisions create
by (inefficiently) training their children to feel sufficiently indebted to transfer some future
income to their parents, but even then the parent generally has a bias toward educating
the child in ways which make it relatively easy to capture returns. So privately provided
education would generally surrender too much of the child’s leisure to “character building,”
which makes the children feel indebted to the parents, and would overly train the child
toward occupations with relatively low nonpecuniary rewards.


However, parent* who have anfficiait tsbil^ fli§lheir childiKl'* 
their childrm unoonditioaal gits or mhoitaiicei, will imdee weienr 
decistont (aee Rubier [1974]}. Thu*, public educaUon need not accomnp^drte the rich 
who are fcnerally observed to make some unconditional gifts to their 






in some form or other and therefore effectively avoid local property
taxes on their plants, we will find below a substantial reason for applying
the current investment credit to U.S. corporations.


Human Capital


“Human capital” as used below is coveted human capital, or “skill,” that
part of one’s human capital stock which he uses to produce goods for the
market (or to produce future skill) in value exceeding his subsistence.
Newly created skills are reproduced in each future period through the
worker’s taking care of himself and making any necessary expenditures to
retain his skill, the latter being treated as the former for tax purposes by
granting it a write-off as a current expense.

Since skill carries its own maintenance out of what would otherwise not
have been coveted capital, its value to the aggressor in each period is the
present value of its entire future product. Thus, optimal capital taxes on
i’s skill amount to


^ a‘K)^ ^ Pfil 


1 + 


(47) 


On the other hand, income taxes are 

7-? = E m 

t 


We assume a stationary optimum so that a_t^0 = a^0, C_t = C, and \rho_t = \rho.
We consider an accumulation of a durable skill from the time v onward
equal to \Delta K_v, where v > 1.

First consider the case in which the accumulation in v is not the result 
of foregoing income from skill so that = 0 for all t Then, 


^ a®P„_,C.AAUl + p) 
• (1 + a')p^ 

and 

^ AAU 

* P 

where b„ = b. 

Hence, equating A T"** and A 


(49) 

(50) 


    b = \frac{a^0 (1 + \rho)}{(1 + a^0)\rho}    (51)


¹⁶ In a case in which produced skill is maintained out of that skill itself, eq. (47) would,
of course, take the standard form of eq. (15) above. It is easy to verify that our results
below also hold to a close approximation for that case. (One need only note below that
in such a case JC^(l + ^ fori > e so that [50] and [53] are correspondingly
reduced to approximately match the reduction in [49].)





Tax write-offs for expenditures on education, job search, and
worker-owned equipment would substantially subvert this efficient income
tax. The optimal income tax in (51) is only slightly different from the
optimal income tax on purchased producer durables.
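
As a rough numerical check on equation (51), read here (following the expression restated just before equation (55)) as b = a^0(1 + \rho)/[\rho(1 + a^0)], the sketch below plugs in the 2.5 percent capital tax rate and the 10 percent time-preference rate used elsewhere in the text. The function name is an illustrative assumption. The result is about 27 percent, in the neighborhood of the approximately 25 percent income tax rate cited for producer durables.

```python
# Illustrative sketch: the name is an assumption; the formula is the reading of (51) stated above.

def skill_income_tax_rate(capital_tax_rate, time_preference):
    """Income tax rate on durable skill equivalent to a given capital tax rate."""
    a0, rho = capital_tax_rate, time_preference
    return a0 * (1.0 + rho) / (rho * (1.0 + a0))


rate = skill_income_tax_rate(capital_tax_rate=0.025, time_preference=0.10)
print(f"{100 * rate:.1f} percent")
```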

Now consider the case in which the initial accumulation of a durable
skill is accomplished by foregoing income from current skill and keeping
leisure time constant. Here, (49) remains the same, but


AT-J' = A/r:. 


= — 




1 -f p 


(52) 


V b„P,C, AKl, 


Assuming b_t = b_v for t > v,

Arj' = -i. 


AK‘,, , , AK>, 


I + p + a 


(53) 


Using (49), for optimal taxation, we have

+ p) ^ 40 _ Kv-iP 
(1 + a'>)p " \ + p + a° 


(54) 


Equation (54) states that an optimal income tax is progressive, as the
marginal tax rate applying to increases in future wages must exceed
the average tax rate which one avoids by training rather than
working in the (v - 1)st period. For individuals whose marginal
investments in human capital appear at an average level of foregone
earnings, we use (51) and set the average tax rate equal to
a^0(1 + \rho)/[\rho(1 + a^0)] so that (54) can be written as


= bl. 


(1 + — ^—] 

\ I + p + aj 


(55) 


Using 10 percent time-preference rates, year investment periods, and
2.5 percent optimal capital tax rates, we can fit (55) to actual data on
U.S. tax rates.¹⁷ The result is that the actual U.S. marginal income tax
rates are within one percentage point of the optimal rates for all
approximately average levels of foregone annual income (those between
$5,000 and $10,000).

The fit is again very close. The actual U.S. marginal income tax rates
are always within 2 percentage points of these theoretically optimal rates
for all reasonable levels of human capital investment (i.e., for all levels of
foregone annual incomes of $25,000 or less). Equation (55) implies higher
tax rates on the leisure-produced accumulation of human capital of

¹⁷ Future tax rates were computed assuming the individuals will be married, while
current rates were computed assuming the investor is single (Lasser 1971).





individuals with above-average foregone earnings on their marginal
investments and lower tax rates on individuals with lower than average
foregone earnings. But it is not implausible that the truly optimal tax
rate on leisure-produced skill rises with income rather than remaining at
the constant expressed in (51) because the excess of the foregone earnings
of an individual over average foregone earnings is a plausible measure of
the extent of his parental overtraining to surrender leisure (as the reward
structure given parents induces them to instill greater lifetime estimates of
the value of work relative to leisure in their young children than they
would if they could collect as much of their child’s leisure benefits as they
can the child’s work benefits), while the deficit of an individual’s foregone
earnings below average foregone earnings is a plausible measure of an
individual’s parental undertraining concerning the value of work and
investing in human capital relative to consuming leisure. So we assume,
rather crudely, that the optimal tax rates on leisure-produced skill are
approximated by the degree of progression implied by the successive
application of (55) across all levels of income. Then, since the analysis of
(49)-(54) with proportionately adjusted optimal tax rates reveals that
(55) still describes the optimal taxation of skill accumulation by foregoing
current income, (55) describes an optimal degree of progression over all
levels of income.

The above analysis applied to accumulating durable skill so that
depreciation could be ignored. This appropriately describes most
education and training, as most education and training is undertaken by
young people who are going to use their training for very many years.
However, some training is undertaken by older people and some training of
youth is for short-term or risky careers. In these cases, the above tax on
human capital accumulation is too high. Some sort of subsidy for retraining
or for training in occupations with short durations is called for. A
depreciation allowance on human capital investments would be insufficient.
For since the wage increases required to justify a short-term human capital
investment are greater than those required to justify a durable investment,
the progressive income tax effects a greater tax rate on short-term human
capital investments than on long-term investments. We do observe a tax
break for risky and short-term human capital investments in the form of
an “income averaging” opportunity, but we have made no attempt to
quantify the effects of this tax provision.


Producer Durables Used by the Original Producer

An original producer of producer goods may avoid the taxation of his
produced capital by writing off its cost of original production as a current
expense and then using it himself for future production. The difficult
problem of the government is to detect these activities and to apply



special taxes whenever such activities occur. We now let the progressive
income tax extend over from human to nonhuman capital but also apply
only half of the ordinary income tax rate to “capital gains” income,
income from the sales of the outputs of producer goods that can be used
by the original producers of the goods. The most wealthy individuals are
thereby drawn into industries in which produced producer goods can be
used by the original producer, thus driving the less wealthy out of these
activities. The wealthy are induced by the tax break to identify the capital
gains situations, those where they have produced their own producer goods,
and the government may now treat the previous expenses of producing
this capital as capital expenditures, still leaving the wealthy better off
than with ordinary-income treatment of both expenditures and receipts
and with a higher value on the entire enterprise than the less wealthy
individuals.¹⁸ The resulting tax on producer goods used by the original
producer is therefore about half as large as the highest tax bracket,
reduced by about 25 percent, when, as in the United States, the capital
gains tax is collected at the time of sale rather than as the gains accrue.
The highest tax bracket in the United States is currently 70 percent, which
puts the effective U.S. tax rate on the income from producer goods used
by the original producer at about its optimal level of 25 percent.
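
The arithmetic behind the "about 25 percent" figure is short enough to spell out. In this sketch the function and parameter names are illustrative assumptions; the three inputs are the figures given in the paragraph above (a 70 percent top bracket, half-rate treatment of capital gains, and a reduction of about 25 percent from collecting the tax at sale rather than on accrual).

```python
# Illustrative sketch: names are assumptions; inputs are the figures quoted in the text.

def effective_capital_gains_rate(top_bracket=0.70, gains_share=0.5, deferral_reduction=0.25):
    """Effective rate: half the top bracket, reduced for taxation at realization."""
    return top_bracket * gains_share * (1.0 - deferral_reduction)


print(f"{100 * effective_capital_gains_rate():.2f} percent")
```

The product is a little over 26 percent, which is the sense in which the effective rate sits at about the 25 percent level.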

However, the above procedure efficiently taxes only the producer’s
originally produced capital that he uses for himself. It does not tax the
internal accumulation of the successive capital outputs of the originally
produced capital. We have already seen in the case of human capital that
in order to tax capital accumulated by foregoing current income, some
sort of progressive tax schedule is required. In the present case the relevant
tax brackets are very high, so that significant progression is no longer
possible. One solution to this problem, which is informationally
impractical, is to tax capital gains as they accrue, at about one-third of the
ordinary income tax rate. More practically, we may tax realized capital
gains at tax rates which increase to the date of realization with the real
rate of interest, starting from one-third of the ordinary income tax rate.
However, this latter solution would remove the present discouragement
to transact in the type of capital covered by capital gains taxation. (This
discouragement is also reflected in the harsh tax treatment of capital
losses and realized short-term capital gains.) A rational basis for this
discouragement, although not contained in the current model, is that there
is a redistributional element in most transactions involving capital gains,
one in which the buyer is more optimistic than the seller regarding the
future value of the goods. The tax discouragement therefore serves to
¹⁸ When the government fails to treat the previous expenses of the investor as capital
expenditures, an inefficient “tax haven” exists, such as has recently been the case in
the United States in cattle, timber, and crop development. The U.S. government
ordinarily shuts down these tax havens, although rather slowly.





save the socially wasteful transaction costs involved in these transactions.
Thus, we have no practical way clearly to improve the capital taxation
of the capital output of those producer goods which an individual
produces for his own use.

There are numerous cases in which capital gains treatment cannot be
afforded to the sale of the goods produced by capital that the producer
produces for himself. These occur when there is a continual stream of
marketed outputs and expenditures over time, so that it becomes difficult,
if not impossible, to attribute any measure of marketed outputs to the
inputs that produce them. Thus, for example, a producer's maintenance
expenditures are treated not as capital costs but as current business
expenses, while the returns from such expenditures are treated not as
capital gains but as ordinary income when the final products are marketed.
In order to avoid the application of the high progressive tax rates to the
nonhuman capital of individuals with continual streams of marketed
outputs and expenditures on such capital, we allow these individuals to
incorporate their nonhuman capital, taking their returns from the
corporation in a form in which they pay only about a 10 percent tax rate.¹⁹
Our problem is then to specify an efficient tax on corporations in light of
their ability to produce their own capital. The simplest policy is to tax
the value of the company (or the value of its stock, still taxing the interest
income of the creditors) at the efficient, 2-3 percent capital tax rate and
drop the tax on dividends and capital gains. Such a policy would directly
tax any capital in the company, regardless of how it is accumulated.
The costly implementation of realistic depreciation and efficient depletion
allowances would be avoided, as would the taxation of dividends, capital
gains, and corporate profits. But we assume that such a policy is not
available, or, if it is, that corporations remain whose stock does not trade
at observed prices. The problem then is to specify an efficient corporation
income tax in view of the corporation’s ability to produce some of its own
capital. Now purchased capital is recorded on the books of a company as
an asset, but internally produced capital which the company expenses is
not included on the books as an asset. However, this latter kind of capital
is included in the market evaluation of the company so that the depreciated
stock of externally purchased assets in a corporation relative to its total
capital stock can be represented by the ratio of the company's book value
to its market value. We assume this is constant within each corporation.
‘ ’ Partial dividend exclusions and taxes on realized capital gains at about half the tax rate applicable to ordinary income seem to achieve about this effective rate for the typical investor. The number may seem a little low, but it reflects the significant advantages to delaying realized gains, and giving charity and bequests in the form of appreciated stock.

Letting $B_f/K_f$ represent the ratio of book to market value of the assets of the firm, replacing K'„ in (35) with using (32) and (36), and adjusting for a 10 percent effective tax on dividends and capital gains, the optimal corporation profits tax is



The average $K_f/B_f$ has been estimated to be 1.6. Thus, given our other estimates, the average optimal tax rate on corporate income is about 37 percent. While this is somewhat lower than the statutory rate of 48 percent for a large company, the presence of the 7 percent investment tax credit in recessions has served to substantially lower the effective tax on produced producer durables. Assuming that half of the years are "recession" years, the annual investment credit is 3½ percent. And assuming that the typical age of produced producer durables is 10 years, the optimal effective excise tax (using the formula for consumer durables) on corporate purchases of producer durables is about 7 percent x 1.6. This means that the effective 3½ percent investment credit subsidizes capital by about 30 percent of the optimal capital tax. But since the corporate profits tax rate is greater than the optimal rate by about 30 percent of the optimal rate, the combined 48 percent corporate profits tax and 7 percent investment credit in recessions effects very close to an optimal tax on producer durables in corporations.
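The two "about 30 percent" figures in this paragraph can be retraced with a few lines of arithmetic. The sketch below only recomputes the numbers quoted in the text (the 1.6 market-to-book ratio, a 7 percent credit in force in half of all years, and the 37 and 48 percent rates); the variable names are illustrative and are not the author's notation.

```python
# Rough check of the percentages quoted in the text; all names are illustrative.
market_to_book = 1.6           # estimated average market-to-book ratio
credit_in_recession = 0.07     # investment tax credit in "recession" years
share_recession_years = 0.5    # assumption stated in the text
optimal_profit_tax = 0.37      # average optimal corporate rate quoted in the text
statutory_profit_tax = 0.48    # statutory rate for a large company

# Average annual credit: 7 percent in half of the years.
average_credit = credit_in_recession * share_recession_years

# Optimal effective excise on producer durables as quoted: 7 percent x 1.6.
optimal_excise = 0.07 * market_to_book

# Credit as a share of the optimal excise, and the statutory excess over the optimal rate.
subsidy_share = average_credit / optimal_excise
excess_share = (statutory_profit_tax - optimal_profit_tax) / optimal_profit_tax

print(f"average annual credit:              {average_credit:.3f}")
print(f"credit as share of optimal excise:  {subsidy_share:.2f}")
print(f"statutory excess over optimal rate: {excess_share:.2f}")
```

Both ratios come out close to 0.3, which is the sense in which the credit and the excess profits tax rate are said roughly to offset each other.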

Surrenderable Capital

Apparently, the only empirical cases in which the costs of defending certain units of capital are significantly higher than the costs of defending equally valuable coveted capital are cases in which the capital is located on foreign soil. In accord with our general results, the U.S. government neither substantially commits itself to defend such capital nor significantly taxes the income from such capital.* Rather, the United States sells insurance against foreign confiscation to the various companies with foreign holdings, in effect charging a price for providing only those protection services that it would rationally supply as an insurer in order to avoid claims by the insured, much as private medical insurers rationally supply proper medical care in order to avoid future insurance claims.

This was done by multiplying an estimate of the rate of return to book value of equity for U.S. manufacturing corporations in 1966 (Pechman 1971, p. 307) by an estimate of the price-earnings ratio for U.S. industrials for the same year (Moody's Investors Service 1967, p. a23). This was then adjusted to represent the ratio of the book value of companies relative to their market value by adding on the ratio of debt to net worth and dividing by one plus this ratio (Pechman 1971, p. 307).

* An exception applies to overseas earnings of U.S. companies when taxes paid to foreign countries are below what domestic taxes would be. The United States taxes such earnings as domestic earnings and credits the foreign taxes against the tax bill. This is a significant exception only when foreign tax rates are very low, in which case the United States is probably protecting the foreign country, so that the domestic U.S. tax rates should apply.


III. Conclusion

Our results indicate that the U.S. tax structure does not produce the deluge of malincentives and economic inefficiencies that one finds when using the traditional theory of public finance. Rather, using a more accurate theory of the nature of national defense, our tax structure produces roughly optimal incentives. Because no model such as ours was available to the advisors or decision makers during the development of the U.S. tax structure, our results strongly suggest that our political system, using the self-interested calculations of its citizens, has somehow been able to systematically produce a substantially more efficient tax structure than our economists, using the traditional theory of taxation, have been able to recommend.

While incentives under the U.S. tax system appear to be remarkably efficient, we are not advocating the status quo. In particular, we have argued that the current U.S. tax system could be improved by reinstating excise taxes on those consumer durables which are not used for business purposes and replacing corporate income taxes on companies with publicly traded shares with an annual tax of about 2½ percent on the market value of their common and preferred stock, dropping the personal income tax on dividends and capital gains on such stock.


References 

Barnett, H. J., and Morse, C. Scarcity and Growth. Baltimore: Johns Hopkins Univ. Press, 1963.
Commerce Clearing House. Code of Regulations. Chicago: Commerce Clearing House, 1957.
Harberger, A. "The Taxation of Mineral Industries." Federal Tax Policy for Growth and Stability. Washington: Joint Econ. Committee of Congress, Government Printing Office, 1955.
Knight, F. H. "Professor Hayek and the Theory of Investment." Econ. J. 45 (December 1935): 77-94.
Lasser, J. K. Your Income Tax. New York: Simon & Schuster, 1971.
Malinvaud, E. "Capital Accumulation and Efficient Resource Allocation." Econometrica 21 (April 1953): 233-68.
Marshall, A. Principles of Economics. 8th ed. New York: Macmillan, 1925.
Moody's Investors Service. Moody's Industrial Manual (June 1967).
Pechman, J. Federal Tax Policy. Washington, D.C.: Brookings Inst., 1971.
Prentice-Hall. Federal Taxes. Vol. 2. Englewood Cliffs, N.J.: Prentice-Hall, 1971.
Rooney, R. F. Taxation and Regulation of the Domestic Crude Oil Industry. Ph.D. dissertation, Stanford University, 1965.
Ruhter, W. "Childhood, Human Capital Development, and Parental Incentives." Unpublished manuscript, UCLA, 1974.
Samuelson, P. A. "The Pure Theory of Public Expenditure." Rev. Econ. and Statis. 36 (November 1954): 387-89.
------. "Tax Deductibility of Economic Depreciation to Insure Invariant Valuations." JPE 72 (December 1964): 604-6.
Smith, A. Wealth of Nations. Vol. 2. New York: Random House, 1937.
Thompson, E. A. "Statement." Governmental Intervention in the Market Mechanism: The Petroleum Industry. Hearings before the Subcommittee on Antitrust and Monopoly of the Committee on the Judiciary, May 1968.




Vertical Control with Variable Proportions 


Frederick R. Warren-Boulton

Washington University, Saint Louis


If a monopolized input can be used in variable proportions, vertical integration or an equivalent tying arrangement will increase monopoly profits. Assuming a constant elasticity of final demand and a constant-elasticity-of-substitution production function for the final product, this paper uses a combination of analytic and simulation techniques to show that vertical control will also reduce the demand for nonmonopolized inputs and, except under particular cost conditions in the input industries, will result in an increase in the price of the final product. The direction of the combined welfare effect, however, depends on the specific parameter values.


If a monopolized factor or intermediate good can be used in variable proportions with other inputs by downstream producers, pricing of the monopolized input above its marginal cost will distort the input ratio chosen by downstream users. Vertical control¹ by the input monopolist can prevent this distortion and thus increase monopoly profits. This proposition has been established several times in the literature, notably by McKenzie (1951) and by Burstein (1960), and a graphical formulation has been presented by Vernon and Graham (1971).

For public policy, however, the critical problem is to determine the effects of vertical control on the prices of the inputs and the final product and hence the effect on welfare.² This paper examines these effects, assuming a constant-elasticity-of-substitution (CES) downstream production function and a constant elasticity of demand for the final product. A combination of analytic results and simulation experiments is used to show that, except under particular cost conditions in the input industries, vertical control results in higher prices for the consumers of the final product and reduced demand for nonmonopolized inputs. The direction of the "combined" welfare effect on consumers, producers of nonmonopolized inputs, and monopoly profit receivers, however, depends on the specific parameter values in the demand and production functions.

I would like to express my particular appreciation to Charles Berry for his encouragement and assistance. I would also like to thank Trout Rader, Peter Sturm, and a referee for their helpful comments.

¹ Vertical control can be exerted by firms, unions, or individuals in the form of vertical integration (complete control through ownership) or in more partial forms such as tying or royalty arrangements. The use of a monopolized input in variable proportions is one of several possible causes of vertical control. Other reasons include economies of control and information, reduction of risk, price discrimination, avoidance of regulation or price controls, elimination of bilateral or successive monopoly, and the raising of entry barriers. For a comprehensive discussion of possible reasons for vertical control, see Williamson (1971).

The paper is divided into three sections. Section I assumes constant marginal costs in the input industries. Section II extends the results to cases of nonconstant input marginal costs. Section III examines the effect of relaxing other assumptions and attempts to draw some conclusions for public policy.

I. Vertical Control with Constant Input Costs 

Assume that a final product, $X$, is produced by competitive firms using two inputs. The first input, $A$, is monopolized; the other input, $B$, is supplied competitively. The production function for the final product is of constant elasticity of substitution, $\sigma$, and constant returns to scale:

$$X = Y\left[\delta A^{(\sigma-1)/\sigma} + (1-\delta)B^{(\sigma-1)/\sigma}\right]^{\sigma/(\sigma-1)}, \qquad Y > 0,\ 0 < \delta < 1,\ 0 \le \sigma < \infty. \tag{1}$$

Assume also a constant-elasticity demand function for the final product of the form

$$X = Z/P_x^{\eta}, \qquad Z > 0,\ \eta > 1. \tag{2}$$

Throughout this section, it will also be assumed that the marginal cost of producing the monopolized input, $M_a$, is constant over the relevant range, and that $B$ is available in infinitely elastic supply at price $P_b$.

The first step in determining the effect of vertical control on the price of the final product is to derive the profit-maximizing price for $A$ before vertical control. Since the supply of $A$ is monopolized,

$$P_a = \frac{M_a}{1 - 1/E}, \tag{3}$$

where $E$, the elasticity of derived demand for $A$, is given by equation (4):³

$$E = \frac{\sigma(\eta + e) + k_a e(\eta - \sigma)}{\eta + e - k_a(\eta - \sigma)}, \tag{4}$$

where $e$ is the elasticity of supply of $B$, and $k_a$ is the share of the cost of $A$ in the total cost of producing $X$. Since $B$ is assumed to be available in infinitely elastic supply, $e = \infty$, and equation (4) reduces to

$$E = k_a\eta + (1 - k_a)\sigma. \tag{5}$$

From the first-order conditions for cost minimization,

$$k_a = \frac{1}{1 + [(1-\delta)/\delta]^{\sigma}(P_a/P_b)^{\sigma-1}}. \tag{6}$$

Substituting equations (5) and (6) into equation (3),

$$P_a = M_a\left\{1 - \left[\sigma + \frac{\eta - \sigma}{1 + [(1-\delta)/\delta]^{\sigma}(P_a/P_b)^{\sigma-1}}\right]^{-1}\right\}^{-1}. \tag{7}$$

Equation (7) cannot readily be solved explicitly for $P_a$. We do not, therefore, have an expression for $P_a$ which can be used in further derivations. However, $P_a$ can be computed over the range of the various parameters, notably $\eta$, $\delta$, and $\sigma$. Since the downstream $X$ industry is assumed competitive, the price of $X$ is equal to its marginal cost. Thus, once $P_a$ is determined, we can solve for $P_x$:⁴

$$P_x = \frac{1}{Y}\left[\delta^{\sigma}P_a^{1-\sigma} + (1-\delta)^{\sigma}P_b^{1-\sigma}\right]^{1/(1-\sigma)}. \tag{8}$$

After vertical control, an unconstrained integrated monopolist will set marginal revenue equal to marginal cost in the $X$ market. The resulting price for $X$ will be

$$P_x^* = \frac{1}{1 - 1/\eta}\cdot\frac{1}{Y}\left[\delta^{\sigma}M_a^{1-\sigma} + (1-\delta)^{\sigma}P_b^{1-\sigma}\right]^{1/(1-\sigma)}. \tag{9}$$

² While Vernon and Graham assume that the price of the final product is unchanged after integration, Burstein's opinion appears to be that the price of the final product will rise after integration or an equivalent tying arrangement, since "the very existence of substitution possibilities will in general prevent the monopolist from facing the competitive producers of the final product with production costs high enough to force them to charge a price (for the final product) equal to the monopoly price" (Burstein 1960, p. 81). In a recent article in this Journal, Schmalensee (1973) examines the effects of vertical integration with variable proportions under assumptions of a linear homogeneous production function for the final product and constant costs for all inputs. The implication of Schmalensee's results is that the price effect of vertical control cannot be signed without additional information on the cost and demand conditions.

³ See Hicks (1963, p. 244).

⁴ For a derivation, see Nerlove (1967, p. 111).
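Since equation (7) has no explicit solution, the pre-control input price has to be found numerically. The following is a minimal sketch of one way to do so, assuming the standard CES forms for equations (5), (6), (8), and (9); the function names, the bisection routine, and the parameter values in the example are illustrative assumptions, not the procedure actually used for the paper's simulations.

```python
def cost_share_a(p_a, p_b, delta, sigma):
    """Eq. (6): share of A in the total cost of producing X."""
    return 1.0 / (1.0 + ((1.0 - delta) / delta) ** sigma * (p_a / p_b) ** (sigma - 1.0))


def derived_demand_elasticity(p_a, p_b, delta, sigma, eta):
    """Eq. (5): E = k_a*eta + (1 - k_a)*sigma, with B in infinitely elastic supply."""
    k_a = cost_share_a(p_a, p_b, delta, sigma)
    return k_a * eta + (1.0 - k_a) * sigma


def unit_cost(p_a, p_b, delta, sigma, y=1.0):
    """Eq. (8): CES unit cost; the sigma = 1 branch is its Cobb-Douglas limit."""
    if abs(sigma - 1.0) < 1e-12:
        return (p_a / delta) ** delta * (p_b / (1.0 - delta)) ** (1.0 - delta) / y
    inner = delta ** sigma * p_a ** (1.0 - sigma) + (1.0 - delta) ** sigma * p_b ** (1.0 - sigma)
    return inner ** (1.0 / (1.0 - sigma)) / y


def solve_p_a(m_a, p_b, delta, sigma, eta, tol=1e-12):
    """Solve eq. (7), written as P_a*(1 - 1/E(P_a)) = M_a, by bisection."""
    def gap(p):
        return p * (1.0 - 1.0 / derived_demand_elasticity(p, p_b, delta, sigma, eta)) - m_a

    lo, hi = m_a, 2.0 * m_a          # gap(m_a) = -m_a/E < 0, so m_a is a lower bracket
    while gap(hi) < 0.0:             # expand until marginal revenue exceeds marginal cost
        hi *= 2.0
    while hi - lo > tol * hi:
        mid = 0.5 * (lo + hi)
        if gap(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Illustrative parameter values (assumed): delta = 0.5, sigma = 1, eta = 2, M_a = P_b = Y = 1.
p_a = solve_p_a(1.0, 1.0, 0.5, 1.0, 2.0)
p_x_before = unit_cost(p_a, 1.0, 0.5, 1.0)                     # eq. (8)
p_x_after = unit_cost(1.0, 1.0, 0.5, 1.0) / (1.0 - 1.0 / 2.0)  # eq. (9)
print(p_a, p_x_before, p_x_after)
```

Bisection is used here only because it needs no derivative and the bracket is easy to establish; it returns one root of equation (7), and the sketch does not check whether that root is unique for every parameter combination.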




The $A$ monopolist may not, however, be able to set a price $P_x^*$ for the final product after integration. The integrated firm cannot set a price for $X$ which is higher than the independent downstream firms' cost of producing $X$. The cost of producing $X$ for an independent producer, a function of the price of $A$ set by the $A$ monopolist, is thus the "entry price of $X$," $\bar{P}_x$. If the $A$ monopolist is prepared to raise the price of $A$ after vertical control to any arbitrarily high level, $\bar{P}_x$ becomes the cost of manufacturing $X$ using only $B$ (or, equivalently, the value of $P_x$ in eq. [8] with $P_a = \infty$). If $0 \le \sigma \le 1$, $\bar{P}_x$ is infinite, since $X$ cannot be produced without using $A$. But for $\sigma > 1$, $\bar{P}_x$ is less than infinity,⁵ and may even be less than $P_x^*$. Thus the ratio of the price of the final product after vertical control to the price of the final product before vertical control is

$$R = \frac{\min(P_x^*, \bar{P}_x)}{P_x}. \tag{10}$$


$R$ can easily be shown to be equal to unity if the elasticity of substitution is zero or infinity, since, for either of these two extreme values for $\sigma$, factor proportions cannot be affected by vertical control.⁶ In addition, for the special case of $\sigma = 1$, it is possible to derive an explicit expression for $R$⁷ which, with constant costs in both input industries, reduces to

$$R\Big|_{\sigma=1,\ e=g=\infty} = \frac{(1 - 1/E)^{\delta}}{1 - 1/\eta}, \tag{11}$$


⁵ For the CES production function with $\sigma > 1$, the average product of $B$ approaches a positive lower limit as the $A/B$ ratio approaches zero (see Arrow et al. [1961, pp. 230-31]). Hence production is possible with $B$ alone, and the unit cost of producing using only the $B$ input, $\bar{P}_x$, is a finite upper limit to the integrated monopoly's price.

⁶ See Burstein (1960) or Vernon and Graham (1971).

⁷ Assuming a Cobb-Douglas production function for the final product, $X = YA^{\delta}B^{1-\delta}$, $Y > 0$; constant elasticity of demand for the final product, $X = Z/P_x^{\eta}$, $Z > 0$, $\eta > 1$; constant elasticity of supply for the cooperant input, $B = CP_b^{e}$, $C > 0$; and a constant-elasticity marginal-cost schedule for the monopolized input, $A = DM_a^{g}$, $D > 0$, we can solve for the price of the final product before vertical control

2*<'"*’*»*' \ l«IHS. + t+l -St)+*(St«>+»(l-S» 

~ yt*+i)«+-sast i)£jsi«+ii^j _ ^jxi-SMs+n | 

X [id - I/£)]*»<*+''j 

and the price of the final product after vertical control 

+ i^^jki-sxs-h) jinstsr+s+i-ssn-stsr^i+sti-sii 

“ yi«+iKf+ii(;<‘-*ws+i>i)**'*‘»(l _ ^)«t«-*)<i + i) 

X i*<*+'>(l - l/,)s<.+#l+.<i-i) 

The price ratio is thus, 

(1 — + I/#)*<'-^ts+uiutst^+s+t-^si+sts+rtrsti-SM 


(1 - l/,)s<*+*i+ti-»» 


R ae 

tf » 1 




where

$$E\Big|_{\sigma=1,\ e=\infty} = 1 + \delta(\eta - 1)$$

and $g$ is the elasticity of the marginal-cost schedule in the $A$ industry (assumed infinite in Section I). For this special case, $R$ is always greater than unity and approaches unity as $\eta$ approaches infinity or as $\delta$ approaches one. Thus the price of the final product must go up after integration if the elasticity of substitution is unity and there are constant costs in both input industries. The price of the final product will go up by a larger proportion after vertical control, the smaller are $\delta$ and $\eta$. Thus the smaller the share of the cost of the final product which is due to the monopolized input, and the less elastic the demand for the final product, the greater the percentage increase in the price of the final product after integration.⁸
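The closed form for the $\sigma = 1$ case is simple enough to tabulate directly. The sketch below evaluates $R = (1 - 1/E)^{\delta}/(1 - 1/\eta)$ with $E = 1 + \delta(\eta - 1)$ over the $\delta$ and $\eta$ values that the section reports for its simulation grid; the function name and the table layout are assumptions made for illustration.

```python
def price_ratio_sigma_one(delta, eta):
    """R at sigma = 1 with constant costs in both input industries (eq. 11)."""
    e_derived = 1.0 + delta * (eta - 1.0)   # derived-demand elasticity at sigma = 1
    return (1.0 - 1.0 / e_derived) ** delta / (1.0 - 1.0 / eta)


etas = [1.1, 1.5, 2, 3, 4, 5]
deltas = [0.05, 0.10, 0.25, 0.38, 0.50, 0.63, 0.75, 0.95]

# One row per delta, one column per eta.
table = {(d, h): price_ratio_sigma_one(d, h) for d in deltas for h in etas}
print("delta " + " ".join(f"eta={h:<4}" for h in etas))
for d in deltas:
    print(f"{d:<5} " + " ".join(f"{table[(d, h)]:8.3f}" for h in etas))
```

Reading down a column or across a row of the printed table gives a quick check on the comparative statics stated above: every entry exceeds one, and the entries shrink toward one as $\delta$ or $\eta$ rises.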

We can also show that $R > 1$ for a second special case, that of $\sigma \ge \eta$. If $\sigma \ge \eta$, then $E \ge \eta$.⁹ The monopoly markup on $A$ before vertical control, $P_a/M_a$, is thus less than or equal to the markup on $X$ after vertical control, $P_x^*/M_x$. Furthermore, an increase in the price of one input by any proportion must result in a rise in the price of the final product by a lesser proportion if the prices of other inputs are held constant. Thus $P_a/M_a$ must be greater than the ratio of $P_x$ to the price of $X$ which would exist if there were competition in both input and final-product industries. With constant input costs, this competitive price for $X$, $P_x'$, is also the integrated monopolist's marginal cost of producing $X$, $M_x$. Thus if $\sigma \ge \eta$,

$$\frac{P_x}{P_x'} < \frac{P_a}{M_a} = \frac{1}{1 - 1/E} \le \frac{1}{1 - 1/\eta} = \frac{P_x^*}{M_x},$$

and since $M_x = P_x'$, $P_x < P_x^*$.

The conclusion that vertical control results in a rise in the price of the final product for two special cases, $\sigma = 1$ and $\sigma \ge \eta$, leads one to expect this result for the general case of $0 < \sigma < \infty$. The inability to derive an explicit expression for $P_a$ from equation (7) prevents the derivation of an analytic proof. The effects of vertical control for the general case of $0 < \sigma < \infty$ can be examined, however, through the use of computer simulations, using a range of values for each important parameter. In particular, the effect of vertical control on the price of the final product and on the levels of consumers' surplus¹⁰ and monopoly profits was computed for all combinations of: $\sigma$ = 0.00, 0.25, 0.50, ..., 4.00; $\eta$ = 1.1, 1.5, 2, 3, 4, 5; $\delta$ = 0.05, 0.10, 0.25, 0.38, 0.50, 0.63, 0.75, 0.95. Other parameter values were $Y$ = 1.00, $M_a$ = 1.00, $P_b$ = 1.00, $Z$ = 1,000. The conclusions from these simulations are as follows.

⁸ These results for the $\sigma = 1$, $e = g = \infty$, case are also derived in Schmalensee (1973, pp. 447-48).

⁹ From eq. (4), if $\sigma = \eta$, then $E = \sigma = \eta$. If $\sigma > \eta$, then $E > \eta$, and if $\sigma < \eta$, then $E < \eta$. The elasticity of derived demand is thus a weighted average of $\sigma$ and $\eta$.

A. The Effect of Vertical Control on the Price of the Final Product

1. With constant costs in both input industries, the price of the final product always increases for elasticities of substitution between zero and infinity:

$$R > 1, \qquad 0 < \sigma < \infty.$$

As the elasticity of substitution is increased from zero, $R$ rises above unity, reaches a maximum, and then declines, approaching unity again as the elasticity of substitution approaches infinity.

2. The lower the elasticity of final demand, the greater is $P_x^*/P_x$, although $P_x^*$ and $P_x$ are both decreasing functions of $\eta$. Thus if the entry constraint is not effective, a lower elasticity of final demand will increase $R$.

The entry price, $\bar{P}_x$, is not a function of $\eta$. A lower $\eta$ will, however, increase $P_x$. Thus $\bar{P}_x/P_x$ is an increasing function of $\eta$, and, if the entry constraint is effective, a lower elasticity of final demand will decrease $R$.

The effect of varying $\eta$ is illustrated in figures 1a and 1b, which plot $R$ as a function of $\sigma$ for $\eta = 2$ and $\eta = 4$. Figure 1a assumes a value of $\delta$ such that, after some value for $\sigma$, the $\bar{P}_x$ constraint is effective. Thus, for example, if $\sigma = 1$, an increase in $\eta$ reduces $R$, while if $\sigma = 2$, an increase in $\eta$ increases $R$. In figure 1b, $\delta$ is such that the $\bar{P}_x$ constraint is never effective. Thus an increase in $\eta$ will lower $R$ for all values of $\sigma$.

3. The effect of varying $\delta$ parallels that of varying $\eta$. A higher value for $\delta$ shifts $P_x^*/P_x$ down, but shifts $\bar{P}_x/P_x$ upward. Figure 1c shows $R$ for $\eta = 2$ and three values of $\delta$.

These results for $\eta$ and $\delta$ are consistent with the analytic results from the $\sigma = 1$ case (since for $\sigma = 1$, the $\bar{P}_x$ constraint is never effective) and are intuitively plausible. Ceteris paribus, the less elastic the final demand, and the less "important" the monopolized input (the lower is $\delta$), the greater the difference which can be expected between an unconstrained monopoly over one input and an unconstrained monopoly over a final product. On the other hand, if $\delta$ is large there is less likelihood of the input monopolist being confronted after vertical integration with effective competition from other firms in the final-product industry (a low $\bar{P}_x$). Similarly, the more elastic the final demand, the less likely that the desired monopoly price for the final product will be high enough to exceed $\bar{P}_x$.

Thus the effect of $\eta$ and $\delta$ on $R$ cannot be determined in general without information as to the effectiveness of the entry constraint. For $0 \le \sigma \le 1$, if the $A$ monopolist is willing and able to raise the open market price of $A$ to any height (ultimately, by simply refusing to sell $A$ at any price), then the $\bar{P}_x$ constraint is not effective and $R$ is a decreasing function of $\eta$ and $\delta$. Even over this range of $\sigma$, however, if the $A$ monopolist does not raise the price of $A$ sufficiently after vertical control, he may be confronted with a maximum price for the final product which is less than the desired monopoly price for the final product.

¹⁰ It should be noted that the use of consumers' surplus as a measure of welfare requires particular assumptions, for example, homotheticity of preferences (see Rader 1972, pp. 234-43). For possible ambiguities in the use of producers' surplus later in this paper, see Mishan (1968).

Fig. 1. Effect of vertical control on the price of the final product
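The pattern described in this subsection (a price ratio equal to one at $\sigma = 0$, rising above one, and eventually capped by the entry price) can be reproduced with a short simulation. The sketch below is an assumed re-implementation built from the CES forms of equations (5) through (9) and the entry-price argument; it is not the original simulation program. The symbol for the entry price, the function names, and the choice $\delta = 0.5$, $\eta = 2$ are illustrative.

```python
def cost_share_a(p_a, p_b, delta, sigma):
    # Eq. (6): cost share of the monopolized input A.
    return 1.0 / (1.0 + ((1.0 - delta) / delta) ** sigma * (p_a / p_b) ** (sigma - 1.0))


def unit_cost(p_a, p_b, delta, sigma, y=1.0):
    # Eq. (8); the sigma = 1 branch is the Cobb-Douglas limit of the CES cost function.
    if abs(sigma - 1.0) < 1e-12:
        return (p_a / delta) ** delta * (p_b / (1.0 - delta)) ** (1.0 - delta) / y
    inner = delta ** sigma * p_a ** (1.0 - sigma) + (1.0 - delta) ** sigma * p_b ** (1.0 - sigma)
    return inner ** (1.0 / (1.0 - sigma)) / y


def solve_p_a(m_a, p_b, delta, sigma, eta, tol=1e-12):
    # Bisection on eq. (7), written as P_a*(1 - 1/E) = M_a with E from eq. (5).
    def gap(p):
        k_a = cost_share_a(p, p_b, delta, sigma)
        e_derived = k_a * eta + (1.0 - k_a) * sigma
        return p * (1.0 - 1.0 / e_derived) - m_a

    lo, hi = m_a, 2.0 * m_a
    while gap(hi) < 0.0:
        hi *= 2.0
    while hi - lo > tol * hi:
        mid = 0.5 * (lo + hi)
        if gap(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def entry_price(p_b, delta, sigma, y=1.0):
    # Unit cost of X using B alone (eq. 8 with P_a -> infinity); finite only for sigma > 1.
    if sigma <= 1.0:
        return float("inf")
    return (1.0 - delta) ** (sigma / (1.0 - sigma)) * p_b / y


def price_ratio(delta, sigma, eta, m_a=1.0, p_b=1.0, y=1.0):
    # R: post-control price (capped by the entry price) over the pre-control price.
    p_x_before = unit_cost(solve_p_a(m_a, p_b, delta, sigma, eta), p_b, delta, sigma, y)
    p_x_star = unit_cost(m_a, p_b, delta, sigma, y) / (1.0 - 1.0 / eta)
    return min(p_x_star, entry_price(p_b, delta, sigma, y)) / p_x_before


for sigma in [0.0, 0.25, 0.5, 1.0, 2.0, 4.0]:
    print(f"sigma = {sigma:4.2f}   R = {price_ratio(0.5, sigma, 2.0):.4f}")
```

Varying `delta` and `eta` in the final loop gives a rough way to see when the `min` in `price_ratio` switches from the unconstrained monopoly price to the entry price, which is the distinction on which the sign of the comparative statics turns.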

B. The Effect of Vertical Control on Monopoly Profits and the Incentive for Vertical Control

1. Monopoly profits must increase after vertical control for $0 < \sigma < \infty$. This increase reaches a maximum at some $\sigma > 1$. The lower are $\delta$ and $\eta$, the greater the magnitude of the increase in profits, and the lower the value for $\sigma$ where the increase in profits is at a maximum.

While the absolute magnitude of the increase in monopoly profits is relevant for calculating the total welfare effect, a better proxy for predicting the likelihood of vertical control might be the proportion by which monopoly profits could be increased by vertical control, or the increase in profits as a proportion either of the cost of manufacturing $A$ or the total costs of manufacturing $X$. These three alternative indicators of the likelihood of vertical control are also higher, the lower are $\delta$ or $\eta$, except that, where the $\bar{P}_x$ constraint is effective, a higher value for $\delta$ increases the gain in monopoly profits as a proportion of the costs of $A$ or $X$.

2. Since an increase in the price of the final product after vertical control may be impossible or undesirable in some cases (because of regulation, threat of antitrust proceedings, possible entry at the $A$ level, or loss of sales of $A$ in other markets if $P_a$ is increased), it is useful to separate the total increase in monopoly profits into two components: the increase in profits which would be possible without changing the price of the final product after vertical control ($EFCY$), and the further increase in profits due to the change in the price of the final product ($\Delta\pi_{\Delta P_x}$). If there are constant costs in the $B$ industry, the first component can be regarded as a pure "efficiency" effect. It reflects the increase in efficiency of production of the final product (the reduction in the cost, pricing inputs at their social marginal cost, of producing any given amount of the final product) due to the elimination of the input-ratio distortion.¹¹

Graphically, vertical control results in both a downward shift in the cost curve for $X$ and an increase in the price of $X$ (fig. 2). Note that because of the previous existence of monopoly power in the input industry, the welfare loss due to the price increase, $\Delta W_{\Delta P_x}$, is not just the "welfare triangle," kjg in figure 2, but the area ibjg.

Since the welfare effect of the efficiency component is unambiguously positive, and since it may be the only effect of vertical control, the efficiency component is worth separate examination. As a function of $\sigma$, $\delta$, and $\eta$, the efficiency component shows the same pattern as the total increase in profits. As a proportion of the total increase in profits, however, the efficiency gain is higher, the lower is $\sigma$ and the higher are $\delta$ and $\eta$. As $\sigma$ rises, and as $\delta$ and $\eta$ fall, the incentive for vertical control increasingly becomes the achievement of market power rather than the elimination of inefficiency.

If the price of the final product is prevented from increasing, the likelihood of vertical control occurring, as measured by the ratio of the efficiency gain to monopoly profits before vertical control, or to the costs of $A$, is higher, the lower are $\delta$ and $\eta$, and the maximum percentage gain is reached at lower $\sigma$ for lower $\delta$ and $\eta$. As a proportion of the total costs of $X$, the efficiency gain is greater for lower values for $\eta$, but, over high values for $\sigma$, the proportion is higher for larger values for $\delta$.

Whether or not the price of the final product is permitted to rise, therefore, the maximum incentive for vertical control will exist under a combination of very small values for $\delta$ and $\eta$ and an elasticity of substitution only slightly greater than unity. As $\delta$ and $\eta$ increase, so does the "optimal" elasticity of substitution.

¹¹ It is this "efficiency" effect which is shown graphically by Vernon and Graham (1971).

Fig. 2. Effect of vertical control on final-product industry (numerical results assume $\eta = \sigma = 2$ and a fixed value of $\delta$)

C. The Effect of Vertical Control on Welfare

With constant costs in the $B$ industry, the effect on welfare is the combination of the change in monopoly profits and the change in consumer surplus. The change in consumer surplus is always negative, while the change in profits is always positive. If income distribution effects are ignored, the change in consumers' surplus, $\Delta CS_x$, and the change in profits, $\Delta\pi$, can be lumped together to arrive at a "total" welfare effect from vertical control:

$$\Delta W = \Delta CS_x + \Delta\pi.$$

Alternatively, the total change in welfare, $\Delta W$, can be viewed as occurring in two steps (see fig. 2 for an illustration). First, holding the quantity of the final good constant, vertical control results in a reduction in the real cost of producing that quantity of the final product. This efficiency effect, $EFCY$, is one component of the increase in profits resulting from vertical control. In the second step, the integrating monopolist raises the price of the final product, which results in a further increase in monopoly profits, $\Delta\pi_{\Delta P_x}$, and a reduction in consumers' surplus, $\Delta CS_x$. The combined welfare effect of this price increase, $\Delta W_{\Delta P_x}$, is necessarily negative:

$$\Delta W = EFCY + \Delta W_{\Delta P_x}; \qquad \Delta\pi = EFCY + \Delta\pi_{\Delta P_x}; \qquad \Delta W_{\Delta P_x} = \Delta CS_x + \Delta\pi_{\Delta P_x} < 0.$$
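The first decomposition, the change in consumers' surplus plus the change in profits, can be evaluated numerically for any parameter combination. The sketch below does so under the section's assumptions (constant input costs, constant-elasticity demand with $\eta > 1$), using the area under the demand curve above the price as consumers' surplus. It is an assumed re-implementation for illustration; the function names, the entry-price symbol, and the example parameters are not taken from the paper.

```python
def cost_share_a(p_a, p_b, delta, sigma):
    # Eq. (6): cost share of the monopolized input A.
    return 1.0 / (1.0 + ((1.0 - delta) / delta) ** sigma * (p_a / p_b) ** (sigma - 1.0))


def unit_cost(p_a, p_b, delta, sigma, y=1.0):
    # Eq. (8); the sigma = 1 branch is the Cobb-Douglas limit of the CES cost function.
    if abs(sigma - 1.0) < 1e-12:
        return (p_a / delta) ** delta * (p_b / (1.0 - delta)) ** (1.0 - delta) / y
    inner = delta ** sigma * p_a ** (1.0 - sigma) + (1.0 - delta) ** sigma * p_b ** (1.0 - sigma)
    return inner ** (1.0 / (1.0 - sigma)) / y


def solve_p_a(m_a, p_b, delta, sigma, eta, tol=1e-12):
    # Bisection on eq. (7), written as P_a*(1 - 1/E) = M_a with E from eq. (5).
    def gap(p):
        k_a = cost_share_a(p, p_b, delta, sigma)
        return p * (1.0 - 1.0 / (k_a * eta + (1.0 - k_a) * sigma)) - m_a

    lo, hi = m_a, 2.0 * m_a
    while gap(hi) < 0.0:
        hi *= 2.0
    while hi - lo > tol * hi:
        mid = 0.5 * (lo + hi)
        if gap(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def entry_price(p_b, delta, sigma, y=1.0):
    # Unit cost of X using B alone; finite only for sigma > 1.
    return float("inf") if sigma <= 1.0 else (1.0 - delta) ** (sigma / (1.0 - sigma)) * p_b / y


def welfare_effects(delta, sigma, eta, m_a=1.0, p_b=1.0, y=1.0, z=1000.0):
    """Return (change in consumers' surplus, change in profits, total welfare change)."""
    def demand(p):
        return z * p ** (-eta)                          # eq. (2)

    def surplus(p):
        return z * p ** (1.0 - eta) / (eta - 1.0)       # integral of demand above p (eta > 1)

    # Before vertical control: the A monopolist sells to competitive X producers.
    p_a = solve_p_a(m_a, p_b, delta, sigma, eta)
    p_x = unit_cost(p_a, p_b, delta, sigma, y)
    a_used = cost_share_a(p_a, p_b, delta, sigma) * p_x * demand(p_x) / p_a
    profit_before = (p_a - m_a) * a_used

    # After vertical control: inputs priced internally at marginal cost, output price
    # set at the monopoly level unless the entry price binds.
    c_x = unit_cost(m_a, p_b, delta, sigma, y)
    p_after = min(c_x / (1.0 - 1.0 / eta), entry_price(p_b, delta, sigma, y))
    profit_after = (p_after - c_x) * demand(p_after)

    d_cs = surplus(p_after) - surplus(p_x)
    d_pi = profit_after - profit_before
    return d_cs, d_pi, d_cs + d_pi


d_cs, d_pi, d_w = welfare_effects(delta=0.5, sigma=1.0, eta=2.0)
print(f"dCS = {d_cs:.3f}, dPi = {d_pi:.3f}, dW = {d_w:.3f}")
```

Looping `welfare_effects` over a range of `sigma` values for fixed `delta` and `eta` is one way to trace how the sign of the total effect depends on the elasticity of substitution, which is the subject of the simulation results that follow.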

The following conclusions can be drawn from simulations of the welfare effects of vertical control.

1. As a function of the elasticity of substitution, the effect of vertical control on total welfare, $\Delta W$, rises from zero as $\sigma$ increases from zero, reaches a positive maximum, declines to a negative minimum, and then rises again. For parameter values under which the $\bar{P}_x$ constraint is never effective (as in fig. 3a), $\Delta W$ approaches zero again from below as $\sigma \to \infty$. If the $\bar{P}_x$ constraint eventually becomes effective at some value for $\sigma$ (fig. 3b), $\Delta W$ reaches a second positive maximum before declining again to zero from above as $\sigma \to \infty$.

2. If the $\bar{P}_x$ constraint is not effective, lowering $\eta$ or $\delta$ increases the absolute value of the net welfare effect. In addition to this scale effect, the value for $\sigma$ at which the net welfare effect first becomes negative is higher, the lower are $\delta$ or $\eta$. As $\delta \to 1$, or as $\eta \to \infty$,

$$\sigma\Big|_{\Delta W = 0,\ \partial\Delta W/\partial\sigma < 0} \to 0.$$

As $\delta \to 0$, or as $\eta \to 1$,

$$\sigma\Big|_{\Delta W = 0,\ \partial\Delta W/\partial\sigma < 0} \to 1.$$

Thus if the $\bar{P}_x$ constraint is not effective, the net welfare effect will be negative over a minimum range for $\sigma$ of $\sigma \ge 1$. Over this range for $\sigma$, the absolute value of the welfare effect will be greater, the lower are $\delta$ and $\eta$. For the $0 < \sigma < 1$ range, a lower $\delta$ or $\eta$ will increase the range for $\sigma$ over which $\Delta W$ is positive, and increase the magnitude of $\Delta W$.

Fig. 3. Effect of vertical control on profits, consumer surplus, and welfare (horizontal axis: elasticity of substitution, $\sigma$; vertical axis: welfare effects)

If the $\bar{P}_x$ constraint is effective, however, the net welfare effect will become positive again at some $\sigma$, and the value for $\sigma$ where $\Delta W$ again becomes positive will be lower, the lower are $\delta$ or $\eta$. As $\delta \to 0$ or as $\eta \to 1$,

$$\sigma\Big|_{\Delta W = 0,\ \partial\Delta W/\partial\sigma > 0} \to 1.$$

Thus the effect of lower values for $\delta$ and $\eta$, if the $\bar{P}_x$ constraint is effective for some $\sigma$, is to narrow the range around $\sigma = 1$ where $\Delta W < 0$¹² and increase the scale of the welfare effect. Figure 4a graphs $\Delta W$ as a function of $\sigma$ for $\eta = 3, 4$ at one value of $\delta$. Figure 4b shows $\Delta W(\sigma)$ for $\eta = 3, 4$ at a second value of $\delta$. Figure 4c shows $\Delta W(\sigma)$ for $\eta = 2$ and two values of $\delta$.

It is thus difficult to draw any general conclusions for the direction of the welfare effect. If the particular case is such that it is clear that the input monopolist exerting vertical control would have the power to

¹² The net welfare effect is always negative at $\sigma = 1$ if there are constant input costs. With $\sigma = 1$, $e = g = \infty$,


AW = 

e - I 

» 00 


z 

AW = ACS, + Ax, 

e* a. 

[■(21,-1) (2ff-l)l 

(ir- 1) 

! nV;'" EP';'" j 

z 

YS>{\ - rf)o-<>■]<--'> 

(>1 - 1) 


r 

1 

1 

» 

1 

1 

i ^ 


5 - !)*«-*>] 

J 


By u»ing the senes expansion of ln(I — I/if), it can be shown that ^ Thui» 

since 

Q > 0 and E < tf, EW < 0 

eo 







Fig. 4. Total welfare effect of vertical control


set any desired price for the final product after vertical control, then the net welfare effect could be assumed negative at least for $\sigma \ge 1$. If, however, the input monopolist would be constrained from raising the price of the final product after vertical control to the full monopoly price, then the net welfare effect could not be assumed negative unless the elasticity of substitution is close to unity. If $\sigma$ is significantly less than unity and if the values for $\delta$ and $\eta$ are relatively low, then the welfare effect might be presumed positive.







The result that the net welfare effect can be positive or negative
is to be expected, since the welfare effect of an acquisition of a “second
monopoly” is a priori indeterminate, given the presence of an existing
monopoly distortion, and vertical control by an input monopolist can be
viewed as the achievement of a “constrained second monopoly” over the
other input industries. Assuming constant costs in the input industries,
a joint monopoly over all input industries is equivalent to a monopoly
over the final product. If the constraint is not effective, then the same
level of monopoly profits and the same price for the final product also
result from vertical control by a single input monopolist, either in the
form of vertical integration or by the imposition of a tying arrangement
(the requirement that purchasers of A also purchase their requirements
of B from the A monopolist). If the constraint is effective,
however, profits and final-product prices will be lower and total welfare
higher under vertical control than under a joint monopoly of all inputs.
Under vertical control, the A monopolist sets the input prices P_a and
P_b (either directly in a tying arrangement or implicitly through internal
pricing under vertical integration) such that



This “second monopoly” over the other input industries can be
illustrated graphically if σ = η, since in this case the cross elasticity
of demand between the inputs is zero and the input derived-demand
curves do not shift in response to changes in the prices of other inputs.
If σ = η (see fig. 5), the effect of vertical control is simply to increase
the price of the cooperant input while leaving the price of the monopolized
input unchanged. In figure 5b, ΔCS = −abcd, Δπ = aecd, and
ΔW = −ebc. If σ > η, then E > η and P* > P^. Since P* > P^,
the total welfare effect must therefore be negative if σ > η and the
constraint is not effective. If σ < η, P* < P^, and ΔW can be positive
or negative. Since η > 1, however, this conclusion is weaker than the
simulation result that, if the constraint is not effective, ΔW < 0 if
σ > 1.

II. Vertical Control with Nonconstant Input Costs 

If we relax the assumption that all input costs are constant, a fall in
the price of the final product after vertical control becomes possible.
In addition, the effect of vertical control on producers’ surplus in the
competitive input industry must be considered.

The effect on R of a rising or falling marginal-cost schedule for A
can be determined from the effect of vertical control on the quantity
of A used. For example, if the quantity of A used increases after vertical
control (ΔA > 0), a rising M_a schedule will reduce ΔA and hence



Fig. 5.—Effect of vertical control on input industries for σ = η


increase R. However, while the magnitude of ΔA will be affected by the
form of the M_a schedule, the sign of ΔA will be the same whether M_a
is rising, falling, or constant. With constant M_a, the simulation results
show that if the constraint is not effective, the quantity of A used
always falls after vertical control if σ > η, remains constant if σ = η,
and increases if σ < η. If the constraint is effective, however, the
quantity of A used may increase even if σ > η. Thus if σ < η, then
ΔA > 0, and rising marginal costs in the monopolized input industry
will increase R, while falling M_a will reduce R.

Since the quantity of B used always falls after vertical control, the
only way the price of the final product can decline after vertical control
is if the quantity of A used increases sufficiently to offset the reduction
in B. The most favorable conditions for a large increase in A would
be with σ < η* and a rapidly declining M_a. If the elasticity of the
marginal-cost schedule for A, g, is less than zero and small in absolute
value, then R < 1 becomes possible. For example, in the σ = 1 case,
with constant costs in the B industry only (ε = ∞, g ≠ ∞), R reduces
to


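R = [expression illegible in the source; the surviving fragment is a power of (1 − 1/E)]   (13)

where

E = 1 + δ(η − 1).

As expected, since σ < η,

∂R/∂g = [leading factor illegible] × [E ln(1 − 1/E) − η ln(1 − 1/η)] / (g + E)² < 0

(since σ < η implies E < η).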
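
The sign of the bracketed term can be checked numerically. The sketch below is illustrative only: it assumes the term reads E ln(1 − 1/E) − η ln(1 − 1/η) with E = 1 + δ(η − 1), as the surviving fragments of the printed formula suggest, and the function names and the parameter grid are hypothetical choices rather than anything taken from the paper.

```python
import math


def derived_demand_elasticity(delta, eta):
    """E = 1 + delta * (eta - 1), assumed form for the sigma = 1 case."""
    return 1.0 + delta * (eta - 1.0)


def bracket_term(delta, eta):
    """E*ln(1 - 1/E) - eta*ln(1 - 1/eta); requires 0 < delta < 1 and eta > 1."""
    e_cap = derived_demand_elasticity(delta, eta)
    return e_cap * math.log(1.0 - 1.0 / e_cap) - eta * math.log(1.0 - 1.0 / eta)


# Hypothetical grid of distribution parameters and demand elasticities.
for delta in (0.1, 0.5, 0.9):
    for eta in (1.5, 2.0, 3.0, 4.0):
        e_cap = derived_demand_elasticity(delta, eta)
        # With delta < 1 and eta > 1, E lies strictly between 1 and eta.
        print(f"delta={delta}, eta={eta}: E={e_cap:.2f}, term={bracket_term(delta, eta):.4f}")
```

The term is negative at every grid point because x ln(1 − 1/x) is increasing in x for x > 1 and E < η, which is the ordering the text relies on.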


* If the constraint is effective, R must be greater than unity. Thus, any increase
in A if σ ≥ η cannot result in a fall in the price of the final product after vertical control.






While the second-order conditions (if g < 0, then g < −η) restrict
the range over which equation (13) can be less than unity, they do not
exclude the possibility entirely. The simulation results show a rather
narrow range for g where the price of the final product will fall after
vertical control. Thus if σ < η, a declining marginal-cost schedule
clearly improves the welfare effect of vertical control (since the monopoly
profits from vertical control are also increased) and may even result
in a situation where vertical control improves the welfare of both
consumers and profit receivers.

A finite elasticity of supply of the competitive input (ε ≠ ∞) introduces
a monopsony element. In the absence of vertical control, the
elasticity of supply of B enters indirectly through the elasticity of
derived demand for A (the lower is ε, the lower is E in eq. [4]). Vertical
control, however, in addition to establishing the A monopolist as a
“second monopolist” in the sale of B, now also enables the A monopolist
to act directly as a monopsonist toward the B industry. It is thus easy to
understand why other input producers would object strenuously to
vertical control by a firm with market power over an input with which
their product can be used in variable proportions.

From the viewpoint of consumers of the final product, however, a
rising supply curve for B improves the effects of vertical control. Since
the quantity of B always falls after vertical control, a rising supply
curve for B acts to reduce the fall in B after vertical control and hence
reduce R. If the supply of B is sufficiently inelastic, and if σ < η so
that ΔA > 0, the price of the final product can fall after vertical control
even if there are constant costs in the A industry. For example, if σ = 1
and [second condition illegible in the source],*

R = [expression illegible in the source] < 1.   (14)

Any such gain to consumers of the final product must, however, be
balanced against the loss of producers’ surplus in the B industry. The
total welfare effect of vertical control on consumers, monopoly profit
receivers, and competitive input producers can, of course, be positive
or negative depending on the particular parameter values. For any given
quantity of the final product, however, nonconstant costs in the B


* [This footnote is illegible in the source; its surviving text refers to the
second-order conditions.]






industry clearly reduce the potential efficiency gain from vertical control.
Before vertical control, the ratio of A to B is lower than the efficient
ratio, but after vertical control the input ratio is higher than the efficient
ratio if ε ≠ ∞:

[inequality illegible in the source]

III. Summary and Conclusions

This paper has examined the welfare effects of vertical control by
a monopolist of an input which can be used in variable proportions
with other inputs. By specifying the production function (CES) and
the demand function (constant elasticity) for the final product, the
following conclusions can be drawn.

1. If the marginal costs of producing all inputs are constant, vertical
control will always result in an increase in the price of the final product.
If vertical control enables an unconstrained monopoly over the final
product, the percentage increase in the price of the final product will
be greater, the less elastic the demand for the final product, η, and the
less “important” the monopolized input (the lower is the distribution
parameter, δ, in the production function for the final product). However,
if the final-product monopoly achieved through vertical control
is effectively constrained by potential competition from firms which
can produce the final product without using the monopolized input,
the price increase will be less, the less elastic the final demand and the
less important the monopolized input.

2. If input costs are not constant, a fall in the price of the final
product after vertical control is possible if (a) the elasticity of
substitution between the inputs is less than the elasticity of demand for
the final product, and either (b) the marginal-cost schedule for the
monopolized input is downward sloping, or (c) the supply curve of the
competitive input is upward sloping.

3. The combined welfare effect of vertical control on monopoly
profit receivers, consumers of the final product, and producers of the
competitive input may be positive or negative, depending on the particular
parameter values. This result is to be expected, since the welfare
effect of a “second” monopoly (in this case, a monopoly over the other
input industries) is a priori indeterminate, given the presence of an
existing monopoly distortion. With constant input costs, the combined
welfare effect is always positive over a range for σ immediately above
zero but becomes negative at some σ < 1. If the final-product monopoly
achieved by vertical control is effectively constrained, the combined
welfare effect becomes positive again at some σ > 1. In general, the
less elastic the demand for the final product and the less important the
monopolized input, the greater the magnitude of the welfare effect and
the wider the range for σ over which the combined welfare effect is
positive.

Further extension of these results would involve relaxing the assumptions
made with respect to parameter values and the institutional context.
The critical parameter assumptions have been that σ and η are
constant over the relevant range for vertical control. The existence of
a close substitute for A which could be used by independent X producers
if P_a were increased slightly after vertical control would reduce T, and
hence R. Similarly, a demand curve for the final product which is linear
or even concave to the origin would reduce [symbol illegible] and hence R.
Since the efficiency gain would still be possible, an increasing η or σ over
the relevant range should improve the welfare effects of vertical control.

The welfare effect would also clearly be improved if the A monopolist
were unable to raise P_a significantly after vertical control. This inability
to raise the price of A facing independent X producers could be due to
regulation, threat of antitrust proceedings, or potential entry at the A
level. In addition, if markets for A other than the X industry are present,
an increase in P_a in these markets might be unavoidable if P_a is to be
increased for independent X producers. If raising P_a in these other markets
would reduce monopoly profits, the A monopolist may be unwilling to
raise P_a to independent X firms after vertical control.

If the A monopolist is a labor union, rather than a firm, a change
in the objective function is necessary. Although a full analysis of the
union case cannot be presented here, vertical control by unions, in a
number of different forms,* permits the union to choose from a set of
combinations of higher earnings per worker and higher employment
levels. The effect of vertical control by a union will then depend on
the union’s particular objective function. For example, if the goal of the
union is to maximize earnings per worker, subject to maintaining the

* First, the union could simply purchase the firms involved. Second, the union could
try to fix the labor/output ratio through work rules, featherbedding, or bargaining over
both wages and the level of employment. Third, the union could institute some form of
a tying arrangement, increasing the marginal cost of capital relative to the marginal
cost of labor through a profit-sharing agreement. (Unless the debt/equity ratio were
fixed, however, effective vertical control would require that the profit-sharing agreement
be based on net revenue before interest payments. In addition, if inputs other than
capital could be used in variable proportions with labor, a tying arrangement would
have to cover these inputs as well.) Fourth, the union could take its “monopoly profits”
through a royalty on the final product, thus preventing any fall in the labor/output ratio
resulting from unionization. The best example of such a royalty arrangement is the
United Mine Workers of America Welfare and Retirement Fund, which is financed by
a 40-cent royalty on each ton of coal produced in unionized mines. These royalty
payments constitute a considerable share of total labor costs, reducing employment by
significantly less than would a comparable financing system based on the level of
employment.





existing level of employment, the effect of vertical control by the union
will be to reduce the quantity of capital used in the industry while
leaving the labor input constant. The result will be an increase in both
earnings per worker and the price of the final product.

The sensitivity of the welfare effects of vertical control to the particular
parameter values implies that accurate estimation of these
parameters is necessary. In particular, however, empirical work on
the elasticity of substitution has been concentrated on estimating σ
between capital and labor in a value-added production function. These
estimates fall mainly in the 0.5 < σ < 1.5 range (see Nerlove 1967),
where significant effects from vertical control on final product prices
and monopoly profits could be expected (see figs. 1, 3). Estimates of the
elasticity of substitution between intermediate inputs and value added
are generally low.* Thus particular attention might be given to cases
of vertical control by labor unions or capital-good producers. An additional
consideration is that the original basis of market power for unions
and patented capital-good producers may be legally unassailable, while
policies can still be formulated toward extensions of that market power
through vertical control.

One implication of these estimates for σ is that, for practical purposes,
the most relevant results in this paper are those for relatively
low elasticities of substitution. The elasticity of substitution between
one particular monopolized capital good or one category of labor and
all other capital and labor inputs, however, can be expected to be higher
than between capital and labor aggregates. Similarly, even if the elasticity
of substitution between intermediate inputs and value added is
negligible, substitution possibilities between different intermediate inputs
may be significant. Thus we cannot rule out the intermediate-inputs
case, nor can we ignore the results for high levels of σ.


* See Theil and Tilanus (1964, p. 268) and Balassa, Guisinger, and Schydlowsky
(1970, pp. 1157-61).


References

Arrow, K.; Chenery, H. B.; Minhas, B.; and Solow, R. M. “Capital-Labor
Substitution and Economic Efficiency.” Rev. Econ. and Statis. 43 (August 1961):
225-50.

Balassa, B.; Guisinger, S. E.; and Schydlowsky, D. M. “The Effective Rates of
Protection and the Question of Labor Protection in the United States: A
Comment.” J.P.E. 78 (September/October 1970): 1150-62.

Burstein, M. L. “A Theory of Full-Line Forcing.” Northwestern Univ. Law Rev.
55 (February 1960): 62-95.

Hicks, J. R. The Theory of Wages. New York: St. Martin’s, 1963.

McKenzie, L. “Ideal Output and the Interdependence of Firms.” Econ. J. 61
(December 1951): 785-803.

Mishan, E. J. “What Is Producer’s Surplus?” A.E.R. 58 (December 1968): 1269-82.

Nerlove, M. “Recent Empirical Studies of the CES and Related Production
Functions.” In The Theory and Empirical Analysis of Production, edited by Murray
Brown. NBER Studies in Income and Wealth, vol. 31. New York: Columbia
Univ. Press, 1967.

Rader, T. Theory of Microeconomics. New York: Academic Press, 1972.

Schmalensee, R. “A Note on the Theory of Vertical Integration.” J.P.E. 81
(March/April 1973): 442-49.

Theil, H., and Tilanus, C. B. “The Demand for Production Factors and the
Price Sensitivity of Input-Output Predictions.” Internat. Econ. Rev. 5 (September
1964): 258-72.

Vernon, J. M., and Graham, D. A. “Profitability of Monopolization by Vertical
Integration.” J.P.E. 79 (July/August 1971): 924-25.

Williamson, O. E. “The Vertical Integration of Production: Market Failure
Considerations.” A.E.R. 61 (May 1971): 112-23.



Communications 


The Risk Element in Occupational and 
Educational Choices: Comment 

John C. Hause 

University of Minnesota


In an interesting recent contribution, Weiss (1972) concludes that
“moderate” degrees of risk aversion substantially reduce the average rates
of return from educational investment. This conclusion depends critically
on the maximand for the consumer assumed by Weiss, which is
anti-Fisherian with a vengeance. The purpose of this note is to indicate more
explicitly the role played by the utility functional specified by Weiss in
leading to his conclusion, and to suggest an alternative procedure that
may be preferred by economists addicted to traditional capital theory.

The two main results of the analysis are (a) that the original Weiss
formulation is theoretically inappropriate for dealing with the problem
at hand and (b) that the empirical result obtained by Weiss that moderate
risk aversion greatly reduces the schooling returns to Ph.D. recipients is
unjustified. That conclusion is primarily due to the foreseen absence of
earnings while attending school, not to risk. The empirical discussion is
also flawed by ignoring how the covariance structure of earnings over time
must be taken into account in the empirical estimation of risk.

Weiss (1972, p. 1208) assumes that consumers attempt to maximize
expected discounted utility

\[ \max\; E\left\{ \int e^{-\rho t}\, u[y(t)]\, dt \right\}, \qquad (1) \]

Financial support for this note was provided by U.S. Office of Education grant
OEG-0-72-1569. It has been modified at several points in response to correspondence
with Yoram Weiss. Comments by Uri Ben-Zion were also useful. The author alone is
responsible for reprehensibilities.




where current utility u is a function of current income y(t), and has the
form

\[ u = \frac{y^{1-a}}{1-a}, \qquad (2) \]

where a is a constant.¹ Fisher’s (1930) first principle of capital theory
stresses the independence of consumption and production decisions under
certainty (with a perfect capital market) and assumes utility is derived
only from consumption. If one accepts this view, choosing equation (1)
as the maximand can be justified only if intertemporal shifts in consumption
through saving and borrowing are impossible. For a given discount
rate, ρ, and for risk aversion (0 < a < 1), equation (1) implies that
from the family of all income streams with the same present value, a
known constant-level income stream is preferred, and is superior to
receiving (with certainty) all income in the first or the last period of a
person’s working life, with zero incomes for other periods. This outcome
follows from the assumed diminishing marginal utility of income at each
point (or period) of time, not from risk, since the preceding comparison
is between alternative income profiles received with certainty.² The
postponement of income for 5 years past the B.A. degree by Ph.D.
candidates assumed by Weiss, in conjunction with this feature, is the main
element in leading to the reduced average rates of return he computes.
Suppose that we compare two constant-level income streams, received
with certainty, one extending from the B.A. to infinity, the second
extending from the Ph.D. to infinity (with zero income for the 5 years
between B.A. and Ph.D.). With a discount rate of ρ = .10, the Ph.D.
income stream must have a present value between 1.13 and 7.39 times as
large as the B.A. stream, as a varies between .2 and .8, to equalize utilities.
This result has nothing to do with risk; it follows from zero earnings while
acquiring the Ph.D. and the assumed intertemporal substitution implied
in equation (1). Thus, it is not surprising that the coefficient of variation
of income as a function of age (which presumably characterizes the actual

¹ Weiss acknowledges that the assumption of utility depending on current income is
“somewhat dubious and is adopted only for simplicity” in n. 5 of his paper. However,
his empirical conclusions depend critically upon this assumption. He suggests that one
might view u(y) as an indirect utility function, with a constant interest rate. This
interpretation gets at the problem only for the trivial case where income profiles differ only
by a scale parameter.

² Diminishing marginal utility of income was introduced in the expected utility
hypothesis of von Neumann-Morgenstern to characterize risk aversion. It is technically
misleading to conclude that the assumption of risk aversion necessarily implies anything
about intertemporal preferences under certainty. This unsatisfactory formulation is
obvious if we consider an individual with “risk preference” (by considering −1 < a < 0
in eq. [2]). In this case, the maximand of eq. (1) implies that of all certain income profiles
with the same present value, the least desirable is a constant income over time, and the
most desirable streams are those concentrated at one point (or period) of time.



risk³) is of trivial importance in reducing the average rates of return
relative to the postponement of income for moderate a, a conclusion pointed
out at the end of Weiss’s study.⁴
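
The 1.13 and 7.39 figures can be reproduced with a few lines of arithmetic. The sketch below is an illustration under stated assumptions, not Weiss’s or the author’s own computation: it takes utility to be u(y) = y^(1 − a)/(1 − a), uses continuous discounting, and assumes that present values are computed at the same rate ρ as the utility discount rate. All function names are hypothetical.

```python
import math


def utility(y, a):
    """Assumed utility form (2): u(y) = y**(1 - a) / (1 - a), with 0 < a < 1."""
    return y ** (1.0 - a) / (1.0 - a)


def discounted_utility(y, a, rho, start):
    """Discounted utility of a certain, constant income y from `start` onward.

    Income is zero before `start`; u(0) = 0 when 0 < a < 1, so those years
    add nothing. The integral of exp(-rho*t) from `start` to infinity is
    exp(-rho*start) / rho.
    """
    return math.exp(-rho * start) * utility(y, a) / rho


def present_value(y, rho, start):
    """Present value of the same stream, discounted at the rate rho."""
    return math.exp(-rho * start) * y / rho


def equalizing_pv_ratio(a, rho=0.10, delay=5.0, y_ba=1.0):
    """PV(Ph.D. stream) / PV(B.A. stream) when both yield equal utility."""
    # Equal utility requires exp(-rho*delay) * y_phd**(1-a) == y_ba**(1-a).
    y_phd = y_ba * math.exp(rho * delay / (1.0 - a))
    gap = discounted_utility(y_phd, a, rho, delay) - discounted_utility(y_ba, a, rho, 0.0)
    assert abs(gap) < 1e-9  # the two streams really are utility-equivalent
    return present_value(y_phd, rho, delay) / present_value(y_ba, rho, 0.0)


for a in (0.2, 0.5, 0.8):
    print(f"a = {a:.1f}: PV ratio = {equalizing_pv_ratio(a):.2f}")
```

Under these assumptions the ratio reduces to exp[5ρa/(1 − a)], which gives about 1.13 at a = .2 and 7.39 at a = .8. No random element enters the calculation, which is the point of the comparison.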

For those accepting Fisherian capital theory, the natural maximand
for getting at risk instead of income variability would be the expected
utility of the discounted income stream.⁵ Of course, the probability
distribution of discounted lifetime income cannot be computed from the
cross-sectional data of scientists’ incomes reported by Weiss, since it is
impossible to derive the covariances of annual incomes over the life cycle

³ Even the coefficient of variation of income as a function of age may be a misleading
measure of uncertainty. To the extent that there are substantial differences in postschool
investment in on-the-job training anticipated by individuals, low initial incomes by
those investing heavily in this way are offset by higher earnings later on.

⁴ Table 1 has kindly been made available by Weiss, and may be compared with table 2
in Weiss’s original article to show that the strong reduction in average rates of return
originally obtained there is primarily due to the zero incomes imputed to Ph.D. students
for 5 years. In this new table, Weiss used the reported incomes of scientists in educational
institutions with B.A. and M.A. degrees as an estimate of student income (and presumably
consumption). Thus, mean earnings in the new calculation are assumed to be $3,640,
$4,467, $4,754, $5,041, and $5,328 per year instead of 5 years of zero earnings. The
coefficients of variation are .77 at the B.A. level and .67 at the M.A. level. With a
risk-aversion parameter of zero, this modification almost doubles the private industry
average rate of return and more than triples the all-employers average rate of return,
compared with table 2 in Weiss’s article. More significant, the average rates of return
in this new table drop off at a relatively much slower rate as the risk-aversion parameter
increases. The range of the risk-aversion parameter in table 1 lies between zero and two,
instead of the smaller range of zero to one in table 2 of Weiss’s article. The utility function
used by Weiss decreases in value to negative infinity as income decreases to zero when the
risk parameter exceeds one, and hence is not relevant for calculations assuming that
Ph.D. students have zero earnings while they attend school. The positive-earnings profile
for Ph.D. students assumed by Weiss in calculating table 1 allows one to consider more
extreme degrees of risk aversion than were considered in his original article. For very
high values of risk aversion (1.8 and 2), the computed rates of return are very small.
In both the original article and the new table, low rates of return are primarily a
consequence of differences in level of the assumed mean-earnings profiles for those with B.A.
and Ph.D. degrees. These low returns thus depend on the intertemporal properties of the
assumed utility functions, and not on anything related to risk.

⁵ The following discussion waives the problem of imperfect capital markets, which
severely limit the possibility of people’s obtaining consumption loans, to be paid off by
subsequent earnings. The original formulation by Weiss does not get at this complication
either. However, Weiss has correctly pointed out in subsequent correspondence that
the sequential nature of consumption decisions makes the distribution of present values
of income inadequate for characterizing the expected value of optimal consumption
strategies. I suspect that for empirical studies it may still turn out that the distribution
of discounted lifetime earnings is an adequate entity for earnings-stream optimization.
It is clear that further theoretical work is needed to reveal under what circumstances and
in what sense “adequacy” is attained. In particular, there is currently little evidence on
which specific utility functions of intertemporal consumption under uncertainty are most
satisfactory for discussing these problems. Furthermore, little seems to be known about
how robust conclusions are which are based on calculations with an arbitrary utility
function. There may be a useful way of characterizing classes of utility functions and
classes of earnings functions with random components such that the distribution of
discounted earnings is suitable for analyzing schooling and occupational choice. But this
topic appears to be difficult.







TABLE 1

Average Rates of Return for Various Degrees of Risk Aversion
under Modified Earnings Stream While Attending School

                                       Degree of Risk Aversion
Type of Employment   0.0   0.2   0.4   0.6   0.8   1.0   1.2   1.4   1.6   1.8   2.0

Private industry    .143  .128  .113  .099  .084  .069  .055  .040  .028  .015  .004
Government          .180  .160  .140  .121  .103  .086  .069  .053  .038  .024  .011
All employers       .177  .162  .146  .131  .115  .100  .084  .069  .054  .041  .030


from a single cross section. However, the upper bound of the coefficient
of variation of discounted lifetime income (consistent with the observed
standard deviation of income as a function of age) can be obtained, since
the expected discounted value of lifetime income is the discounted value of
the mean-earnings profile, and the supremum of the standard deviation
of discounted lifetime income is the discounted value of the standard
deviation of income, as a function of age.⁶

This calculation provides an upper bound to the coefficient of variation
of a discounted income profile, consistent with the variances at each point
along the profile. The lower bound of the coefficient of variation is zero,
which is approached if the income profile is determined by uncorrelated
increments of income over short intervals of time. These bounds are
unpleasantly wide for estimating the true variability of discounted lifetime
incomes, but greater precision requires information on income covariances
along the income profile. If data are limited to a single cross section, the
estimating procedure developed by Fase (1971), based on a random-walk
model, is probably superior to either of the bounding estimates.
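
The role of the covariance structure in these bounds can be illustrated with a small discrete-time calculation. The sketch below is only an illustration: the 40-year horizon, the mean of 10 and standard deviation of 5 at every age, the discount rate, and the two correlation structures are all hypothetical, and annual periods stand in for the continuous profile discussed in the text.

```python
import math


def discounted_cv(means, sds, rho, corr):
    """Coefficient of variation of discounted lifetime income.

    means[t], sds[t]: mean and standard deviation of income at age index t.
    corr(s, t): assumed correlation between incomes at age indices s and t.
    """
    n = len(means)
    disc = [math.exp(-rho * t) for t in range(n)]
    mean_pv = sum(disc[t] * means[t] for t in range(n))
    var_pv = sum(
        disc[s] * disc[t] * corr(s, t) * sds[s] * sds[t]
        for s in range(n)
        for t in range(n)
    )
    return math.sqrt(var_pv) / mean_pv


def upper_bound_cv(means, sds, rho):
    """Discounted standard deviations divided by discounted means."""
    n = len(means)
    disc = [math.exp(-rho * t) for t in range(n)]
    return sum(disc[t] * sds[t] for t in range(n)) / sum(
        disc[t] * means[t] for t in range(n)
    )


def perfectly_correlated(s, t):
    return 1.0


def uncorrelated(s, t):
    return 1.0 if s == t else 0.0


# Hypothetical cross-section: same mean and spread of income at every age.
means = [10.0] * 40
sds = [5.0] * 40
rho = 0.10

bound = upper_bound_cv(means, sds, rho)
cv_perfect = discounted_cv(means, sds, rho, perfectly_correlated)
cv_uncorrelated = discounted_cv(means, sds, rho, uncorrelated)
print(f"upper bound          : {bound:.3f}")
print(f"perfect correlation  : {cv_perfect:.3f}")
print(f"uncorrelated incomes : {cv_uncorrelated:.3f}")
```

With perfectly correlated incomes the coefficient of variation reaches the bound; with uncorrelated annual incomes it falls well below it. A single cross section cannot distinguish the two cases, because both have the same standard deviation at every age.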

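⁶ I am indebted to Christopher Sims for pointing out an earlier error and sketching
the correct derivation. The proof is shown below. The corresponding result for variance
is derived, and square roots may be taken of the first and last steps. Writing μ_t for
the mean of income y_t at age t,

\[
\operatorname{Var}\left[\int e^{-\rho t} y_t\,dt\right]
= \iint e^{-\rho(s+t)}\, E[(y_s-\mu_s)(y_t-\mu_t)]\,ds\,dt
\le \iint e^{-\rho(s+t)}\, \{E[(y_s-\mu_s)^2]\}^{1/2}\, \{E[(y_t-\mu_t)^2]\}^{1/2}\,ds\,dt
= \left[\int e^{-\rho t}\, \{E[(y_t-\mu_t)^2]\}^{1/2}\,dt\right]^2,
\]

by Schwarz’s inequality, or equivalently, by the fact that the covariance of a pair of
random variables is never greater than the product of the standard deviations of the
variables.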










References

Fase, M. M. G. “Estimation of Lifetime Income.” J. American Statis. Assoc. 66,
no. 336 (December 1971): 686-92.

Fisher, I. The Theory of Interest. New York: Macmillan, 1930.

Weiss, Y. “The Risk Element in Occupational and Educational Choices.” J.P.E.
80 (November/December 1972): 1203-13.




Purchasing-Power Parity Theory and 
Economic Reform in Yugoslavia 

Ryan C. Amacher and John S. Hodgson
University of Oklahoma


This study tests the applicability of purchasing-power parity theory to
movements in the exchange rates between the Yugoslav dinar and two
West European currencies during the 1950s and 1960s. The purpose is to
determine whether these exchange rates were administered without
regard to underlying economic conditions, as is commonly thought, or
whether they actually reflected changes in economic fundamentals within
the selected countries.

Although an absence of accessible, appropriate data precludes an
examination of currency exchange rates and purchasing powers within
Eastern Bloc countries, one of these Socialist countries, Yugoslavia, has
been introducing market-oriented reforms more rapidly than other
countries in Eastern Europe and has produced a set of economic statistics
more readily comparable with those of Western countries. These
developments provide an opportunity to examine the exchange rates
between this one East European currency and Western currencies.¹
Accordingly, this study examines the exchange rates between the currency
of Yugoslavia and those of her major Western trading partners, Italy
and West Germany.

The framework for analysis is the purchasing-power parity theory,
which holds that the equilibrium exchange rate between two currencies
is determined primarily by their relative purchasing powers.² The theory
became widely known following World War I, when it was formulated
and expounded by Gustav Cassel. Cassel originally contended that

We are grateful to Robert D. Tollison and a referee for helpful comments.

¹ Western observers have argued that exchange rates in Eastern Europe do not reflect
economic forces and as such are irrational, irrelevant, artificial, unrealistic, and, as
characterized by one Eastern observer, basically wrong. All quoted in Boltho (1971,
p. 64): “irrational,” Brown (1962); “irrelevant,” Pryor (1963); “artificial,” Nove and
Donnelly (1961); “unrealistic,” United Nations. For a more complete discussion of
Western views on East European rates of exchange, see Boltho (1971) and Pryor (1963).

² For a discussion of purchasing-power parity theory, see Balassa (1964, pp. 584-96),
Yeager (1966, pp. 170-[illegible]), and Holmes (1967, pp. 686-95). For some empirical
investigations relating to Western countries, see Graham (1930), Stolper (1948), Yeager
(1958), Tsiang (1959), Gaillot (1970), and Hodgson (1972).



currency purchasing powers are the dominant long-run determinants of
exchange rates. Capital flows, transportation costs, and other trade
impediments are of secondary importance. Subsequent writers have
narrowed the theory to assert that relative currency purchasing powers,
measured by relative price indexes, are the only important determinants
of exchange rates, other factors being of negligible importance over the
long run.³ In its presently accepted form purchasing-power parity
theory asserts that, if equilibrium is to be maintained, exchange rates
must be kept in line with broad relative movements in international price
levels, but that changes in such factors as tastes, resources, and technology
will cause changes in exchange rates even at constant international prices.⁴
We find that the Yugoslav exchange rates are not independent of their
purchasing powers, as represented by corresponding price levels within
each of the countries. This suggests that these exchange rates, widely
thought to be arbitrary and unrelated to fundamentals, actually reflect
underlying economic conditions within the three countries. While the
findings do not necessarily demonstrate that the Yugoslav authorities are
wholly rational in their exchange rate policy, or that they are becoming
more rational over time, they do suggest that the authorities are not out
of touch with monetary fundamentals.


I. Purchasing-Power Parity Theory

The comparative version of purchasing-power parity theory states that
the equilibrium exchange value of a currency in any period is equal to
the value during a selected base period (in which the exchange rate is
assumed to be in equilibrium) multiplied by the relative change in price
levels in the two countries since the base year. Thus


$$R_t = R_0 \cdot \frac{P^d_t / P^d_0}{P^f_t / P^f_0}, \tag{1}$$


where $R_t$ = the calculated purchasing-power parity exchange rate (in
domestic currency units per unit of foreign currency) during
period t;

$R_0$ = the actual exchange rate during the base period (also in
domestic currency units per unit of foreign currency);

$P$ = price level; the f superscript denotes the foreign country, the d
superscript denotes the domestic country, the 0 subscript
denotes the base period, and the t subscript denotes the subsequent
period.
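As a rough numerical illustration of the comparative version of the theory, the sketch below computes a purchasing-power parity rate from a base-period rate and two price indexes. The function name `ppp_rate` and every figure in it are hypothetical; none of them come from the data used in this paper.

```python
import math


def ppp_rate(r0, pd0, pdt, pf0, pft):
    """Base-period rate times the relative change in price levels.

    r0       -- base-period exchange rate (domestic units per foreign unit)
    pd0, pdt -- domestic price index in the base period and in period t
    pf0, pft -- foreign price index in the base period and in period t
    """
    return r0 * (pdt / pd0) / (pft / pf0)


# Hypothetical figures: domestic prices rise 50 percent, foreign prices 20 percent.
r0, pd0, pdt, pf0, pft = 300.0, 100.0, 150.0, 100.0, 120.0
rate_t = ppp_rate(r0, pd0, pdt, pf0, pft)

# Equivalent form with all base-period terms collected in one constant:
# K = R_0 * P^f_0 / P^d_0, so that R_t = K * P^d_t / P^f_t.
k = r0 * pf0 / pd0
rate_t_from_k = k * pdt / pft

print(round(rate_t, 2), round(rate_t_from_k, 2))
```

With these made-up inputs both forms give a rate of about 375 domestic units per foreign unit, a 25 percent depreciation that matches the 25 percent rise in the domestic-to-foreign price ratio.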


³ Holmes 1967, pp. 686-87.

⁴ However, as Yeager points out, “The doctrine does describe price-induced
adjustments in trade flows that will limit non-monetary exchange-rate movement” (1958,
pp. 518-19).





For any selected base period, $R_0$, $P^d_0$, and $P^f_0$ are constant, and formula
(1) becomes

$$R_t = K \cdot \frac{P^d_t}{P^f_t}, \tag{1'}$$

where $K = R_0 P^f_0 / P^d_0$ is a constant.

The question of interest is how closely the actual exchange rate is
associated with the calculated purchasing-power parity rate, or equivalently,
how closely the actual rate is associated with the ratio of the price
levels in the two countries. We thus hypothesize the following relationship,
which will be used below as an estimating equation:

$$\ln R(t) = \alpha_0 + \alpha_1 \ln P(t) + U(t) \qquad (\alpha_1 > 0), \tag{2}$$

where R is the actual exchange rate in units of domestic currency per
unit of foreign currency, and P is the ratio of the domestic to the foreign
price index.
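To make the estimating equation concrete, the following sketch fits a double-log relationship of this form by ordinary least squares. It is only an illustration under stated assumptions: the function `fit_loglog` is a hypothetical helper, the data are synthetic, and plain least squares is used here rather than the generalized least squares procedure reported in the results.

```python
import math


def fit_loglog(rates, price_ratios):
    """Least-squares fit of ln R = a0 + a1 * ln P; returns a0, a1, SE(a1), r^2."""
    y = [math.log(r) for r in rates]
    x = [math.log(p) for p in price_ratios]
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    a1 = sxy / sxx                      # elasticity of R with respect to P
    a0 = y_bar - a1 * x_bar
    sse = sum((yi - a0 - a1 * xi) ** 2 for xi, yi in zip(x, y))
    sst = sum((yi - y_bar) ** 2 for yi in y)
    se_a1 = math.sqrt(sse / (n - 2) / sxx)
    return {"a0": a0, "a1": a1, "se_a1": se_a1, "r2": 1.0 - sse / sst}


# Synthetic data only: 20 annual observations built with a0 = 0.5 and a1 = 3.0
# plus a small alternating disturbance, so the fit should land near those values.
price_ratios = [1.0 + 0.05 * i for i in range(20)]
rates = [math.exp(0.5 + 3.0 * math.log(p) + 0.02 * (-1) ** i)
         for i, p in enumerate(price_ratios)]

fit = fit_loglog(rates, price_ratios)
print({name: round(value, 3) for name, value in fit.items()})
```

Because both variables are in logarithms, the slope `a1` reads directly as the percentage change in the exchange rate associated with a 1 percent change in the relative price level.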


II. The Yugoslav Experience

In 1947 Yugoslavia was the first country to adopt the Stalinist economic
model and in 1953 became the first country to abandon it. Since abandoning
the Stalinist model and adopting her “own road to Socialism,”
Yugoslavia has followed an erratic course toward a market-planned
economy. This experiment has proceeded through a number of economic
reforms. Among other objectives, these measures were aimed at reforming
the foreign trade sector and obtaining the eventual convertibility of the
dinar.⁵ In recent years there have been several devaluations, necessary
steps toward convertibility and participation in organized foreign
exchange markets. Still, these goals remain in the distant future.⁶ Yugoslavia
has, nevertheless, become significantly involved in trade with the
Western (non-Comecon) world, making plausible an application of a
purchasing-power parity test to this administered rate of exchange.⁷

III. Empirical Results

Estimates were made using end-year exchange rates and price indexes
obtained from IMF sources for the period 1952-71.⁸ Since data on
direct exchange rates between the dinar and the lira and between the

⁵ See, for example, Anakioski (1969) and Horvat (1971).

⁶ For an account of these reforms, see Amacher (1972). Although Yugoslavia still
imposes exchange controls under a system of fixed rates and exchange rationing, and
the country has attempted reform of the international sector, Yugoslav authorities give
strong support to achieving convertibility of the dinar.

⁷ For measures of Yugoslavia's activity on Western and Eastern markets, see Amacher
(1972, pp. 19-50).

⁸ Wholesale price indexes were used for Italy and West Germany, and “producer's
prices, industrial goods” were used for Yugoslavia. Each of these series was obtained
from International Financial Statistics.





dinar and the deutsche mark were unavailable, it was necessary to derive
these from U.S. dollar exchange rates with these three currencies. The
“actual” dinar-lira exchange rate, $R_{yi}$, was thus derived as

$$R_{yi} = \frac{\text{number of dinar per dollar}}{\text{number of lira per dollar}},$$

and the “actual” dinar-deutsche mark exchange rate, $R_{yg}$, was computed
as

$$R_{yg} = \frac{\text{number of dinar per dollar}}{\text{number of deutsche marks per dollar}}.$$

Two series were available for the dinar-dollar exchange rate, one
reflecting the “official” exchange rate, the other reflecting the “actual
settlement rate.” Both series were used since it would be interesting to
compare the results based on the official exchange rates with those based
on rates more closely related to market forces.

The following notation is adopted to distinguish between these various
exchange rates:

$R^{o}_{yi}$ = the dinar-lira exchange rate based on the official dinar-dollar rate,

$R^{a}_{yi}$ = the dinar-lira exchange rate based on the actual settlement
dinar-dollar rate,

$R^{o}_{yg}$ = the dinar-deutsche mark exchange rate based on the official
dinar-dollar rate,

$R^{a}_{yg}$ = the dinar-deutsche mark exchange rate based on the actual
settlement dinar-dollar rate.

These computations implicitly assume consistent cross-rates between
the currencies, an assumption that would not appear to be too unrealistic
when dealing with annual data.
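The derivation of the dinar rates through the dollar amounts to a simple division of two dollar quotations, which can be sketched as follows. The helper `cross_rate` and the three quotations are illustrative assumptions, not the IMF series used in the paper.

```python
def cross_rate(dinar_per_dollar, foreign_per_dollar):
    """Dinar per unit of foreign currency, derived through the dollar."""
    return dinar_per_dollar / foreign_per_dollar


# Illustrative dollar quotations only (not the series used in the paper).
dinar_per_dollar = 12.5
lira_per_dollar = 625.0
mark_per_dollar = 4.0

dinar_per_lira = cross_rate(dinar_per_dollar, lira_per_dollar)
dinar_per_mark = cross_rate(dinar_per_dollar, mark_per_dollar)

# Under consistent cross-rates the two derived rates imply the lira-mark rate.
implied_lira_per_mark = dinar_per_mark / dinar_per_lira
print(dinar_per_lira, dinar_per_mark, implied_lira_per_mark)
```

The last line of the calculation shows what the consistency assumption means in practice: the lira-mark rate implied by the two derived dinar rates equals the ratio of the two dollar quotations.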

The price-level ratios were computed directly using annual wholesale
price indexes. Thus $P_{yi}$ = price index in Yugoslavia/price index in Italy,
and $P_{yg}$ = price index in Yugoslavia/price index in Germany.

Equation (2) was estimated using generalized least squares (GLS) and
the official exchange rate, yielding the following results:⁹

For Italy:

$$\ln R^{o}_{yi}(t) = \underset{(0.178)}{-0.671} + \underset{(0.709)}{3.543}\,\ln P_{yi}(t) \tag{3}$$

(SD in parentheses); r² = .45, N = 19, D-W = 1.52

For Germany:

$$\ln R^{o}_{yg}(t) = \underset{(0.199)}{4.328} + \underset{(0.736)}{3.476}\,\ln P_{yg}(t) \tag{4}$$

(SD in parentheses); r² = .41, N = 19, D-W = 1.69


⁹ See Appendix for OLS estimates.




In both cases relative movements in price levels are highly significant at
the .05 level and explain between 40 and 45 percent of the variation in
the exchange rates. In each case a 1 percent change in relative price levels
is associated with a 3.5 percent change in the corresponding exchange
rate.¹⁰

The results are also significant when actual settlement exchange rates
are used; GLS estimates were as follows:¹¹

For Italy:

$$\ln R^{a}_{yi}(t) = \underset{(0.121)}{-0.341} + \underset{(0.516)}{3.031}\,\ln P_{yi}(t) \tag{5}$$

(SD in parentheses); r² = .42, N = 19, D-W = 2.16

For Germany:

$$\ln R^{a}_{yg}(t) = \underset{(0.135)}{4.676} + \underset{(0.507)}{2.846}\,\ln P_{yg}(t) \tag{6}$$

(SD in parentheses); r² = .39, N = 19, D-W = 2.18

Relative price-level movements are again highly significant at the .05
level and explain roughly 40 percent of the variation in the exchange rates.
In each case a 1 percent change in relative price levels is associated with
roughly a 3 percent change in the exchange rates. As would be expected,
the price-level coefficients suggest that the actual settlement rate was
adjusted slightly more in line with movements in price levels than was
the more arbitrary official rate.
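The significance statements for the four GLS regressions can be checked with simple arithmetic: dividing each price-level coefficient by its reported SD gives a t-ratio. The sketch below does this under two assumptions that are not stated in the text: that the reported SD is the standard error of the coefficient, and that the relevant two-tailed 5 percent critical value is about 2.11 (Student's t with N - 2 = 17 degrees of freedom, from standard tables).

```python
# (coefficient on ln P, reported SD) as printed in equations (3)-(6)
reported = {
    "(3) Italy, official rate": (3.543, 0.709),
    "(4) Germany, official rate": (3.476, 0.736),
    "(5) Italy, settlement rate": (3.031, 0.516),
    "(6) Germany, settlement rate": (2.846, 0.507),
}

# Assumption: two-tailed 5 percent critical value of Student's t, 17 d.f.
T_CRIT = 2.11

t_ratios = {label: coef / sd for label, (coef, sd) in reported.items()}
for label, t in t_ratios.items():
    print(f"{label}: t = {t:.2f}, exceeds critical value: {t > T_CRIT}")
```

All four ratios fall between roughly 4.7 and 5.9, comfortably above the assumed critical value.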

Since structural changes are bound to have occurred within the three
economies over the 20-year period under study, it would have been
desirable to include time variables in the regressions to capture these
effects. Unfortunately the price-level ratios are highly correlated with
time for each pair of countries, and the inclusion of time variables would
introduce strong elements of multicollinearity.¹²

In each case the observed relationships between exchange rates and
price levels are weakened by the fact that the initial exchange rate is not
likely to have been an equilibrium rate. Any movement of the actual
exchange rate during subsequent periods, from an initial disequilibrium
level toward an underlying equilibrium level, would appear as a movement
unrelated to variations in price levels during the subsequent periods.
Since initial disequilibrium was probably present, the r²s in these estimates
are probably biased in a downward direction.

¹⁰ This might be explained by the fact that as Yugoslav authorities attempt to establish
convertibility, a goal of the reform, the exchange rate is being adjusted not only to
account for recent relative price changes, but also to adjust for past relative price changes
that had not previously been incorporated in the administered rate of exchange.

¹¹ See Appendix for OLS estimates.

¹² Regressions including a time variable are nevertheless reported in the Appendix.



IV. Conclusion


The results support the hypothesis that the devaluations in the dinar’s
rate of exchange during the 1950s and 1960s conform to patterns predicted
by purchasing-power parity theory. Despite all the problems of
measurement,¹³ it appears that Yugoslavia’s prices and exchange rates
have not been administered as independently of economic fundamentals
as has been generally believed. While these findings should not be loosely
generalized to apply to intra-Bloc relationships, they do raise the suspicion
that, in some of the other Socialist economies, exchange rate levels
may not be totally “irrational, irrelevant, artificial, or unrealistic,” but
may, in fact, be responsive to basic economic fundamentals. More
definite answers await more complete data and more comprehensive
tests.

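Appendix

I. Ordinary least squares estimates

A. Official exchange rate

For Italy:

$$\ln R^{o}_{yi}(t) = \underset{(0.127)}{-0.841} + \underset{(0.584)}{4.614}\,\ln P_{yi}(t) \tag{3'}$$

(SD in parentheses); r² = .78, N = 20, D-W = 0.57

For Germany:

$$\ln R^{o}_{yg}(t) = \underset{(0.125)}{4.240} + \underset{(0.493)}{3.941}\,\ln P_{yg}(t) \tag{4'}$$

(SD in parentheses); r² = .78, N = 20, D-W = 0.57

B. Actual settlement rate

For Italy:

$$\ln R^{a}_{yi}(t) = \underset{(0.089)}{-0.319} + \underset{(0.408)}{3.125}\,\ln P_{yi}(t) \tag{5'}$$

(SD in parentheses); r² = .77, N = 20, D-W = 1.01

For Germany:

$$\ln R^{a}_{yg}(t) = \underset{(0.090)}{4.740} + \underset{(0.356)}{2.724}\,\ln P_{yg}(t) \tag{6'}$$

(SD in parentheses); r² = .76, N = 20, D-W = 0.95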
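The D-W figures reported with each regression are Durbin-Watson statistics, which are computed from the regression residuals. A minimal sketch of the calculation follows; the function name and the residual series are hypothetical, chosen only to show the range of the statistic. Values near 2 indicate little first-order serial correlation, while values well below 2, such as those of the OLS estimates above, point to positive serial correlation.

```python
def durbin_watson(residuals):
    """Sum of squared successive differences over the sum of squared residuals."""
    numerator = sum((residuals[i] - residuals[i - 1]) ** 2
                    for i in range(1, len(residuals)))
    denominator = sum(e * e for e in residuals)
    return numerator / denominator


# Hypothetical residual series illustrating the extremes of the statistic.
persistent = [1.0, 1.0, 1.0, 1.0]       # same sign every period
alternating = [1.0, -1.0, 1.0, -1.0]    # sign flips every period

print(durbin_watson(persistent), durbin_watson(alternating))
```

The persistent series gives a statistic of 0 and the alternating series a statistic of 3; real residuals fall somewhere between 0 and 4.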


¹³ For a discussion of some of these problems, see Wiles (1969, pp. 123-55).

¹⁴ Gunter Kohlmey, the East German National Prize winning economist, argues the
case both ways in a single volume. First, “Exchange rates are stable and are being
increasingly brought into alignment with the purchasing power of single currency units.”
Later in the same volume he asserts that the exchange rates do not really correspond to
the purchasing power of different currencies (see Pryor 1963, p. 238).





II. Generalized least squares estimates using time variables (T)

A. Official exchange rate

For Italy:

$$\ln R^{o}_{yi}(t) = \underset{(1.493)}{-4.162} + \underset{(1.075)}{1.797}\,\ln P_{yi}(t) + \underset{(0.027)}{0.062}\,T \tag{3''}$$

(SD in parentheses); r² = .70, N = 19, D-W = 1.73

For Germany:

$$\ln R^{o}_{yg}(t) = \underset{(1.764)}{-0.076} + \underset{(1.137)}{1.234}\,\ln P_{yg}(t) + \underset{(0.032)}{0.079}\,T \tag{4''}$$

(SD in parentheses); r² = .74, N = 19, D-W = 1.81

B. Actual settlement rate

For Italy:

$$\ln R^{a}_{yi}(t) = \underset{(1.056)}{-3.559} + \underset{(0.823)}{0.849}\,\ln P_{yi}(t) + \underset{(0.019)}{0.059}\,T \tag{5''}$$

(SD in parentheses); r² = .71, N = 19, D-W = 2.03

For Germany:

$$\ln R^{a}_{yg}(t) = \underset{(1.117)}{0.689} + \underset{(0.724)}{0.420}\,\ln P_{yg}(t) + \underset{(0.020)}{0.073}\,T \tag{6''}$$

(SD in parentheses); r² = .78, N = 19, D-W = 1.95

When time is included in the double-log specification, the t-scores for the
coefficients of the relative price level variables drop below 2.0, while those for the
time variable remain above 2.0. In nonlog form (not reported), the t-scores for
the relative price level variables remain above 2.0 while those for the time variable
fall. This illustrates problems caused by the presence of multicollinearity when
the time variable is included.


References

Amacher, R. Yugoslavia’s Foreign Trade: A Study of State Trade Discrimination. New
York: Praeger, 1972.

Anakioski, D. “Foreign Trade in the Yugoslav Reform.” Yugoslav Survey (1969).

Balassa, B. “The Purchasing-Power Parity Doctrine: A Reappraisal.” J.P.E.
(December 1964).

Boltho, A. Foreign Trade Criteria in Socialist Economies. Cambridge: Cambridge
Univ. Press, 1971.

Brown, A. “Centrally Planned Foreign Trade and Economic Efficiency.”
American Econ. (November 1962).

Gailliot, H. “Purchasing-Power Parity as an Explanation of Long Term Changes
in Exchange Rates.” J. Money, Credit, and Banking (August 1970).

Graham, F. Exchange, Prices, and Production in Hyper-Inflation: Germany, 1920-1923.
Princeton, N.J.: Princeton Univ. Press, 1930.

Hodgson, J. “An Analysis of Floating Exchange Rates: The Dollar-Sterling Rate,
1919-1925.” Southern Econ. J. (October 1972).

Holmes, J. “The Purchasing Power Parity Theory: In Defense of Gustav Cassel
as a Modern Theorist.” J.P.E. (October 1967).

Horvat, B. “Yugoslav Economic Policy in the Post-War Period: Problems, Ideas,
Institutional Developments.” American Econ. Rev. (suppl., June 1971).

International Monetary Fund. International Financial Statistics. Washington: IMF,
selected years.

Nove, A., and Donnelly, D. Trade with the Communist Countries. London:
Hutchinson, 1961.

Pryor, F. “Foreign Trade Theory in the Communist Bloc.” Soviet Studies (July
1962).

------. The Communist Foreign Trade System. Cambridge: Cambridge Univ. Press,
1963.

Stolper, W. F. “Purchasing-Power Parity and the Pound Sterling from 1919-1925.”
Kyklos, fasc. 3 (1948).

Tsiang, S. “Fluctuating Exchange Rates in Countries with Relatively Stable
Economies: Some European Experiences after World War I.” IMF Staff Papers
(October 1959).

United Nations, Economic Commission for Europe. Econ. Bull. Europe (November
1967).

Wiles, P. Communist International Economics. New York: Praeger, 1969.

Yeager, L. “A Rehabilitation of Purchasing-Power Parity.” J.P.E. (December
1958).

------. International Monetary Relations. New York: Harper & Row, 1966.



Authoritarian versus Nonauthoritarian 
Approaches to Economic Development 

G. William Dick
International Aero Service Corporation


A frequently stated and widely believed maxim of economic development
holds that strong central planning under a communist or authoritarian
regime is necessary for an underdeveloped country to achieve rapid
economic growth. Robert Heilbroner (Heilbroner 1963, pp. 132, 135, and
138), for example, states that “only political leadership of the most forceful
kind can carry the Great Ascent along.” He goes on to say that
“in most of the underdeveloped nations the choice for the command post of
development is apt to lie between a military dictatorship and a left-wing
civilian dictatorship . . . the logic of events points to the formation of
economic systems and political regimes which will seek to impose development
on their peoples. Communism may well be the quickest possible
way out of underdevelopment, and the desperate fact is that in many
areas of the world the present non-communist effort looks like the slowest
possible way.”

Jagdish Bhagwati (Bhagwati 1966, pp. 203-4) trumpets the same refrain
even more forcefully:

No policy of economic development can be carried out unless
the government has the capacity to adhere to it. Quite often,
however, democratic governments lose equanimity and determination
in the face of opposition. This is the dilemma of
most democratic governments. It is here that socialist countries
have an immense advantage: their totalitarian structure
shields the government from the rigorous and reactionary judgments
of the electorate. Another advantage of the socialist
countries is their passionate conviction and dedication to the
objective of economic growth—which contrasts visibly with the 
halting and hesitant beliefs and actions of most democracies.
The firm and purposive sense of direction is in pointed contrast
to the extensive revisions and changes in policies and

Appreciation is expressed to Ernest A. Duff, professor of politics, Randolph-Macon
Woman’s College, for his assistance in classifying the independent less developed countries
of the world by form of government.



methods which are prompted by minor setbacks in most democratic
governments and which produce a sense of drift and
helplessness. The political economy of development poses, in
this respect, a cruel choice between rapid (self-sustained) expansion
and democratic processes.

This theme is so pervasive in the literature that it is generally accepted
as fact, though the assertion is rarely supported by definitive data of any
kind. Recently a group of college students was asked, toward the end of a
seminar course on economic and political development in which they had
been exposed to an extensive range of literature in the field, whether they
thought an authoritarian or a competitive form of government was more
conducive to economic development. The reply was virtually unanimous:
authoritarian.

Such unquestioning acceptance of an unsupported precept prompted
an analysis of this question: Is there any meaningful relationship between
the form of government in a less developed country (LDC) and its rate of
economic growth?

The independent LDCs of the world were classified as to form of
government: authoritarian, semicompetitive, or competitive. An authoritarian
regime is defined as a government where one political party maintains
control through force and/or propaganda, or where there is no
political party, with control maintained by one or more persons. Elections,
if they are held, are meaningless.

A semicompetitive government means that there is either one major and
several minor political parties, with the major party winning all elections,
or one political party which conducts legitimizing elections from time to
time. Also, control over the population is not extensive.

A competitive form of government is one where two or more political
parties are contending for power, with free elections held at some regular
interval.

Classification reflects the general political state of a country during the
1960s, rather than the state of the country at any point in time. This may
result, for example, in a classification of a country as semicompetitive,
though it may presently have a competitive political system.

These countries were further subdivided by region of the world:
Africa, America, Asia, and Europe. This classification scheme for the less
developed countries studied is shown in Appendix A. All independent
LDCs of the world were considered for analysis, and only India and those
countries for which reliable economic data were not available were
excluded. India was not included simply because its huge size would obscure
the data on average rates of growth for the other LDCs of the world.
Mainland China would have been excluded for the same reason, but the
choice was not open because no data were available for mainland China





TABLE 1

LDCs Ranked by Real GDP per Capita Growth Rates (1959-68),
by Form of Government
(Quartile Distribution in %)

                        Form of Government
Quartile    Authoritarian    Semicompetitive    Competitive
1                30                21               23
2                15                42               23
3                26                16               38
4                30                21               15

Source: Derived from Appendix B.


in any case. Thus, 58 countries, plus the Francophone countries (consisting
of 14 African nations), were included in this study. Data on the
Francophone countries as a group were available, though data on the
individual countries were not. These countries, as a group, were classified
as semicompetitive, by weighting the 14 individual forms of government
by population.

The period covered by this study is 1959-68, with at least 5 years of
independence and reliable economic data during that period required for
inclusion of a country. Thus, an LDC which achieved independence in 1964
would be included, while one gaining independence in 1965 would not.

Data on real gross domestic product (GDP) and population during the
1960s were collected and the average annual rates of growth of real GDP
and real GDP per capita were calculated for each country. The growth
rates calculated are trend rates, the trend being determined by fitting a
semilogarithmic straight line to the data by the method of least squares.
The LDCs studied are ranked by real GDP per capita growth rates in
Appendix B and by real GDP growth rates in Appendix C. Table 1 summarizes
Appendix B, showing the quartile distribution of each form of
government when ranked according to real GDP per capita growth rates.
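The trend-rate calculation described above can be sketched in a few lines: regress the logarithm of the series on time and convert the slope to an annual rate. This is an illustration under assumptions, not the author's procedure in detail. The function name is hypothetical, the series are invented, and the conversion exp(slope) - 1 is one common convention; the text does not say whether the slope itself or the compounded rate was reported.

```python
import math


def trend_growth_rate(series):
    """Annual growth rate implied by a least-squares line through ln(series)."""
    n = len(series)
    logs = [math.log(v) for v in series]
    t_bar = (n - 1) / 2.0
    log_bar = sum(logs) / n
    slope = (sum((t - t_bar) * (lv - log_bar) for t, lv in enumerate(logs))
             / sum((t - t_bar) ** 2 for t in range(n)))
    return math.exp(slope) - 1.0


# Hypothetical 10-year series: GDP grows 5 percent a year, population 2 percent.
gdp = [1000.0 * 1.05 ** t for t in range(10)]
population = [50.0 * 1.02 ** t for t in range(10)]
gdp_per_capita = [g / p for g, p in zip(gdp, population)]

gdp_growth = trend_growth_rate(gdp)
per_capita_growth = trend_growth_rate(gdp_per_capita)
print(f"{100 * gdp_growth:.2f} {100 * per_capita_growth:.2f}")
```

A trend rate of this kind uses every annual observation, so it is less sensitive to an unusual first or last year than a rate computed from the endpoints alone.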

Thirty percent of the LDCs having authoritarian regimes placed in the
top quartile, a larger percentage than that attained by either semicompetitive
(21 percent) or competitive (23 percent) governments. But 30
percent of the authoritarian countries also fell in the bottom quartile,
twice the proportion of competitive countries ranking this low, and 56 percent
fell below the median rank. Only 37 percent of the semicompetitive
countries placed below the median rank, though 53 percent of the competitive
countries did so. Forty-one percent of the authoritarian countries
fell in the interquartile range, with 30 percent above and 30 percent
below this range. Sixty-one percent of the competitive governments were
in the interquartile range, with 23 percent above and only 15 percent
below this range. This suggests that LDCs with authoritarian forms of
government perform either very well or very poorly. Though, in general,









TABLE 2

LDCs Ranked by Real GDP Growth Rates (1959-68),
by Form of Government
(Quartile Distribution in %)

                        Form of Government
Quartile    Authoritarian    Semicompetitive    Competitive
1                26                21               31
2                22                37               15
3                22                21               38
4                30                21               15

Source: Derived from Appendix C.


competitive governments may not be able to achieve the high growth
rates attained by authoritarian countries, neither are their chances of
failure so high, and the overall performance of semicompetitive governments
generally surpasses that of either of the other two forms of government.

Table 2 summarizes Appendix C, showing the quartile distribution of
each form of government when ranked according to real GDP growth
rates. Much the same conclusion would be drawn from this table, leaning
even more in favor of competitive forms of government as opposed to
authoritarian regimes with respect to real GDP growth rates. Competitive
governments placed 31 percent in the top quartile, surpassing both authoritarian
(26 percent) and semicompetitive (21 percent) countries, while
only 15 percent fell in the bottom quartile.
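A quartile distribution of this kind can be tabulated mechanically from a ranked list of countries. The sketch below is a minimal illustration: the eight-country ranking is invented, and the rule used to assign ranks to quartiles (equal-sized blocks by position) is an assumption, since the text does not say how the 59 ranks were divided.

```python
from collections import Counter


def quartile_distribution(ranked_forms):
    """Percentage of each form of government falling in quartiles 1 to 4.

    ranked_forms -- forms of government ("A", "S", "C") ordered from the
                    fastest-growing country to the slowest.
    """
    n = len(ranked_forms)
    totals = Counter(ranked_forms)
    counts = Counter()
    for position, form in enumerate(ranked_forms):
        quartile = position * 4 // n + 1   # assumed rule: equal blocks by rank
        counts[(form, quartile)] += 1
    return {form: [100.0 * counts[(form, q)] / totals[form] for q in (1, 2, 3, 4)]
            for form in totals}


# Hypothetical ranking of eight countries, best performer first.
ranking = ["A", "C", "S", "A", "C", "S", "A", "A"]
distribution = quartile_distribution(ranking)
for form, shares in sorted(distribution.items()):
    print(form, shares)
```

In this made-up ranking half of the authoritarian countries land in the bottom quartile even though one of them ranks first, the same kind of spread the tables describe.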

Ranked either by real GDP per capita growth rates or by real GDP
growth rates, the mean ranks of competitive, semicompetitive, and authoritarian
countries were 28, 30, and 31, respectively. This also seems to
favor a nonauthoritarian, rather than an authoritarian, approach to
economic development.

In analyzing this question further, an average annual rate of growth,
weighted by population, was calculated for each group of countries, according
to the above classification scheme. These results are shown in
table 3.
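The weighting step can be illustrated with a short calculation. The sketch assumes that the group figure is a weighted arithmetic mean of the individual country growth rates; the function name and the two-country example are hypothetical.

```python
def weighted_average(rates, weights):
    """Weighted arithmetic mean of growth rates (rates and weights in matching order)."""
    return sum(r * w for r, w in zip(rates, weights)) / sum(weights)


# Hypothetical group of two countries: a populous slow grower and a small fast grower.
growth_rates = [1.0, 3.0]        # percent per year
populations = [30.0, 10.0]       # millions
gdp_levels = [10.0, 30.0]        # billions; an alternative set of weights

by_population = weighted_average(growth_rates, populations)
by_gdp = weighted_average(growth_rates, gdp_levels)
print(by_population, by_gdp)
```

With these invented figures the population-weighted average is 1.5 percent and the alternative weighting gives 2.5 percent, which shows how much the choice of weights can matter when large and small countries grow at different rates.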

These figures reveal that the 27 authoritarian LDCs and the 13 competitive
LDCs of the world achieved exactly the same rates of growth in
real GDP per capita (1.4 percent per year) and real GDP (2.9 percent per
year) over the 1959-68 period. The 19 semicompetitive LDCs of the
world (one of which is the group of 14 Francophone countries) achieved
slightly higher growth rates of 1.6 percent and 3.5 percent per year in real
GDP per capita and real GDP, respectively.

The LDCs with a competitive form of government attained higher rates
of economic growth (real GDP per capita) than those with an authori-









TABLE 3

Real GDP per Capita Growth Rates (%/yr), by Form of
Government and by Region, 1959-68*

                           Form of Government
Region     Authoritarian   Semicompetitive   Competitive      All
Africa       1.7 (3.5)        1.5 (3.5)       2.0 (2.8)    1.6 (3.5)
America      0.7 (2.2)        1.4 (3.0)       1.8 (3.4)    1.1 (2.6)
Asia         1.2 (2.9)        2.4 (3.9)       0.9 (2.5)    1.2 (2.9)
Europe       3.4 (3.9)        3.2 (3.6)       1.5 (2.7)    2.8 (3.5)
All          1.4 (2.9)        1.6 (3.5)       1.4 (2.9)    1.5 (3.1)

*Average rates, weighted by population. Real GDP growth rates are shown in parentheses.


tarian or semicompetitive form of government in Africa and America. The
authoritarian governments in Europe showed the best performance,
while in Asia they performed slightly better than the competitive governments
but not nearly as well as the semicompetitive governments.

By region, the fastest economic growth was attained in Europe and the
slowest in America and Asia, while the growth rates achieved in Africa
fell in between, very close to the world average. These regional comparisons
must be considered of limited significance in some cases, due to the
numbers of countries involved. Only six of the countries studied are
located in Europe: three authoritarian, two competitive, and one
semicompetitive. Furthermore, there are only three semicompetitive and two
competitive countries in Asia and only one competitive political system
in Africa.

Though India, with a competitive form of government, was excluded
from this study, a brief look at the Indian experience is warranted. From
1959 to 1968, real GDP per capita in India grew at an average annual rate
of only 0.5 percent, considerably below the world average. The rate of
growth in real GDP was 1.7 percent, again somewhat below the world
average.

Likewise, a glance at four less developed communist countries in
Europe (Albania, Bulgaria, Poland, and Rumania), which were excluded
due to a lack of reliable economic data, is informative. Based on the limited
data available, Rumania achieved the fastest rate of growth of these four
countries, 8.2 percent per year in real net material product (NMP) per
capita and 9.0 percent per year in real NMP, while Poland showed the
slowest gains, 2.3 percent and 2.9 percent. The average annual rates of
growth for these four countries taken together (weighted by population),









TABLE 4

Real GDP per Capita Growth Rates (%/yr), by Form of
Government and by Region, 1959-68*

                           Form of Government
Region     Authoritarian   Semicompetitive   Competitive      All
Africa       3.6 (5.6)        2.1 (4.2)       2.0 (2.8)    2.6 (4.6)
America      0.7 (1.8)        1.4 (3.1)       3.9 (5.2)    2.3 (3.7)
Asia         1.7 (3.4)        2.3 (3.9)       0.9 (2.5)    1.7 (3.3)
Europe       3.3 (3.8)        3.2 (3.6)       1.6 (2.8)    3.0 (3.6)
All          2.0 (3.2)        1.9 (3.6)       3.3 (4.7)    2.3 (3.7)

*Average rates, weighted by GDP. Real GDP growth rates are shown in parentheses.


from 1960 to 1968, were 4.9 percent in real NMP per capita and 5.7 percent
in real NMP. These rates are notably higher than the world average
rates of growth for LDCs.

Determining the average rates of growth, weighted by GDP rather than
by population, alters these conclusions slightly, and in favor of competitive
forms of government. These results are shown in table 4.

The most notable change resulting from the use of GDP rather than
population weights is the higher rates of growth in competitive countries
relative to both authoritarian and semicompetitive countries. Weighted by
population, competitive and authoritarian countries showed the same
rates of growth, both slightly below the growth rates of semicompetitive
countries. Now, weighted by GDP, countries with competitive systems of
government exhibit growth rates of 3.3 percent per year in real GDP per
capita and 4.7 percent per year in real GDP, both significantly higher
than the rates attained by either authoritarian governments (2.0 percent
and 3.2 percent) or semicompetitive governments (1.9 percent and 3.6 percent).

Proponents of an authoritarian approach to economic development
generally hold that this is especially necessary during the earliest stages of
development. Since this contention, again, is rarely supported by any data,
the present analysis was extended to shed some light on its validity. The
median GDP per capita, at the beginning of the time period studied, for
the 59 LDCs analyzed was determined. Then the average rates of growth
for countries above and below this median value, by form of government,
were calculated. These results are shown in table 5.
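The median-split procedure described above can be sketched as follows. Everything in the example is hypothetical: the country records are invented, the averages are population-weighted means of individual growth rates, and a country exactly at the median would be grouped with those below it, a convention the text does not specify.

```python
from statistics import median

# Hypothetical records: (form, initial GDP per capita, growth rate %/yr, population)
countries = [
    ("A", 100.0, 1.0, 20.0),
    ("A", 120.0, 2.0, 20.0),
    ("A", 400.0, 3.0, 10.0),
    ("C", 150.0, 1.5, 30.0),
    ("C", 500.0, 2.0, 10.0),
    ("S", 200.0, 2.0, 15.0),
    ("S", 300.0, 2.5, 5.0),
    ("S", 350.0, 3.5, 5.0),
]


def averages_by_median_split(records):
    """Population-weighted mean growth by (side of median, form of government)."""
    cutoff = median(gdp_pc for _, gdp_pc, _, _ in records)
    groups = {}
    for form, gdp_pc, rate, population in records:
        side = "above" if gdp_pc > cutoff else "below"
        groups.setdefault((side, form), []).append((rate, population))
    means = {key: sum(r * w for r, w in members) / sum(w for _, w in members)
             for key, members in groups.items()}
    return cutoff, means


cutoff, means = averages_by_median_split(countries)
print(cutoff)
for key in sorted(means):
    print(key, means[key])
```

Swapping the population column for a GDP column in the same function would give the corresponding GDP-weighted averages.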

When the individual rates are weighted by population in arriving at the
average rates of growth, the data clearly do not support the argument that









TABLE 5

Real GDP per Capita Growth Rates (%/yr), by Form of
Government, Above and Below Initial Median GDP per Capita,
1959-68*

                                    Form of Government
Relation
to Median    Authoritarian   Semicompetitive   Competitive      All

Averages (Weighted by Population)
Above          2.6 (3.6)        1.9 (3.6)       1.7 (3.1)    2.1 (3.5)
Below          1.0 (2.7)        1.5 (3.4)       1.0 (2.5)    1.2 (2.9)

Averages (Weighted by GDP)
Above          2.4 (3.4)        2.0 (3.7)       3.6 (4.9)    2.7 (4.0)
Below          1.3 (2.9)        1.5 (3.3)       1.0 (2.6)    1.3 (3.0)

*Real GDP growth rates in percentage change per year are shown in parentheses.


authoritarian governments perform better than competitive governments
do in the earliest stages of development. Below the median GDP per
capita, the authoritarian countries exhibited the same rate of growth in
real GDP per capita as the competitive countries did, and only a slightly
higher rate in real GDP. The countries with semicompetitive forms of
government experienced higher growth rates than either of the other two
groups of countries. Above the median, authoritarian governments resulted
in slightly faster rates of growth than either competitive or
semicompetitive governments. This suggests that authoritarian governments
do not perform better than nonauthoritarian governments in the earliest
stages of economic development, though they may surpass these other
forms of government in later stages of development.

When the individual rates are weighted by GDP, the data do tend to
support this argument. The authoritarian countries perform better than
the competitive countries below the median GDP per capita, whereas the
reverse is true above the median. Yet it must be noted that, again, the
LDCs below the median GDP per capita with semicompetitive forms of
government outperformed both the authoritarian and competitive
countries. Once more, this casts considerable doubt on the flat statement
that the best approach to economic development is one with an authoritarian
government.

In summary, these results certainly do not support, and tend to refute,
the view that authoritarian countries are universally capable of achieving
faster economic growth in the early stages of development than countries
having competitive political systems, or that underdeveloped countries








face “a cruel choice between rapid (self-sustained) expansion and
democratic processes” (Bhagwati 1966, p. 204). The weight of the evidence
presented here suggests that the opposite presumption is more valid,
requiring exponents of this view to support such a contention. The evidence
indicates, at the very least, that the choice is not at all a clear-cut one and
depends on the circumstances prevailing in each country.


Appendix A

Less Developed Countries Included in Study,
by Form of Government and by Region

Africa
  Authoritarian: Egypt, Ghana, Libya, Morocco, Southern Rhodesia, Uganda, Zaire
  Semicompetitive: Algeria, Ethiopia, Kenya, Nigeria, South Africa, Sudan,
    Tanzania, Tunisia, Francophone countries*
  Competitive: Sierra Leone

America
  Authoritarian: Argentina, Bolivia, Brazil, Cuba, Haiti, Honduras, Paraguay
  Semicompetitive: Dominican Republic, Ecuador, Guatemala, Mexico, Nicaragua, Peru
  Competitive: Chile, Colombia, Costa Rica, El Salvador, Jamaica, Panama,
    Trinidad and Tobago, Uruguay

Asia
  Authoritarian: Burma, China (Taiwan), Indonesia, Iraq, Khmer Republic,
    Pakistan, South Korea, South Vietnam, Syria, Thailand
  Semicompetitive: Iran, Jordan, Malaysia
  Competitive: Ceylon, Philippines

Europe
  Authoritarian: Portugal, Spain, Yugoslavia
  Semicompetitive: Greece
  Competitive: Cyprus, Turkey

Source: Based principally on Political Handbook and Atlas of the World, 1970 (Stebbins and Amoia 1970).
*The Francophone countries include Cameroon, Central African Republic, Chad, Congo, Dahomey,
Gabon, Ivory Coast, Madagascar, Mali, Mauritania, Niger, Senegal, Togo, and Upper Volta.









Appendix B

Less Developed Countries of the World,
Ranked according to
Real GDP per Capita Growth Rates, 1959-68*

Rank  Country                          Form of Government†  Rate (%/yr)
  1   Libya, 1962-68‡                  A                    22.7
  2   Jordan, 1959-67                  S                     6.6
  3   Portugal, 1960-68                A                     5.0
  4   Cyprus, 1960-68                  C                     5.0
  5   Chile                            C                     4.8
  6   South Africa, 1960-68            S                     3.9
  7   China (Taiwan)                   A                     3.3
  8   Greece                           S                     3.2
  9   Spain                            A                     3.2
 10   Yugoslavia                       A                     2.8
 11   South Vietnam, 1960-66           A                     2.5
 12   Jamaica, 1962-68‡                C                     2.5
 13   South Korea                      A                     2.4
 14   Iran                             S                     2.4
 15   Uganda, 1962-68‡                 A                     2.4
 16   Panama                           C                     2.3
 17   Thailand                         A                     2.2
 18   Nigeria, 1960-66‡                S                     2.1
 19   Trinidad and Tobago, 1962-66     C                     2.0
 20   Zaire, 1960-64                   A                     2.0
 21   Sierra Leone, 1963-68            C                     2.0
 22   Iraq, 1959-66                    A                     2.0
 23   Nicaragua                        S                     1.9
 24   Syria, 1959-67§                  A                     1.9
 25   Tanzania, 1964-68                S                     1.7
 26   Mexico                           S                     1.6
 27   Malaysia, 1963-68§               S                     1.6
 28   Kenya, 1963-68                   S                     1.6
 29   Francophone, 1960-68             S                     1.5
 30   Peru                             S                     1.4
 31   Turkey                           C                     1.4
 32   Ethiopia                         S                     1.4
 33   Egypt, 1959-66‖                  A                     1.4
 34   Pakistan‡                        A                     1.4
 35   El Salvador                      C                     1.4
 36   Bolivia                          A                     1.3
 37   Tunisia                          S                     1.1
 38   Costa Rica                       C                     1.0
 39   Philippines                      C                     0.9
 40   Guatemala                        S                     0.9
 41   Khmer Republic, 1959-64          A                     0.9
 42   Ceylon‡                          C                     0.9
 43   Honduras                         A                     0.8
 44   Brazil                           A                     0.8
 45   Argentina                        A                     0.8
 46   Burma                            A                     0.8
 47   Colombia                         C                     0.8
 48   Ecuador                          S                     0.6
 49   Morocco#                         A                     0.4
 50   Paraguay                         A                     0.4
 51   Southern Rhodesia, 1959-67       A                     0.2
 52   Ghana                            A                     0.1
 53   Cuba, 1962-66**                  A                     0.0
 54   Sudan, 1959-66                   S                    -0.2
 55   Indonesia, 1959-67               A                    -0.2
 56   Haiti, 1959-64‡                  A                    -0.3
 57   Dominican Republic               S                    -0.3
 58   Uruguay                          C                    -0.3
 59   Algeria, 1959-66#                S                    -2.6

Sources: Derived principally from National Accounts of Less Developed Countries, 1959-1968 (Organization
for Economic Cooperation and Development 1970). Also Yearbook of National Accounts Statistics, 1969 (United
Nations 1970) and Demographic Yearbook, 1970 (United Nations 1971).

* Unless otherwise indicated, data are based on GDP at constant market prices, in national currency, for
the period 1959-68.
† A = authoritarian, S = semicompetitive, and C = competitive.
‡ Based on GDP at constant factor costs.
§ Based on GNP at constant market prices.
‖ Based on net domestic product at constant factor costs.
# Based on gross domestic “production” at constant market prices. Contrary to GDP, it excludes by
definition value added by civil servants, household servants, private nonprofit institutions, and financial
intermediaries.
** Based on net material product at constant market prices.









Appendix C

Less Developed Countries of the World,
Ranked according to
Real GDP Growth Rates, 1959-68*

Rank  Country                          Form of Government†  Rate (%/yr)
  1   Libya, 1962-68‡                  A                    27.2
  2   Jordan, 1959-67                  S                     9.6
  3   South Africa, 1960-68            S                     6.4
  4   Chile                            C                     6.1
  5   Cyprus, 1960-68                  C                     6.0
  6   Portugal, 1960-68                A                     6.0
  7   South Vietnam, 1960-66           A                     5.3
  8   Jamaica, 1962-68‡                C                     5.0
  9   Uganda, 1962-68‡                 A                     4.9
 10   China (Taiwan)                   A                     4.9
 11   Syria, 1959-67§                  A                     4.9
 12   Trinidad and Tobago, 1962-66     C                     4.6
 13   Nigeria, 1960-66‡                S                     4.5
 14   Zaire, 1960-64                   A                     4.3
 15   Tanzania, 1964-68                S                     4.2
 16   Panama                           C                     3.9
 17   Iran                             S                     3.8
 18   South Korea                      A                     3.8
 19   Thailand                         A                     3.7
 20   Francophone, 1960-68             S                     3.7
 21   Spain                            A                     3.6
 22   Nicaragua                        S                     3.6
 23   Greece                           S                     3.6
 24   Iraq, 1959-66                    A                     3.6
 25   Southern Rhodesia, 1959-67       A                     3.4
 26   Yugoslavia                       A                     3.4
 27   Mexico                           S                     3.3
 28   El Salvador                      C                     3.2
 29   Malaysia, 1963-68§               S                     3.1
 30   Kenya, 1963-68                   S                     3.0
 31   Peru                             S                     3.0
 32   Sierra Leone, 1963-68            C                     2.8
 33   Costa Rica                       C                     2.7
 34   Pakistan‡                        A                     2.7
 35   Egypt, 1959-66‖                  A                     2.7
 36   Turkey                           C                     2.7
 37   Philippines                      C                     2.7
 38   Bolivia                          A                     2.6
 39   Cuba, 1962-66**                  A                     2.5
 40   Honduras                         A                     2.5
 41   Guatemala                        S                     2.5
 42   Colombia                         C                     2.4
 43   Ethiopia                         S                     2.3
 44   Brazil                           A                     2.3
 45   Ecuador                          S                     2.3
 46   Tunisia                          S                     2.2
 47   Khmer Republic, 1959-64          A                     2.2
 48   Indonesia, 1959-67‡              A                     2.2
 49   Ceylon‡                          C                     2.1
 50   Paraguay                         A                     1.9
 51   Burma                            A                     1.8
 52   Morocco#                         A                     1.8
 53   Argentina                        A                     1.6
 54   Dominican Republic               S                     1.5
 55   Ghana                            A                     1.4
 56   Sudan, 1959-66                   S                     1.3
 57   Haiti, 1959-64‡                  A                     0.7
 58   Uruguay                          C                     0.3
 59   Algeria, 1959-66#                S                    -1.5

Note: For sources and notes, see Appendix B.








References

Bhagwati, Jagdish. The Economics of Underdeveloped Countries. New York:
McGraw-Hill, 1966.

Heilbroner, Robert L. The Great Ascent. New York: Harper & Row, 1963.

Organization for Economic Cooperation and Development. National Accounts of
Less Developed Countries, 1959-1968. Paris: Org. Econ. Cooperation and
Development, June 1970.

Stebbins, Richard P., and Amoia, Alba, eds. Political Handbook and Atlas of the
World, 1970. Council on Foreign Relations. New York: Simon & Schuster, 1970.

United Nations. Yearbook of National Accounts Statistics, 1969. New York: United
Nations, 1970.

United Nations. Demographic Yearbook, 1970. New York: United Nations, 1971.





Factor Proportions and Comparative 
Advantage in the Long Run: Comment 

Alan V. Deardorff
University of Michigan


Findlay (1970) examined the effects of growth on comparative advantage
in an open three-sector growth model in which only the consumption
goods were traded. He simplified his analysis considerably by assuming
factor prices to be completely determined by the world prices of traded
goods. It is the purpose of this note to show that this assumption is both
more powerful and more restrictive than Findlay seems to have realized.

In Section I we demonstrate the power of the assumption by showing
that if all of its implications are realized, then certain results that Findlay
thought ambiguous are actually determinate. Then, in Section II, we
argue that the assumption is restrictive, because it requires that both
traded goods be produced. Since such incomplete specialization will not
occur for all parameter values, we describe the effect of dropping the
assumption on the behavior of Findlay’s model and on his results.

I. The Power of the Assumption 

Findlay’s economy produces three goods: two traded consumption goods,
X and Y, the prices of which are given from outside, and a nontraded
investment good, Z, the price of which adjusts to equate supply and demand
domestically. Findlay argued that the fixed prices of X and Y would serve
to determine the prices of the two factors, capital and labor, which would
then determine the input coefficients in each sector.¹ Thus, he was able
to state the equations of the model in the following linear form:²

$$ a_{11}\frac{X}{K} + a_{12}\frac{Y}{K} + a_{13}k = \lambda, \tag{1'} $$

$$ a_{21}\frac{X}{K} + a_{22}\frac{Y}{K} + a_{23}k = 1, \tag{2'} $$

$$ -s\frac{X}{K} - s\frac{Y}{K} + (1 - s)k = 0. \tag{3'} $$

Here, K is the capital stock, k is investment per unit of capital (equal to
the growth rate of capital since depreciation is ignored), λ is the
labor-capital ratio, s is the savings propensity, and a_ij are the fixed input
coefficients of the factors, i (1 = L, 2 = K), per unit of the outputs,
j (1 = X, 2 = Y, 3 = Z).

This paper was written with financial assistance provided by NSF grant GS-3073
to support research in international economics at the University of Michigan. I am
indebted to Robert M. Stern for his comments on an earlier version of the paper.

¹ Determination of factor prices by commodity prices requires all of the assumptions
of the factor-price equalization theorem. Findlay explicitly assumed perfect competition,
the absence of transport costs and tariffs, and the “strong factor-intensity assumption.”
He did not mention the additional requirement that the country actually produce both
traded goods (incomplete specialization). We look further at this requirement in
Section II.

Findlay solved this system, using Cramer’s rule, to get several properties
of the model. The solution involved the determinant, Δ, of the coefficients
on the left-hand side of the above equations. Findlay thought the sign of
this determinant was ambiguous, except when the factor intensity of the
investment good, Z, lay between the factor intensities of goods X and Y.
It was therefore only to this special case that he applied the model.

It turns out, however, that the sign of Δ is not ambiguous at all, but
depends quite simply on the relative factor intensities of X and Y alone.
The reason is that when world prices determine factor prices, they do so
by equating unit costs of each of the commodities to their prices.³ Thus
the coefficients are related as follows:⁴

$$ w a_{1j} + r a_{2j} = 1 \quad (j = 1, 2, 3), \tag{i} $$

where w and r are the fixed wage of labor and rental of capital,
respectively. It is possible to use equations (i) to simplify Findlay’s
expression for Δ to get

$$ \Delta = \frac{a_{22} - a_{21}}{w}. \tag{ii} $$

Thus, Δ is positive (negative) if Y is relatively capital (labor) intensive
compared with X.
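The simplification in (ii) can be checked numerically. The following is a minimal sketch, assuming the sign convention for (3') in which the third row of the coefficient matrix is (-s, -s, 1 - s); the wage, rental, savings propensity, and capital coefficients are hypothetical numbers chosen only for illustration, and the labor coefficients are derived from them through the unit-cost condition (i).

```python
# Illustrative check that the determinant of the coefficient matrix of
# (1')-(3') equals (a22 - a21) / w once the unit-cost condition
# w*a1j + r*a2j = 1 is imposed. All numbers are hypothetical.

def det3(m):
    """Determinant of a 3x3 matrix given as a list of three rows."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def coefficient_matrix(w, r, s, a2):
    """Left-hand-side coefficients of (1')-(3').

    a2 holds the capital coefficients (a21, a22, a23); the labor
    coefficients follow from w*a1j + r*a2j = 1.
    """
    a1 = [(1.0 - r * a2j) / w for a2j in a2]
    return [a1, list(a2), [-s, -s, 1.0 - s]]

w, r, s = 0.5, 0.2, 0.25      # assumed wage, rental, savings propensity
a2 = (1.0, 3.0, 2.0)          # assumed capital inputs for X, Y, Z

delta = det3(coefficient_matrix(w, r, s, a2))
shortcut = (a2[1] - a2[0]) / w
print(delta, shortcut)

# Reversing the capital intensities of X and Y should flip the sign.
a2_reversed = (3.0, 1.0, 2.0)
delta_reversed = det3(coefficient_matrix(w, r, s, a2_reversed))
print(delta_reversed, (a2_reversed[1] - a2_reversed[0]) / w)
```

In this sketch the capital intensity of Z plays no role in the sign of the determinant, which is the point of the argument in the text.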

² Those equations which appeared in Findlay’s article will be referred to in this note
by the same arabic numerals that he used. We will number our own equations using
small roman numerals to distinguish them from his.

³ Findlay mentioned this property early in his paper and even illustrated it in his
fig. 1 when he showed all three unit isoquants with a common tangent. It is somewhat
surprising therefore that he did not make further use of it.

⁴ Findlay defined his commodity units to make each commodity price unity, thus
giving us a 1 on the right-hand side of (i).

Equation (ii) can be derived by solving (i) for a_1j and substituting
directly into Findlay’s definition of Δ, but this mechanical exercise is not
really necessary. Instead we use (i) to transform equations (1')-(3') into
an equivalent system that is more easily solved. Comparing his equations
with (i), we see that if we multiply (1') by w and (2') by r and add, the
resulting equation will have unitary coefficients on the left-hand side.
Thus we form the following new system of equations:

$$ \frac{X}{K} + \frac{Y}{K} + k = w\lambda + r, \tag{iii} $$

$$ a_{21}\frac{X}{K} + a_{22}\frac{Y}{K} + a_{23}k = 1, \tag{iv} $$

$$ k = s(w\lambda + r). \tag{v} $$


Equation (iii) is derived from (1') and (2') as just described, equation (iv)
is identical with (2'), and equation (v) is the sum of (iii), multiplied by s,
and of (3'). The advantage of this formulation over Findlay’s should be
clear: it contains only the capital-input coefficients, the ordering of which
is easily interpreted in terms of factor intensities.

Equation (v) tells us immediately the solution for the growth rate of
capital, k, which compares favorably, in terms of simplicity, with Findlay’s
solution in his equation (4). Indeed, if we differentiate (v) with respect
to λ we find

$$ \frac{dk}{d\lambda} = sw, \tag{vi} $$

which is unambiguously positive regardless of relative factor intensities.
Thus, the ambiguity of this derivative which Findlay noted is resolved,
and steady states are stable regardless of factor intensities.
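As a rough illustration of (v) and (vi), the sketch below solves the original system (1')-(3') by Cramer’s rule and compares the resulting growth rate with s(wλ + r). The parameter values are hypothetical, the labor coefficients are again generated from the unit-cost condition (i), and the sign convention for (3') is the one assumed above.

```python
# Solve (1')-(3') by Cramer's rule for hypothetical parameters and compare
# the growth rate k with the closed form k = s*(w*lam + r) of equation (v).

def det3(m):
    """Determinant of a 3x3 matrix given as a list of three rows."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def solve_cramer(m, rhs):
    """Solve the 3x3 linear system m * x = rhs by Cramer's rule."""
    d = det3(m)
    solution = []
    for col in range(3):
        replaced = [row[:] for row in m]
        for i in range(3):
            replaced[i][col] = rhs[i]
        solution.append(det3(replaced) / d)
    return solution

def solve_model(lam, w, r, s, a2):
    """Return (X/K, Y/K, k) for labor-capital ratio lam."""
    a1 = [(1.0 - r * a2j) / w for a2j in a2]   # unit-cost condition (i)
    m = [a1, list(a2), [-s, -s, 1.0 - s]]
    return solve_cramer(m, [lam, 1.0, 0.0])

w, r, s = 0.5, 0.2, 0.25      # assumed wage, rental, savings propensity
a2 = (1.0, 3.0, 2.0)          # assumed capital inputs for X, Y, Z
lam = 1.0                     # assumed labor-capital ratio

x_per_k, y_per_k, k = solve_model(lam, w, r, s, a2)
k_closed_form = s * (w * lam + r)

# Finite-difference slope of k in lam, to compare with s*w in (vi).
step = 0.01
slope = (solve_model(lam + step, w, r, s, a2)[2] - k) / step
print(x_per_k, y_per_k, k, k_closed_form, slope)
```

With these particular numbers both X/K and Y/K come out positive, so the economy is incompletely specialized; other values of lam would not guarantee this.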

Incidentally, if we equate Findlay’s equation (5) with our equation (vi)
and solve for Δ, we get immediately the relationship we noted in our
equation (ii). Since Δ appears in many of Findlay’s results, it is possible
to use our (ii) throughout his paper to simplify and remove ambiguities.
Alternatively, one could start from the beginning and rederive all of
Findlay’s results from the system of equations (iii)-(v). Either way, as
the reader can verify, the effect is to get most of Findlay’s results in more
manageable form with fewer ambiguous qualitative implications.⁵

⁵ Notable examples are his eq. (12) which becomes » !/■* and his eq. (13)
which becomes di*/dr » —t/nw’, which are unambiguously positive and negative,
respectively. Findlay devotes a good deal of attention to the ratio, a, of X and Y. While
our approach does not notably simplify the solution for this ratio, we are able to get
easily interpreted solutions for X/K and Y/K separately. They behave exactly as one
would expect from the Rybczynski theorem, however, so that it is unnecessary to
reproduce them here.



II. The Restrictiveness of the Assumption

The given world prices of X and Y serve to determine unique factor prices
only if both X and Y are actually produced domestically. Since there is
nothing in the system of equations (1')-(3') to guarantee nonnegative
solutions for X/K and Y/K, the assumption will be valid for only a subset
of the possible values of the exogenous variables.⁶ Outside this subset,
Findlay’s equations are of no use, and the model must be reexamined.

How, then, will the model behave if only one traded good is produced?
The answer to this question appears to be quite complicated, since the
model must now determine factor prices, commodity prices, and factor
proportions within the producing industries, and the equations for doing
this will not be linear. Fortunately, however, the problem has already been
solved elsewhere. For as long as Findlay’s economy produces only one
of the traded goods, say X, then the process of trading part of its
production of X for imports of Y at fixed prices is completely separate from
the processes of production and growth.⁷ Indeed, the economy behaves
exactly like Uzawa’s (1963) two-sector growth model of a closed economy.
Thus, the analysis of Findlay’s model can be completed by simply tacking
on two Uzawa models, one with X and the other with Y as the consumption
good, to explain behavior in those regions where only one of these
goods is produced.⁸

While most of Findlay’s conclusions extend easily to this more complete
model,⁹ one of his conclusions must be reversed. Findlay observed,
incorrectly as we have seen, that steady states will be unambiguously stable
only if the factor intensity of the investment good is intermediate between
those of the two consumption goods. This would require, however, that
the investment good be relatively capital intensive compared with the
consumption good in one of the Uzawa models used to complete the
model. It was in precisely such a case that Uzawa and others found the
possibility of unstable steady states in the closed two-sector model. Thus,
what is really required in order for steady states to be unambiguously
stable in the completed version of Findlay’s model is that the investment
good be relatively labor intensive compared with both X and Y.

⁶ It is possible to identify this subset by setting first X and then Y equal to zero in
eqq. (iii)-(v) and solving the resulting equations, with λ endogenous, for the bounds on λ
outside of which only one traded good is produced. These bounds, it turns out, both lie
between the extreme factor-intensity ratios of the three goods, within which factor prices
would be exogenously determined if all goods were traded. This suggests, in an admittedly
crude way, that the likelihood of Findlay’s assumption being valid is smaller than the
likelihood of factor-price equalization in an otherwise equivalent model with all three
goods traded.

⁷ We assume here that the savings propensity is independent of the price of the
investment good relative to the consumption goods.

⁸ Or where Findlay’s equations yield negative production of one of the goods.

⁹ His comparative advantage results require a trivial modification, merely noting that
the ratio of X and Y is now either zero or infinite, depending upon which good is produced.



References

Findlay, Ronald. “Factor Proportions and Comparative Advantage in the Long
Run.” J.P.E. 78 (1970): 27-36.

Uzawa, Hirofumi. “On a Two-Sector Model of Economic Growth. II.” Rev.
Econ. Studies 30 (1963): 105-18.





The Value of Changes in the Probability of 
Death or Injury 

Michael Jones-Lee
University of York


One of the more puzzling questions encountered in public investment
decision making (especially in the health and transport fields) concerns
the value placed on improvements (or reductions) in the safety of human
life.

Schelling (1968) appears to have been the first to suggest that any
potential change in the fatal (or nonfatal) accident rate may be regarded
as a (typically small) change in the probability of death (or nonfatal
accident) for any particular individual. Mishan (1971) takes a similar
view of the problem and argues that, if alternative resource allocations
are to be compared solely on the basis of “potential Pareto improvements,”
then public investment projects which increase the safety of
human life will be recommended only if the total of sums which
individuals would be prepared to forfeit to effect such probability changes
(and the other desirable consequences of the investment) exceeds the
capital costs of the projects concerned.

Given the manner in which Schelling and Mishan formalize the problem,
the central question is therefore the nature of the Hicksian “compensating
variations” in wealth¹ for various changes in probability of death
or injury. In fact, even if Mishan’s potential Pareto improvement criterion
is rejected as a basis for allocative decisions, information concerning these
compensating variations will still be relevant for all but those public
investment decisions which completely ignore private preferences.

The primary purpose of this paper is to develop a qualitative analysis
of compensating variations for changes in the probability of an individual’s
own² fatal accident. In addition, brief consideration will also be given to
the manner in which the analysis might be modified to deal with (a)
nonfatal accidents and (b) the death or injury of others. Throughout the
discussion all probabilities are to be interpreted as being of an essentially
subjective nature. Since individuals are hardly likely to view their own
death or injury as the subject of repeated experiments, the preconditions
for the existence of objective (relative frequency) probability measures
would seem not to be fulfilled, and subjective probability therefore appears
to be the relevant concept in the current context. Finally, in the interest
of simplicity, the analysis is initially restricted to single-period probability
changes. However, the majority of public investments which affect safety
will do so over extended periods of time, so that ideally one would wish
to develop results for the general multiperiod case. This generalization
of the basic analysis is considered at a later stage in the paper.

An earlier version of this paper was presented at the International Economic
Association Theory Workshop held in Bergen in July-August 1971. It therefore reflects the
many helpful comments and suggestions of the workshop participants and organizers,
especially Jacques Drèze and Jim Mirrlees. Naturally, only I am responsible for remaining
error, obscurity, and triviality.

¹ See Hicks (1943) for a discussion of compensating variations.

² Variations in the probability of an individual’s own death (or injury) are what
Mishan calls variations in “direct” risks. By contrast, “indirect” risks are associated with
the death (or injury) of others.


1. The Assumptions 

It will be assumed that an individual faces two³ eventualities for each
level of wealth, W, with which he commences the current period: either
he may live through the current period, in which case he and his
dependents will enjoy⁴ wealth W, or he may die during⁵ the current
period, in which case his dependents will enjoy whatever proportion of W
is bequeathable (the possibility that the individual’s dependents may also
enjoy the payoff from insurance contracts in the event of his death is
considered in the Appendix). Such a situation may be described by saying
that there exist two relevant states of the world, “life” and “death.” In
general, the individual will be taken to be concerned not only with the
level of wealth, W, with which he commences the current period, but
also with the state of the world that actually occurs, typically preferring
“life” to “death.” At the beginning of the period the individual is
assumed to face a number of alternative decisions, each of which
determines a level of wealth, W (which is not subject to any uncertainty),
and probabilities of occurrence for each of the essentially uncertain states
of the world. While Hirshleifer (1965, 1966) was the first to discuss this
type of decision problem, his analysis was rather informal, and a rigorous
foundation for a theory of choice under such circumstances has only
recently been provided by Luce and Krantz (1971). The latter have
shown that, provided an individual’s preference ordering over alternative
(conditional)⁶ decisions obeys a restricted set of fairly plausible axioms,
then under the circumstances described above he will behave as an
“expected utility maximizer.” That is, the individual will behave as if
each of his alternative decisions yields probabilities p and (1 − p) for
states of the world “death” and “life,” respectively, and he then selects
that decision which maximizes the mathematical expectation of utility
given by

$$ E(U) = (1 - p)L(W) + pD(W), \tag{1} $$

where L(W) is the utility associated with wealth W conditional upon
occurrence of state of the world “life,” while D(W) is the utility associated
with wealth W⁷ conditional upon occurrence of state of the world
“death.”⁸ The functions L(W) and D(W) are unique up to the same
linear transformation. In addition, it will be assumed that both L(W) and
D(W) are continuous and at least twice-differentiable functions.

³ Since we are concerned only with variations in the probability of the individual’s
own death at this stage, there is no need for explicit recognition of the various outcomes
of life and death for people other than the individual himself.

⁴ This is not intended to imply that all of W will be consumed during the current
period but, rather, that the individual commences the current period with marketable
assets worth W.

⁵ Note that this formulation of the problem does not distinguish between the various
times at which the individual may die during the current period. (It is an essentially
discrete-time formulation adopted for the sake of analytical simplicity.) This is clearly a
deficiency, since the individual will presumably prefer to die later rather than earlier
in the period. However, the deficiency disappears in the continuous-time formulation
of the problem outlined in Section 3 below.

Given that he will live through the current period, the individual will
be taken to prefer more wealth to less and also to be financially risk
averse, so that

$$ \frac{dL(W)}{dW} > 0 \tag{2} $$

and

$$ \frac{d^2L(W)}{dW^2} < 0. \tag{3} $$

Given that he will die during the current period, the individual will be
assumed not to be misanthropic to his heirs and also not to wish them more
rather than less financial risk, so that

$$ \frac{dD(W)}{dW} \ge 0 \tag{4} $$

and

$$ \frac{d^2D(W)}{dW^2} \le 0. \tag{5} $$

⁶ Roughly speaking, Luce and Krantz’s analysis differs from other axiomatic theories
of choice under uncertainty such as Savage’s (1954) in that the former is concerned with
decisions which determine the probabilities with which states of the world occur, whereas
in Savage’s analysis decisions have no effect on the probabilities of occurrence of the
various states. Decisions in the Luce-Krantz analysis are referred to as “conditional”
decisions and are clearly the relevant kind of decisions in the current context in which
alternative decisions affect the probabilities of occurrence of the states “life” and “death.”

⁷ This is not meant to imply that the individual is able to bequeath all of W to his
heirs but signifies merely that the bequeathable sum is related to current wealth.
Presumably, in the absence of opportunities for entering into life insurance contracts, only a
fraction of W is bequeathable. The availability of insurance on the other hand would
seem to provide the possibility of augmenting bequeathable wealth at the expense of
current wealth to such an extent that the former exceeds the latter. Insurance is ignored
in the current section but is discussed below in the Appendix.

⁸ Both Drèze (1962) and Fromm (1968) employ the expected utility-maximizing
assumption in considering the question of safety improvements. However, in contrast to
the purpose of the present paper, both authors were primarily concerned with the
optimal private expenditure on safety improvement given a relationship between
expenditure and risk reduction.

Furthermore, it would seem reasonable to suppose that for any given level
of wealth, W, the individual prefers state of the world “life” to state of the
world “death,” so that for all W,

$$ L(W) > D(W). \tag{6} $$

Finally, we introduce a rather strong assumption which, although it
is not crucial for the analysis which follows, does permit the unambiguous
interpretation of certain results. It will be assumed that the individual
displays a more pronounced sensitivity to variations in either wealth or
financial risk if he expects to live rather than die during the current
period. (The most extreme example of this kind of relative sensitivity is
the bachelor with no dependent relatives who is also unconcerned for the
welfare of others. This type of individual will be indifferent concerning
the amount he can bequeath and will therefore be indifferent concerning
the level of his wealth and its associated financial risk if he expects to die
during the current period.) This assumption is taken to have the following
formal implications for L(W) and D(W):

$$ \frac{dL(W)}{dW} > \frac{dD(W)}{dW} \tag{7} $$

and

$$ \frac{d^2L(W)}{dW^2} < \frac{d^2D(W)}{dW^2}. \tag{8} $$

In fact, the only circumstance under which this would not seem to be a
plausible assumption is when only a very small part of current wealth is
bequeathable. The standard of living of the individual’s dependents would
then be considerably reduced by his death, so that it might be that the
individual would be more sensitive to variations in current wealth or
financial risk if he expects to die during the current period. While this
latter case is taken to be the exception rather than the rule, it must be
admitted that the prevalence of life insurance contracts tends to cast some
doubt on the matter. The question of insurance and its possible
implications for the forms of L(W) and D(W) is therefore considered in the
Appendix.


2. The Analysis

Suppose that initially the individual has wealth $\bar{W}$ (> 0) and faces a
probability $\bar{p}$ (0 < $\bar{p}$ < 1) of death during the current period. His
expected utility is then given by

$$ E(U) = (1 - \bar{p})L(\bar{W}) + \bar{p}D(\bar{W}), \tag{9} $$

where $L(\bar{W})$ and $D(\bar{W})$ denote, respectively, L(W) and D(W) evaluated
at $\bar{W}$.

Now suppose that the individual is offered the opportunity to reduce
the probability of his death during the current period from $\bar{p}$ to p (< $\bar{p}$).
Given the restrictions on L(W) and D(W) implied by (2), (4), and (6),
the individual will be prepared to forfeit a positive amount, V, in order to
effect such a probability reduction. By definition, the maximum amount
he will forfeit must be such as to leave him with the same level of expected
utility as in the initial situation. The quantity V will therefore be given
by⁹

$$ (1 - p)L(\bar{W} - V) + pD(\bar{W} - V) = (1 - \bar{p})L(\bar{W}) + \bar{p}D(\bar{W}). \tag{10} $$

Similarly, if the individual is to be induced to accept an increase in
probability, then the minimum sum he will accept as compensation will
increase his wealth to a level such that the augmented probability and
wealth also satisfy equation (10).¹⁰
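Equation (10) defines V only implicitly, so a numerical solution may help fix ideas. The sketch below is illustrative only: the utility functions L(W) = 2√W and D(W) = √W are assumptions chosen because they satisfy (2)-(8) for W > 0, and the initial wealth and probabilities are hypothetical. The root is found by bisection, using the fact that expected utility falls as V rises.

```python
import math

def L(W):
    """Assumed utility of wealth conditional on "life"."""
    return 2.0 * math.sqrt(W)

def D(W):
    """Assumed utility of wealth conditional on "death"."""
    return math.sqrt(W)

def expected_utility(p, W):
    """Equation (1), where p is the probability of death."""
    return (1.0 - p) * L(W) + p * D(W)

def compensating_variation(p, p_bar, W_bar, iterations=200):
    """Solve equation (10) for V by bisection.

    Expected utility at wealth W_bar - V is decreasing in V, so the root
    lies between a sufficiently negative lower bound and V = W_bar.
    """
    target = expected_utility(p_bar, W_bar)

    def gap(V):
        return expected_utility(p, W_bar - V) - target

    lo, hi = -W_bar, W_bar
    while gap(lo) < 0.0:          # widen until the compensation suffices
        lo *= 2.0
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if gap(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

W_BAR, P_BAR = 100.0, 0.10        # hypothetical initial wealth and risk
v_safer = compensating_variation(0.05, P_BAR, W_BAR)
v_riskier = compensating_variation(0.20, P_BAR, W_BAR)
v_unchanged = compensating_variation(P_BAR, P_BAR, W_BAR)
print(v_safer, v_riskier, v_unchanged)
```

For these assumed functions (10) can also be solved by hand, giving V = W̄[1 − ((2 − p̄)/(2 − p))²], which provides a check on the bisection.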

Clearly, V as defined by equation (10) is the Hicksian compensating
variation in wealth for a change in probability from $\bar{p}$ to p. It is therefore
a matter of some interest to establish the form of the functional
relationship between V and p implied by (10). Since this will involve examination
of a number of complex expressions, it will be convenient to introduce
some notational simplification. In what follows we shall therefore employ
the following definitions:

$$ L = L(\bar{W} - V), \qquad D = D(\bar{W} - V), $$

$$ L' = \frac{\partial L(\bar{W} - V)}{\partial V}, \qquad D' = \frac{\partial D(\bar{W} - V)}{\partial V}, $$

$$ L'' = \frac{\partial^2 L(\bar{W} - V)}{\partial V^2}, \qquad D'' = \frac{\partial^2 D(\bar{W} - V)}{\partial V^2}. $$

Finally, the functional relationship between V and p will be denoted by
V(p). Since p denotes a probability, it is clear that V(p) is defined only
on the closed interval 0 ≤ p ≤ 1.

⁹ Provided that bankruptcy imposes no limitation on borrowing, it is possible for V
to exceed $\bar{W}$. However, if bankruptcy does impose a borrowing limitation, then it is
necessary to restrict V to be less than or equal to $\bar{W}$. Strictly speaking, (10) should then be
written as a weak inequality with a complementary slackness condition for the restriction
on V. For the sake of simplicity, the possibility of bankruptcy will be ignored in most of
what follows, though it will be necessary to give it explicit consideration when dealing
with the value of complete elimination of the risk of death.

¹⁰ The quantity V will therefore be negative for increases in probability.

[Fig. 1]

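Differentiating through (10) with respect to p and rearranging, we
obtain

$$ \frac{\partial V}{\partial p} = \frac{L - D}{(1 - p)L' + pD'}. \tag{11} $$

It then follows that

$$ \frac{\partial^2 V}{\partial p^2} = \frac{[(1 - p)L' + pD'](L' - D')(\partial V/\partial p) - (L - D)\{[(1 - p)L'' + pD''](\partial V/\partial p) + D' - L'\}}{[(1 - p)L' + pD']^2}. \tag{12} $$

But, from (2), (4), and (7),

$$ L' < D' \le 0, \tag{13} $$

while from (3), (5), and (8),

$$ L'' < D'' \le 0. \tag{14} $$

Hence, from (6), (11), and (13),

$$ \frac{\partial V}{\partial p} < 0, \tag{15} $$

and from (6), (12), (13), (14), and (15),

$$ \frac{\partial^2 V}{\partial p^2} < 0. \tag{16} $$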
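The signs in (15) and (16) can be illustrated with a small numerical sketch. It again assumes the hypothetical utilities L(W) = 2√W and D(W) = √W, for which (10) has the closed-form solution used in `V` below; the right-hand side of (11) is compared with a finite-difference slope of V(p), and the curvature is estimated by differencing that slope.

```python
import math

W_BAR, P_BAR = 100.0, 0.10    # hypothetical initial wealth and risk

def L(W):
    return 2.0 * math.sqrt(W)     # assumed "life" utility

def D(W):
    return math.sqrt(W)           # assumed "death" utility

def dL(W):
    return 1.0 / math.sqrt(W)     # dL/dW

def dD(W):
    return 0.5 / math.sqrt(W)     # dD/dW

def V(p):
    """Closed-form solution of (10) for the assumed L and D."""
    return W_BAR * (1.0 - ((2.0 - P_BAR) / (2.0 - p)) ** 2)

def slope_from_eq11(p):
    """Right-hand side of (11).

    Primes denote derivatives with respect to V, so L' = -dL/dW and
    D' = -dD/dW, both evaluated at wealth W_BAR - V.
    """
    wealth = W_BAR - V(p)
    l_prime, d_prime = -dL(wealth), -dD(wealth)
    return (L(wealth) - D(wealth)) / ((1.0 - p) * l_prime + p * d_prime)

def central_diff(f, x, h=1e-5):
    """Two-sided finite-difference approximation to f'(x)."""
    return (f(x + h) - f(x - h)) / (2.0 * h)

results = []
for p in (0.05, 0.10, 0.30, 0.60):
    numeric_slope = central_diff(V, p)
    formula_slope = slope_from_eq11(p)
    curvature = central_diff(slope_from_eq11, p)
    results.append((p, numeric_slope, formula_slope, curvature))
    print(p, numeric_slope, formula_slope, curvature)
```

Under these assumptions both the slope and the curvature come out negative at every grid point, in line with (15) and (16).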


The graph of the relationship between V and p will therefore be of the
general form depicted in figure 1. Note first that V is positive for values of
p less than p̄, since the individual will be prepared to forfeit wealth to
effect improvements in safety. Conversely, V is negative for values of p
greater than p̄, since the individual will require compensation for
deterioration in his safety. It should perhaps be emphasized that allocative
decisions will frequently involve consideration of adverse as well as
beneficial effects on the safety of life, so that the relationship between V
and p for values of p exceeding p̄ is of more than academic interest.

Having established the general form of the relationship between V and
p, let us now consider three aspects of this relationship in greater detail:
(a) the behavior of V(p) as p approaches zero, (b) the behavior of V(p) as
p approaches unity, and (c) the derivative ∂V/∂p evaluated at p = p̄.
Clearly, (a) and (b) will involve individual attitudes toward extremes
of safety and danger and will therefore be relevant only for those
investment programs which have very pronounced effects on the safety of life.
However, −(∂V/∂p) evaluated at p = p̄ is the marginal value of a
decrease¹¹ in risk from the initial risk level and is therefore directly
relevant for all investment programs which have the effect of making life
marginally safer (or riskier).¹² Since investments having relatively small
effects on the safety of life are probably most common, it is clear that
∂V/∂p evaluated at p = p̄ is of primary importance for policy purposes.
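
The general shape of V(p) described above can be checked numerically. What follows is only a minimal sketch under stated assumptions, not part of the original analysis. It assumes that equation (10) takes the form (1 − p)L(W̄ − V) + pD(W̄ − V) = (1 − p̄)L(W̄) + p̄D(W̄), and it uses the purely illustrative utility functions L(W) = √W and D(W) = ½√W, which satisfy the sign restrictions in (13) and (14). The function name `compensating_variation` and the parameter values are hypothetical.

```python
import math


def L(w):
    # Illustrative utility of wealth if the individual survives (assumption).
    return math.sqrt(w)


def D(w):
    # Illustrative utility of wealth if the individual dies (assumption).
    return 0.5 * math.sqrt(w)


def compensating_variation(p, p_bar, wealth, tol=1e-9):
    """Solve the assumed form of eq. (10) for V by bisection."""
    target = (1 - p_bar) * L(wealth) + p_bar * D(wealth)

    def excess(v):
        # Expected utility at risk p after forfeiting v, minus initial level.
        w = wealth - v
        return (1 - p) * L(w) + p * D(w) - target

    hi = wealth   # forfeiting all wealth leaves zero utility, so excess(hi) < 0
    lo = -wealth  # a negative V is compensation received
    while excess(lo) < 0:
        lo *= 2   # widen the bracket until compensation is large enough
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if excess(mid) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    p_bar, wealth = 0.1, 100.0
    for p in (0.0, 0.05, 0.1, 0.2, 0.5):
        print(p, round(compensating_variation(p, p_bar, wealth), 4))
```

With these assumed functions V is positive below p̄, zero at p̄, and negative above it, and the computed values fall at an increasing rate as p rises, in line with (15) and (16).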

a) The behavior of V(p) as p approaches zero
Suppose that L(W) is such that

$$L^{-1}[(1 - \bar{p})L + \bar{p}D] > 0, \tag{17}$$

where L⁻¹(U) is the inverse function¹³ of L(W), L denotes L(W)
evaluated at W = W̄, and D denotes D(W) evaluated at W = W̄. The
implication of (17) is that a positive value of W makes L(W) equal to
initial expected utility.


¹¹ Note that ∂V/∂p is the marginal value of an increase in risk and is, therefore, negative.
By contrast, −(∂V/∂p) is the marginal value of a decrease in risk and is consequently
positive.

¹² For a society of n individuals, the aggregate Hicksian compensating variation in
wealth for changes in direct risk (in Mishan's sense; see n. 2) due to some public
investment program would then be approximated by

$$\sum_{i=1}^{n} \left(\frac{\partial V_i}{\partial p_i}\right)_{\bar{p}_i} \delta p_i,$$

where δp_i is the (small) change in direct risk for the ith individual.

¹³ It is assumed throughout that this function is defined for all values of U that we
consider; i.e., it is assumed that if L(W) is bounded below, then the greatest lower
bound, γ, is such that U > γ for all values of U considered, while if L(W) is bounded
above, the least upper bound, α, is assumed to be such that U < α for all values of
U considered.




In this case, setting p = 0 in (10) yields a value of V, say V₀, such that

$$V_0 < \bar{W}. \tag{18}$$

If (17) holds, then, the individual will not be prepared to bankrupt
himself to make life completely safe during the current period,¹⁴ and the
graph of V(p) will have an intercept V₀ (< W̄) on the V-axis.

If, by contrast, the inequality in (17) is reversed, then the individual
will be prepared to bankrupt himself in order to reduce the probability
of death in the current period from p̄ to some value p̂ such that

$$0 < \hat{p} < \bar{p}. \tag{19}$$

In order to determine the behavior of V(p) for values of p between p̂
and zero, it is necessary to establish whether the individual is able to
borrow when bankrupt (i.e., when W ≤ 0). If he is unable to do so, then,
as noted earlier,¹⁵ V will be subject to the constraint

$$V \le \bar{W}. \tag{20}$$

The function V(p) will then be such that

$$0 \le p \le \hat{p} \;\Rightarrow\; V(p) = \bar{W}. \tag{21}$$

The graph of V(p) will therefore become horizontal for values of p ≤ p̂.
Note that since (15) and (16) apply only when (10) holds as a strict
equality, which will not be the case for values of p < p̂, there is no
inconsistency among (15), (16), and (21).

If bankruptcy imposes no constraint on borrowing, then V will
continue to increase as p is reduced below p̂. The graph of V(p) will then
have an intercept V₀′ on the V-axis such that

$$V_0' > \bar{W}. \tag{22}$$

Finally, suppose that

$$L^{-1}[(1 - \bar{p})L + \bar{p}D] = 0. \tag{23}$$

In this case the individual will not bankrupt himself for anything other
than the complete elimination of the risk of death during the current
period. The graph of V(p) will therefore have an intercept V₀″ on the
V-axis such that

$$V_0'' = \bar{W}. \tag{24}$$

b) The behavior of V(p) as p approaches unity

It can be shown that if L(W) is not bounded above, then it is possible to
construct a variant of the Saint Petersburg game such that the individual
will be prepared to pay an indefinitely large sum to participate in the

¹⁴ This is not intended to imply that it is in fact physically possible to make life
completely safe during any period.
¹⁵ See n. 9 above.




game.¹⁶ Since it is an observable fact that the majority of people will not
pay indefinitely large stakes to participate in such games, it would seem
most reasonable to regard L(W) as bounded above. If this is so, then by
(6), D(W) will also be bounded above. The least upper bound of L(W)
will be denoted by α and the least upper bound of D(W) by β.¹⁷
Suppose that

$$\beta > (1 - \bar{p})L + \bar{p}D. \tag{25}$$

It follows that there exists a finite value of W which will make D(W)
equal to the initial level of expected utility. There must therefore exist a
finite value of V which will satisfy equation (10) when p = 1, so that there
exists a finite sum sufficient to compensate the individual for the certainty
of death during the current period. In this case, the graph of V(p) will
have a finite negative intercept on the line p = 1.
Conversely, suppose that

$$\beta \le (1 - \bar{p})L + \bar{p}D. \tag{26}$$

In this case, no finite sum will be sufficient to compensate the individual
for the certainty of death. Furthermore, it follows from (2) and (10) that

$$\lim_{p \to p^*} V = -\infty, \tag{27}$$

where

$$p^* = \frac{\alpha - (1 - \bar{p})L - \bar{p}D}{\alpha - \beta}. \tag{28}$$

Note that if (26) holds as a strong inequality, then p* will be strictly less
than unity, so that there will exist a range of probabilities of death
(namely, p* ≤ p ≤ 1) for which it will be impossible to compensate the
individual.¹⁸ The graph of V(p) will then be asymptotic to the line
p = p*.

Figure 2 summarizes the various results that have been established
concerning the behavior of V(p) for extreme values of p.


c) The derivative ∂V/∂p evaluated at p = p̄

When p = p̄, it follows from (10) that V = 0. The derivative ∂V/∂p
evaluated at p = p̄, which will be denoted by (∂V/∂p)_p̄, may therefore be
obtained simply by setting p = p̄ and V = 0 in (11), that is,

$$\left(\frac{\partial V}{\partial p}\right)_{\bar{p}} = \frac{L - D}{(1 - \bar{p})L' + \bar{p}D'}, \tag{29}$$


¹⁶ See, e.g., Arrow (1965).

¹⁷ The continuity of L(W) and D(W), together with the restrictions embodied in (6)
and (7), implies that α > β.

¹⁸ Note that V(p) is then defined only in the interval 0 ≤ p < p*.






Fig. 2. (1), no bankruptcy at p = 0; (2), bankruptcy constraint for 0 ≤ p ≤ p̂;
(3), no bankruptcy constraint; (4), possibility of compensating for death; (5), impossibility
of compensating for p* ≤ p ≤ 1.


where L' denotes ∂L(W̄ − V)/∂V evaluated at V = 0 and D' denotes
∂D(W̄ − V)/∂V evaluated at V = 0.
Now let us consider the manner in which the marginal value of a
decrease in risk varies with initial risk, p̄, and initial wealth, W̄.
Differentiating through (29) partially with respect to p̄ gives

$$\frac{\partial(\partial V/\partial p)_{\bar{p}}}{\partial \bar{p}} = \frac{(D - L)(D' - L')}{[(1 - \bar{p})L' + \bar{p}D']^2}, \tag{30}$$

while differentiating through (29) partially with respect to W̄ gives

$$\frac{\partial(\partial V/\partial p)_{\bar{p}}}{\partial \bar{W}} = \frac{[(1 - \bar{p})L' + \bar{p}D'](D' - L') + (L - D)[(1 - \bar{p})L'' + \bar{p}D'']}{[(1 - \bar{p})L' + \bar{p}D']^2}, \tag{31}$$






where L'' denotes ∂²L(W̄ − V)/∂V² evaluated at V = 0 and D''
denotes ∂²D(W̄ − V)/∂V² evaluated at V = 0. Thus, from (6), (13),
and (30),

$$\frac{\partial(\partial V/\partial p)_{\bar{p}}}{\partial \bar{p}} < 0, \tag{32}$$

while from (6), (13), (14), and (31),

$$\frac{\partial(\partial V/\partial p)_{\bar{p}}}{\partial \bar{W}} < 0. \tag{33}$$

It will be recalled that the marginal value of a decrease in risk is given
by −(∂V/∂p). Thus, (32) and (33) indicate that the marginal value of a
decrease in risk increases with both initial risk and initial wealth.
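
As a numerical illustration of (29), (32), and (33), the sketch below evaluates the marginal value of a decrease in risk, −(∂V/∂p) at p = p̄. It is written in terms of derivatives with respect to wealth, which are minus the derivatives with respect to V used in the text, so that the expression becomes (L − D)/[(1 − p̄)dL/dW + p̄dD/dW]. The utility functions √W and ½√W, the function names, and the central-difference step are illustrative assumptions rather than anything specified in the paper. The helper `aggregate_value` applies the same first-order approximation to a hypothetical list of individuals.

```python
import math


def L(w):
    return math.sqrt(w)          # assumed utility if alive


def D(w):
    return 0.5 * math.sqrt(w)    # assumed utility if dead


def slope(f, w, h=1e-5):
    # Central-difference approximation to df/dW.
    return (f(w + h) - f(w - h)) / (2 * h)


def marginal_value_of_safety(p_bar, wealth):
    """-(dV/dp) at p = p_bar, from eq. (29) restated with wealth derivatives."""
    denom = (1 - p_bar) * slope(L, wealth) + p_bar * slope(D, wealth)
    return (L(wealth) - D(wealth)) / denom


def aggregate_value(individuals):
    """First-order aggregate value of small risk reductions.

    `individuals` is a list of (p_bar, wealth, risk_reduction) tuples, where
    risk_reduction is the small positive fall in each person's risk.
    """
    return sum(marginal_value_of_safety(p, w) * dp for p, w, dp in individuals)


if __name__ == "__main__":
    for p_bar in (0.01, 0.1, 0.3):
        print("p_bar", p_bar, round(marginal_value_of_safety(p_bar, 100.0), 3))
    for wealth in (50.0, 100.0, 200.0):
        print("wealth", wealth, round(marginal_value_of_safety(0.1, wealth), 3))
```

Under these assumed functions the marginal value rises with both initial risk and initial wealth, which is the pattern that (32) and (33) predict.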


3. Indirect Risks, Nonfatal Accidents, and Multiperiod Changes 
in Risk 

This section outlines the manner in which the analysis might be modified
to accommodate indirect risks (i.e., outcomes of death or injury to
others), nonfatal accidents, and changes in risk effective over extended
periods of time.

At the most general level, indirect risks and nonfatal accidents can be
dealt with simply by extending the number of states of the world from
"life" and "death" for the individual himself to the various mutually
exclusive combinations of life, death, and injury for the individual and
those he cares about. The functions L(W) and D(W) of the earlier
analysis are then supplemented by as many functions as there are additional
mutually exclusive states of the world. Clearly, the results of an analysis
extended to cover indirect risks and nonfatal accidents will depend
crucially on the relative magnitude of the various utility indices and on
the signs and relative magnitudes of their first and second derivatives with
respect to initial wealth. It is because of the extensive restrictions required
to generate determinate results in such cases that a purely a priori analysis
is unlikely to prove particularly fruitful.

However, the most significant single deficiency of the earlier analysis
is that the changes in risk were taken to be effective only during the
current period. In order to expand the framework of the analysis to cope
with changes in risk effective over extended time horizons, it would seem
most satisfactory to deal with continuous rather than discrete time. This
may be done within the context of the Luce-Krantz expected-utility
analysis by considering, as states of the world, "death at time t," where t
takes on any value in some interval on the real line. One of the advantages
of this procedure is that the potential temporal ambiguity associated with
the outcome of death during, say, the ith period is completely eliminated,
since it becomes possible to talk of death at a particular point of time.




Suppose that the utility index associated with initial wealth W and death
at time t is denoted by U(W, t), the latter being a continuous and twice
differentiable function of both arguments. If the (continuous) probability
density function for the time of death is then written as p̄(t), expected
utility for initial wealth W̄ and probability density function p̄(t) is given
by

$$E(U) = \int_{\tau}^{T} U(\bar{W}, t)\,\bar{p}(t)\,dt, \tag{34}$$

where time τ is the present and time T is the latest date at which the
individual conceives that he will still be alive. The maximum amount
which the individual would forfeit (or the minimum sum he would accept
as compensation) for swapping the probability density function p̄(t) for
some other density function p(t) would then be given by V, where V is
such that¹⁹

$$\int_{\tau}^{T} U(\bar{W} - V, t)\,p(t)\,dt = \int_{\tau}^{T} U(\bar{W}, t)\,\bar{p}(t)\,dt. \tag{35}$$

Given appropriate restrictions on the form²⁰ of U(W, t), it should then
be possible to develop qualitative theorems concerning the value of
different types of change in the probability density function, though it is
not at all obvious which particular types of change should be considered
in an a priori analysis of the "general case."
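
To make (35) concrete, the following sketch solves it numerically for one hypothetical case. Everything specific here is an assumption made for illustration: the present is taken as τ = 0, the horizon as T = 60, the utility index as U(W, t) = √W (1 − e^(−t/20)), and the two densities as a uniform density and one that shifts probability toward later dates. The integrals are approximated by the midpoint rule and V is found by bisection.

```python
import math

T = 60.0  # assumed latest conceivable date of death, with the present at 0


def U(w, t):
    # Assumed utility index, increasing and concave in wealth and in t.
    return math.sqrt(w) * (1.0 - math.exp(-t / 20.0))


def expected_utility(w, density, n=2000):
    # Midpoint-rule approximation to the integral in eq. (34).
    dt = T / n
    total = 0.0
    for i in range(n):
        t = (i + 0.5) * dt
        total += U(w, t) * density(t)
    return total * dt


def value_of_density_change(wealth, old_density, new_density, tol=1e-8):
    """Solve eq. (35) for V by bisection."""
    target = expected_utility(wealth, old_density)
    lo, hi = -wealth, wealth  # at V = wealth, utility is zero
    while expected_utility(wealth - lo, new_density) < target:
        lo *= 2               # widen until enough compensation is allowed
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if expected_utility(wealth - mid, new_density) > target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def uniform(t):
    return 1.0 / T            # death equally likely at any date


def later(t):
    return 2.0 * t / T ** 2   # probability shifted toward later dates


if __name__ == "__main__":
    print(round(value_of_density_change(100.0, uniform, later), 3))
    print(round(value_of_density_change(100.0, later, uniform), 3))
```

In this assumed case the individual would pay a positive sum to move from the uniform density to the later-weighted one and would require compensation for the reverse change.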


4. Concluding Comments 

In the preceding sections we have succeeded in developing a set of
interesting results concerning the value, in the form of Hicksian
compensating variations in wealth, of current period changes in the risk of
death. An attempt has also been made to outline the manner in which the
analysis might be extended to accommodate (a) nonfatal accidents,
(b) the risk of death and nonfatal injury to individuals other than the


¹⁹ Note that eq. (10) above may be regarded as the special case of (35) obtained when
U(W, t) is the discontinuous function

$$U(W, t) = D(W) \quad (\tau \le t \le t_1),$$

$$U(W, t) = L(W) \quad (t_1 < t \le T),$$

where t₁ is the time at which the current period ends.

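²⁰ E.g.,

$$\frac{\partial U}{\partial W} > 0, \qquad \frac{\partial^2 U}{\partial W^2} < 0,$$

and

$$\frac{\partial U}{\partial t} > 0, \qquad \frac{\partial^2 U}{\partial t^2} < 0.$$

However, the appropriate assumption concerning the sign of ∂²U/∂W∂t does not seem
at all clear a priori.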




decision maker under consideration, and (c) changes in risk effective
over an extended time horizon. However, further analysis of these three
cases would require stronger specification of the utility functions (and
the probability density function for time of death in case [c]) than seemed
appropriate in the current context.

Of course, qualitative information is of only limited usefulness, and it is
natural to wonder whether the analysis developed above might provide an
effective basis for the derivation of quantitative information concerning
the private values of changes in risk.²¹ In fact, it seems that if it were possible
to obtain some form of estimate of the utility functions L(W) and D(W),
then derivation of the actual values of various single-period risk changes,
and in particular the marginal value of changes in risk, would be a
relatively straightforward matter. Some variant of the experimental
procedures developed by Mosteller and Nogee (1951) or Davidson,
Suppes, and Siegel (1957) might well provide the basis for a procedure
capable of generating estimates of the utility functions.


Appendix 

Life Insurance Contracts²²

Suppose that the individual enters into a contract with an insurance company
whereby he pays a premium, R, in order that the beneficiary nominated in the
contract should receive a sum, I (dependent on R), contingent on the individual's
death.²³ Clearly, this is a highly simplified version of the typical life insurance
contract, but it does capture those features important for the present discussion.

The individual's optimal expenditure on insurance will be such as to maximize
expected utility, E(U), given by

$$E(U) = (1 - \bar{p})L(\bar{W} - R) + \bar{p}D(\bar{W} + I - R), \tag{36}$$

where

$$I = I(R) \tag{37}$$

and

$$R \ge 0. \tag{38}$$

For simplicity, the net insurance payoff will be denoted by N, so that

$$N = I - R. \tag{39}$$

If the individual insures in some nonzero amount and I(R) is continuous and
differentiable, then the optimal value of R will satisfy the necessary condition for

²¹ For an example of a very imperfect attempt to obtain empirical information
concerning the value of probability changes, see Jones-Lee (1969).

²² My thoughts on this subject were greatly clarified by a discussion with Jacques Drèze.
²³ It is essential to appreciate that expenditure on the kind of life insurance under
consideration is not intended to affect the safety of life in any way whatever. There may
of course exist a "moral hazard" in the sense that the individual, once insured, will take
a more reckless attitude toward his own safety. However, this possibility will be ignored
in the analysis which follows.




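a turning-point maximum of E(U) given by

$$(1 - \bar{p})\frac{\partial L(\bar{W} - R)}{\partial R} + \bar{p}\,\frac{\partial D(\bar{W} + N)}{\partial N}\,\frac{dN}{dR} = 0. \tag{40}$$

It will be assumed that the insurance payoff, I, is proportional to the premium,
R, so that we may write

$$N = kR \quad (k > 0). \tag{41}$$

Now insurance is typically categorized in the following manner:²⁴

$$\text{fair insurance:} \quad k = \frac{1 - \bar{p}}{\bar{p}}; \tag{42}$$

$$\text{less than fair insurance:} \quad k < \frac{1 - \bar{p}}{\bar{p}}; \tag{43}$$

$$\text{more than fair insurance:} \quad k > \frac{1 - \bar{p}}{\bar{p}}. \tag{44}$$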

Thus, from (40), (41), (42), and (43) it follows that if the individual purchases
fair or less than fair insurance, then

$$\frac{\partial D(\bar{W} + N)}{\partial N} \ge -\left[\frac{\partial L(\bar{W} - R)}{\partial R}\right]. \tag{45}$$

But ∂D(W̄ + N)/∂N is simply ∂D(W)/∂W evaluated at W = W̄ + N,
while −[∂L(W̄ − R)/∂R] is equal to ∂L(W)/∂W evaluated at W = W̄ − R.
Inequality (45) is therefore inconsistent with the restrictions on L(W) and D(W)
given in (7) and (6). Consequently, in the case of an individual who purchases
either fair or less than fair insurance of the type described, it would seem necessary
to discard our earlier assumption concerning relative sensitivity to variations in
wealth contingent on the life and death outcomes. The crucial question is therefore
whether or not the observable prevalence of the "life insurance" type of contract
(this being a generic term describing a wide variety of different contractual
arrangements between insurers and insured) does in fact reveal a widespread
willingness to undertake fair or less than fair insurance of the specific type
described above. There are at least two reasons for doubting that this is the case. In
the first place, insurance premiums are partially tax deductible in the United
Kingdom and the United States. Thus, even though insurance companies will
typically offer less than fair insurance (in order to cover administrative costs and
earn a return on capital), it may nonetheless be the case that after tax such
insurance will be more than fair for individuals whose estates are sufficiently
modest to escape death duties. Moreover, the typical life insurance contract is for
"endowment" insurance in which the face value of the policy accrues to the
individual or his estate at the earlier date of the term of the policy or the death
of the individual. This type of insurance contract therefore provides "pure" life
insurance jointly with an extended savings commitment.

For these reasons, there would seem to be very little justification for concluding
that the popularity of life insurance implies a widespread willingness to enter
into fair or less than fair versions of the type of insurance contract described by
equations (37), (38), (39), and (41).
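
The argument of this Appendix can be illustrated with a small numerical sketch. It is an illustration only: the utility functions L(W) = √W and D(W) = ½√W are assumptions chosen to satisfy the earlier restrictions, the function names are hypothetical, and the optimum is located by a simple grid search over premiums rather than by solving (40) analytically.

```python
import math


def expected_utility(R, p_bar, wealth, k):
    # Eq. (36) with N = kR from eq. (41); the sqrt utilities are assumptions.
    alive = math.sqrt(wealth - R)
    dead = 0.5 * math.sqrt(wealth + k * R)
    return (1 - p_bar) * alive + p_bar * dead


def optimal_premium(p_bar, wealth, k, steps=10000):
    # Grid search over 0 <= R < wealth, respecting constraint (38).
    grid = [wealth * i / steps for i in range(steps)]
    return max(grid, key=lambda R: expected_utility(R, p_bar, wealth, k))


if __name__ == "__main__":
    p_bar, wealth = 0.1, 100.0
    fair_k = (1 - p_bar) / p_bar          # eq. (42)
    print("fair:", optimal_premium(p_bar, wealth, fair_k))
    print("more than fair:", optimal_premium(p_bar, wealth, 40.0))
```

With these assumed functions no insurance is bought on fair terms, and a positive premium is chosen only when the terms are sufficiently more than fair, which is consistent with the inconsistency between (45) and the earlier restrictions noted above.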


²⁴ See, for example, Friedman and Savage (1948).





References

Arrow, K. J. Aspects of the Theory of Risk-Bearing. Helsinki: Yrjö Jahnssonin Säätiö,
1965.

Davidson, D.; Suppes, P.; and Siegel, S. Decision-Making: An Experimental
Approach. Stanford, Calif.: Stanford Univ. Press, 1957.

Drèze, J. "L'utilité sociale d'une vie humaine." Rev. française recherche opérationnelle 6,
no. 23 (1962): 93-118.

Friedman, M., and Savage, L. J. "The Utility Analysis of Choices Involving
Risk." J.P.E. 56 (August 1948): 279-304.

Fromm, G. "Comment on Schelling." In Problems in Public Expenditure Analysis,
edited by S. B. Chase. Washington: Brookings Inst., 1968.

Hicks, J. R. "The Four Consumers' Surpluses." Rev. Econ. Studies 11 (1943): 31-41.

Hirshleifer, J. "Investment Decisions under Uncertainty: Choice-Theoretic
Approaches." Q.J.E. 79 (November 1965): 509-36.

---. "Investment Decision under Uncertainty: Applications of the State
Preference Approach." Q.J.E. 80 (May 1966): 252-77.

Jones-Lee, M. W. "Valuation of Reduction in Probability of Death by Road
Accident." J. Transport Econ. and Policy 3 (January 1969): 37-47.

Luce, R. D., and Krantz, D. H. "Conditional Expected Utility." Econometrica 39
(March 1971): 253-71.

Mishan, E. J. "Evaluation of Life and Limb: A Theoretical Approach." J.P.E. 79
(July/August 1971): 687-705.

Mosteller, F., and Nogee, P. "An Experimental Measurement of Utility." J.P.E.
59 (October 1951): 371-404.

Savage, L. J. The Foundations of Statistics. New York: Wiley, 1954.

Schelling, T. C. "The Life You Save May Be Your Own." In Problems in Public
Expenditure Analysis, edited by S. B. Chase. Washington: Brookings Inst., 1968.




A Reexamination of the Kinky Oligopoly 
Demand Curve 

Walter J. Primeaux, Jr., and Mark R. Bomball
University of Mississippi


Over 30 years have elapsed since the kinked demand curve was introduced
as a theoretical hypothesis to explain oligopoly pricing behavior.¹ There
have been no empirical studies confirming the pricing behavior predicted
by the hypothesis, and two separate studies by G. Stigler and J. Simon
have rejected the theory,² yet the kinky demand curve persists as the theory
of oligopoly for many economists. Obviously there must be some
explanation for the tenacity with which economists cling to the kinky demand
curve. At least three possible explanations exist for the survival of the
theory up to this time: (1) perhaps the profession has elected to ignore
the previous empirical evidence rejecting the theory; (2) perhaps there
are problems in the previous studies which make their results something
less than conclusive; and (3) the profession (because of the second
possibility) requires additional evidence prior to passing final judgment on the
theory. Apparently the explanation is that there is a need to generate
more empirical evidence and tests of the kinky demand curve theory;
that is the main purpose of this paper.

Broadly, the results of this study fail to support the kinked demand
curve hypothesis. The results of this analysis, therefore, are generally
consistent with those of both Stigler and Simon in their earlier studies.

The authors are indebted to George J. Stigler, Almarin Phillips, Julian L. Simon,
Robert J. Lamer, Coldwell Daniel III, Robert H. Raiche, and an anonymous referee
for their helpful comments and suggestions. Charles G. Moacanllo of the Federal Power
Commission was very helpful in supplying data and information. A portion of the
research for this paper was conducted while Primeaux was a visitor at the Economics
Department of the University of Pennsylvania preparing another paper for the Brookings
Institution. The first paper, "A Re-Examination of the Monopoly Market Structure for
Electric Utilities," was presented at a Brookings conference on Antitrust and Regulated
Industries, October 28, 1971; publication is forthcoming. Throughout this paper, that
project is referred to as the original study.

¹ The theory is generally credited to Sweezy (1939). Similar pricing behavior is also
discussed by Hall and Hitch (1939).

² Stigler 1947 and 1968; Simon 1969. The Stigler pages cited here refer to the 1968
version.



Previous Studies 




Stigler's landmark study examined price rigidity in industries which were
monopolies and compared the results with price rigidity in oligopolistic
industries. Stigler in testing the theory argued that "the kinky demand
curve would prove to be an incorrect or unimportant construction if
oligopoly prices were as flexible as monopoly and/or competitive prices."³
Although Stigler's investigation rejected the kinky demand curve
hypothesis, there were several problems associated with his study which
could have affected its results. First, there were several data problems
generally acknowledged by Stigler in the article.⁴ Second, Stigler's
analysis did not involve very many monopoly producers for comparison with
oligopolists; only two monopolies are included in his study.⁵ Third, the
firms examined by Stigler produced products which are heterogeneous
between the monopoly and oligopoly market structures (for example,
automobiles, steel, nickel, and aluminum, among others); to the extent
that price elasticity of demand for a product affects pricing behavior, and
to the extent that this elasticity varies among products, some of Stigler's
counted price changes among and between industries could be the results
of product differences, rather than differences in market structure.

Simon's study dealt with magazines and was also concerned with
examining the number of price changes among firms to determine if there was,
in fact, more price rigidity in oligopoly than in monopoly. Simon's study
also rejected the kinked demand curve theory, but his analysis was
similarly based on data which were characterized by certain weaknesses.
First, as indicated by Simon, the status of any magazine as an oligopolist
is doubtful, because the audiences of various magazines differ from one
another so they are not perfect substitutes;⁶ since various magazines
constituted his universe, perhaps the outcome of the study was affected by
this problem. Second, Simon was dealing with published advertising rates;
there could have been haggling with major purchasers of advertising over
the actual transaction price.⁷


Methodology 

The procedures used in this study are very similar to those used by Stigler
and Simon. The essential difference exists in the universe and sample
used, which overcomes virtually all of the data problems of the previous


³ Stigler, p. 221.

⁴ Ibid., pp. 222-23. At the time his study was made, given his industries, these
difficulties could not be avoided.

⁵ Ibid., p. 224.

⁶ Simon 1969, p. 972.

⁷ Simon, however, indicates that he did not consider this a serious problem (ibid.,
p. 971).




studies. The focus of attention is upon the relative number of price changes
of a subset of firms in monopoly markets compared with another subset
of firms which are oligopolists. Stigler, as quoted earlier, asserted that if
oligopoly prices were as flexible as monopoly and/or competitive prices,
the kinky demand curve would prove to be an incorrect or unimportant
construction.⁸

Although frequently overlooked by economists, there are electric utility
duopolies in the United States.⁹ In the cities of interest, a municipally
owned electric firm was in direct competition with a privately owned firm,
and there was duplication of electric facilities. The customer in these
cities had a choice of being served by one firm or the other. In the Texas
and Missouri cities included, for example, a customer could switch from
one firm to the other at will. In Portland, Oregon, on the other hand,
new customers could take service from either company; however,
customers could not switch from one electric firm to the other in that city.
This contrast is made so that the competitive conditions in these cities
will not be confused with those existing in cities where territories are
allocated and duplication of facilities does not exist. Cities of this type are
of no interest to this study.

The methodology involved an examination of price changes in
time-series data for each firm operating in cities with two electric utility firms.
Such an examination would reveal oligopoly pricing behavior. For
purposes of comparison with the pricing behavior of the duopoly firms,
another subset of firms was selected which was composed entirely of
monopoly utility firms. The monopoly subset consisted entirely of
municipally owned firms which were virtually free of price regulation. In the
duopoly subset, a municipally owned firm competed with a privately
owned firm; in these cases, the privately owned firms were subjected to
some sort of price regulation at either the state or local level. The
municipally owned firms generally were free from price regulation. The results
of this study, as will be explained later, were not affected by these
considerations.

One may be inclined to question the comparison of municipally owned
and privately owned electric utilities because of possible differences in
profit incentives between firm types. This tendency may raise two points
concerning the procedure outlined above. First, it may be asked whether
the comparisons are valid; that is, it may be asked whether municipal
firms behave like private firms since they are "nonprofit" institutions and
thus not profit oriented as privately owned utilities. Second (because of
the first question), the question may be raised as to whether these
nonprofit monopoly firms are a valid control with which to compare the

⁸ Stigler, p. 221.

⁹ Primeaux 1971.




behavior of the duopolies in the study. Although the common impression
is that municipally owned firms are nonprofit, public-serving firms, their
profit performance indicates that they are indeed profit oriented. For
example, after discussing the level of profit for privately owned electric
firms from 1950 to 1961, W. G. Shepherd explained that "despite the
common impression, the average rates of return for publicly owned
electricity suppliers (other than Federal projects) have been almost identical
with these private profit rates."¹⁰ Moreover, data from the Federal
Power Commission also indicate that municipally owned electric utility
firms seek profits. These data show that in 1968 net income amounted to
15.4 percent of sales for privately owned electric utilities and 22.3 percent
for municipal electric utilities.¹¹ P. Mann found that municipally owned
firms in his study earned a 7.14 percent rate of return on net investment
while privately owned firms earned 6.74 percent.¹² M. Colberg has also
discussed the profit orientation of municipally owned utility firms.¹³ Thus
it is obvious that municipally owned electric firms are not nonprofit
institutions, and they are indeed motivated to seek profits. The fact that
the average profit rates of municipally owned utilities are comparable with
and sometimes exceed those of privately owned utilities seems to
satisfactorily answer the two questions posed above; apparently municipally
owned electric utilities' profit incentives are more like those of privately
owned electric firms than is commonly supposed.

As indicated above, the procedure used in this study eliminates virtually
all of the data weaknesses of the previous studies. Oligopoly pricing
behavior is observed where the products are homogeneous, for which
there are no good substitutes for most uses; therefore, the influence of the
differences in type of product on the number of price variations is
eliminated. Moreover, the study examined an industry where rate schedules
are not frequently varied, where deviations from printed rate schedules
constitute illegal behavior for public utilities; there is no haggling
concerning published prices as there could have been in previous studies.
Also, since prices are not generally varied more than once during a given
year, the possibility of distorted data is reduced. The data do not include
years of depressed economic conditions which could abnormally reflect
on firm pricing behavior. Another advantage of this approach is that there
is a larger sample of monopolists than in Stigler's study. Furthermore, the
duopolists in this study are clean-cut oligopolists, and there is no doubt
concerning this matter as in Simon's study. Because of the advantages
discussed above, after eliminating problems associated with previous
studies, it should be possible to draw important conclusions concerning

¹⁰ This refers to a return on capital (see Shepherd 1966, pp. 21-22).

¹¹ Information obtained from the Federal Power Commission, January 8, 1970.

¹² Mann 1970.

¹³ Colberg 1955. This reference explains that electric profits were an important
source of revenue for Florida cities and were a substitute for taxation.




oligopoly pricing behavior by comparing the pricing variations of the
duopoly subset with the monopoly subset.

The number of electric duopolies in this country is limited; therefore,
it was necessary to include as many cities with that characteristic as
possible. Even so, certain conditions discussed below made it necessary
to exclude some electric duopolies from the study. The monopoly subset
was essentially that group included in the original study.¹⁴ Each firm in this
subset was selected for the original study because of specific
characteristics; the selection was on an entirely nonrandom basis. The major
criteria for their selection were as follows. First, to the extent possible, the
monopoly firm should be from the same state as the competitive firms.
Second, the monopoly firm should be approximately the same size as the
competitive firms; however, if no such firm existed in the relevant state a
larger firm would be acceptable, and in no case would a monopoly firm
be smaller than the competitive firms. Whenever necessary to select a
monopoly firm from another state, the substitutes would be chosen so
that they would be situated in states geographically near to the state of
the competing firm. Third, to the extent possible, types of power sources
would be identical for the "matched" competitive and the monopoly
firms.

The original study consisted of 23 duopolies and 24 monopolies. The
same sample was the original target of this study. However, since several
of the duopoly cities did not have competition during the entire 11 years
included in this study, and several of the monopoly firms experienced
competition at some time during this period, the decision was made to
eliminate all firms for which the market structure had changed anytime
during the years included. As a result of these eliminations, the final
sample for this study consists of 17 cities with competition (since these are
duopolies, this subset consists of 34 firms) and 22 municipally owned
monopolies, which constitute the subset of matched firms. The firms finally
included, and the cities in which they are located, are presented in the
Appendix below.


Nature of the Tests 

Two different methods were employed to test the kinked demand curve
hypothesis. First, the number of price changes for the years included was
computed for both the competitive and monopoly subsets; since the kinky
demand curve predicts fewer changes in the duopoly case than in the
monopoly case, the theory would not be upheld if the duopoly subset in
this study displayed more price changes than the monopoly subset.

¹⁴ Some firms were eliminated from this subset for various reasons which will be
discussed below. In the earlier study, this subset was selected on the basis of important
characteristics which are relatively unimportant to this study.



A second test consisted of counting the number of times price increases
or decreases by one firm were followed by a competitor. The theory
predicts that price decreases would be followed but price increases would
not be followed.


Number of Price Changes

The pricing information used was extracted from the FPC’s Typical
Electric Bills. Each category or classification of price per kilowatt hour was
treated as a separate and distinct product, and the number of price
changes was calculated for each year. In other words, 25 kwh of
consumption was considered one product, 40 kwh another product, etc.
The various categories or classifications used are presented in table 1.
The procedure in this analysis follows very closely that employed by
Simon.¹⁶ First, the number of price changes in each product category
was computed. This procedure yields the data presented in columns 1
and 3 of table 1. Then, the grand mean number of change years was
computed by dividing the number of price changes in columns 1 and 3 by the
number of firms in each type of market structure to arrive at an average
which would make comparisons possible; otherwise, the effective number
of price changes would be affected by differences in the sizes of the two
subsets of market structures.
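The averaging step can be checked with a short Python sketch. It is an illustration rather than the author's own computation: the counts and printed means are transcribed from table 1, the firm counts (22 monopolies, 34 duopoly firms) come from the sample description, and the variable and function names are hypothetical.

```python
# Sketch: recompute the "grand mean number of change years" in table 1
# by dividing each price-change count by the number of firms in the subset.
# All names are illustrative; the numbers are transcribed from the article.

FIRMS = {"monopoly": 22, "duopoly": 34}  # 17 duopoly cities x 2 firms

PRICE_CHANGES = {  # columns 1 and 3 of table 1
    "monopoly": {"1959-62": [1, 2, 4, 4, 5], "1964-70": [14, 19, 22, 23, 24]},
    "duopoly": {"1959-62": [9, 9, 9, 10, 11], "1964-70": [35, 42, 42, 48, 52]},
}

PRINTED_MEANS = {  # grand means as printed in table 1
    "monopoly": {
        "1959-62": [0.05, 0.09, 0.18, 0.18, 0.23],
        "1964-70": [0.64, 0.86, 1.00, 1.05, 1.09],
    },
    "duopoly": {
        "1959-62": [0.27, 0.27, 0.27, 0.29, 0.32],
        "1964-70": [1.03, 1.24, 1.24, 1.41, 1.53],
    },
}


def grand_means(counts, n_firms):
    """Average number of price changes per firm, rounded to two decimals."""
    return [round(count / n_firms, 2) for count in counts]


recomputed = {
    (market, period): grand_means(counts, FIRMS[market])
    for market, periods in PRICE_CHANGES.items()
    for period, counts in periods.items()
}

# Largest gap between a recomputed value and the printed one.
max_gap = max(
    abs(mine - printed)
    for (market, period), values in recomputed.items()
    for mine, printed in zip(values, PRINTED_MEANS[market][period])
)
```

Under these assumptions the recomputed averages agree with the printed ones to within 0.01; the only differences are the duopoly entries for 9 changes, where 9/34 rounds to 0.26 and the table shows 0.27.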

The computations in table 1 reveal that there is significantly more price
rigidity in the monopoly market structure than in the oligopoly market
structure. These results reveal that monopoly electric utilities with no
competition change prices less frequently than the firms in the duopoly
market structure; these results contradict the kinky demand
curve hypothesis and are consistent with those presented by both Stigler
and Simon.


Price Increases and Decreases

A fundamental proposition underlying the kinky demand curve is that
rivals will follow price decreases but will not follow price increases. This
section examines this basic proposition of the theory. Using price
information for the 1959-70 period (excluding 1963 for reasons indicated in n. 15),
price changes were observed in the duopoly markets, and each rival’s

¹⁵ This procedure is consistent with that used by the Federal Power Commission
(Typical Electric Bills, 1969, p. 6). It was necessary to eliminate 1963 from the study
because the FPC used uniform categories between 1959 and 1962, changed in 1963, and
changed again in 1964. Between 1964 and 1970 the classifications were uniform but
different from those used in 1959-62. Since unique categories existed in 1963, no
comparisons were possible; thus the elimination.

¹⁶ Simon, pp. 972-73.





TABLE 1

Presentation of Statistical Data: No. of Price Changes

                        1959-62                           1964-70
            -------------------------------   -------------------------------
                     No. of     Grand Mean             No. of     Grand Mean
                     Price      No. of                 Price      No. of
              kwh    Changes    Change Yrs      kwh    Changes    Change Yrs
                     (1)        (2)                    (3)        (4)
------------------------------------------------------------------------------
Monopoly       25       1         0.05          100      14         0.64
               40       2         0.09          250      19         0.86
              100       4         0.18          500      22         1.00
              250       4         0.18          750      23         1.05
              500       5         0.23        1,000      24         1.09

Duopoly        25       9         0.27          100      35         1.03
               40       9         0.27          250      42         1.24
              100       9         0.27          500      42         1.24
              250      10         0.29          750      48         1.41
              500      11         0.32        1,000      52         1.53
------------------------------------------------------------------------------

Source: Tabulations for tables 1 and 2 made from data extracted from FPC’s
Typical Electric Bills, various years.


reaction to those changes was noted. These data are presented in
table 2.¹⁷

The data show that there were 42 instances during this period where
price increases by one firm were not followed by its rival; this outcome

¹⁷ Annual data are used instead of monthly data because the latter were unavailable.
A thorough search of the FPC’s National Electric Rate Book as well as personal contact with
the commission revealed many complex problems when attempting to reconcile data
among firms on effective dates of rate changes. In some cases firms changed their rate
structure, discontinued discounts, changed the minimum bill charge, added fuel
adjustment clauses, and changed water heater and space heater provisions. Since rate schedules
reported in the National Electric Rate Book were not standard among firms, evaluations
concerning the real effect of such changes on rates were extremely difficult to assess
accurately. Therefore, it was decided that annual data from Typical Electric Bills would be
more accurate for a comparison because they are presented in a standardized form
through the use of a weighted average (see Typical Electric Bills, 1968, p. 140). Obviously
there are difficulties which arise when analyzing timing with something so crude as
annual data. Perhaps the greatest difficulty occurs from the fact that some firms may
change prices more than once in a year and this would be obscured in annual data.
Information from the FPC indicates that multiple price changes within a single year
were uncommon for the years covered by the study. This fact may be verified by a review
of various issues of the commission’s National Electric Rate Book; therefore, this difficulty
does not exist in this study. Another timing difficulty emerges because of the inability
to pinpoint the exact time of a price change by one firm compared with another firm.
The procedure of implementing lags to evaluate whether a price change was followed
was used to help overcome this problem; this procedure is discussed later. On balance,
it is unlikely that the results of the study were biased one way or the other because annual
data were used.












TABLE 2

Presentation of Statistical Data: Price Increases and Decreases

1. Price increases not followed (within 3 yr) .....................  42
2. Price decreases not followed (within 3 yr) .....................  42
3. Simultaneous price changes .....................................  48
     Increases ................................................  20
     Opposite .................................................  10
     Decreases ................................................  18
4. Delay in following price increases .............................  18
     1 yr .....................................................  18
5. Delay in following price decreases .............................  15
     1 yr .....................................................  10
     2 yrs ....................................................   5
6. Delay in changing in the opposite direction ....................  11
     Municipal initiated a decrease: 1-yr delay in increasing .   2
     Private initiated a decrease: 1-yr delay in increasing ...   4
     Private initiated a decrease: 3-yr delay in increasing ...   5

Note: A reconciliation of this table and table 1 is possible by the following
procedure. The first two lines of this table record independent action, not
followed by the competing firm. Therefore, these changes total 84. Lines 3-6
constitute reactions of firms to price changes initiated by the other duopoly
firm. Therefore, there were really twice the number of price changes indicated
in lines 3-6. In summary, then, lines 1 and 2 account for 84 price changes and
lines 3-6 account for 184; total, 268 changes. The duopoly section in table 1
(cols. 1 and 3) accounts for 267 price changes. The difference of one price
change between the two tables can be explained by the situation where one firm
changed prices in one period, with no reaction from the competing firm; during
the next period both firms changed prices. According to the procedure
established for counting price changes, this resulted in a double counting of a
change in table 2. The results are not affected by this small difference.
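The reconciliation described in the note to table 2 is simple arithmetic, and a brief Python sketch makes each step explicit. The line totals are read from table 2 (18, 15, and 11 for lines 4-6) and the duopoly counts from columns 1 and 3 of table 1; the variable names are illustrative only.

```python
# Sketch of the reconciliation between table 2 and the duopoly rows of table 1.
# Numbers are transcribed from the two tables; names are hypothetical.

not_followed = {"increases": 42, "decreases": 42}   # table 2, lines 1-2
reactions = {                                        # table 2, lines 3-6
    "simultaneous": 48,
    "followed_increases": 18,
    "followed_decreases": 15,
    "opposite_direction": 11,
}

# Lines 1-2 record one firm acting alone, so each event is one price change.
independent_changes = sum(not_followed.values())

# Lines 3-6 record events in which both duopolists changed price,
# so each event stands for two price changes.
paired_changes = 2 * sum(reactions.values())

table2_total = independent_changes + paired_changes

# Duopoly price changes in table 1, columns 1 and 3.
table1_duopoly_total = sum([9, 9, 9, 10, 11]) + sum([35, 42, 42, 48, 52])

difference = table2_total - table1_duopoly_total
```

With these inputs the two totals differ by a single price change, which is the discrepancy the note attributes to one double-counted change.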


tended to support the kinked demand curve theory.¹⁸ During the same
time, however, there were 42 instances where price decreases were not
followed, and this outcome is in direct opposition to the kinky demand
curve theory. These aggregate results show that price decreases were not
followed any more frequently than price increases, and these data,
therefore, tend to refute the theory.

There were 48 simultaneous price changes during the 1959-70 period;
this means that in 48 instances both firms changed prices. Within this
group of price changes, in 20 instances simultaneous price increases
occurred, in 18 instances firms simultaneously decreased prices, and in
10 instances the direction of price change of one firm was not matched by
the competing firm. In other words, in the latter case, there were
simultaneous changes but the direction of change of price for one firm was the
opposite of that of the competing firm. In the cases of both the simultaneous
price increases and the simultaneous price changes in “mixed” directions,
the patterns cannot be explained by the kinky demand curve. Such
pricing behavior does not conform to the patterns expected from the
theory.

The point might be raised that regulatory restrictions on some firms


¹⁸ In a situation where price leadership exists, the demand curve would not kink.
The original study explains that since a municipal firm faced a privately owned firm in
all cities included in the sample, it is doubtful that any collusion or anticompetitive
behavior existed. A three-year time period was arbitrarily selected to allow for any lag
in price changes due to regulatory frictions.








may preclude free movement of prices in this type of business; therefore,
it may be argued that the previous comparison does not really reflect
behavior which is inconsistent with theory because instant price
adjustments are not possible for the privately owned firms. A lag was
implemented to take the time constraint into consideration and eliminate that
problem. Nevertheless, when one examines table 2, analysis of the delays
in following price changes further refutes the theory. After a 1-year lag,
18 price increases were followed. This behavior, too, is contrary to the
theory because the theory would not expect price increases to be followed
at all. Ten price decreases (fewer than the 18 price increases) were
followed after 1 year, while five were followed only after 2 years. The
significant fact is that price increases were followed when the theory predicts
that they would not be followed; price decreases were also followed after
a time lag, but only after a longer time and not even to the same
degree.

These results reveal that price increases were more nearly simultaneous
than were price decreases. These results are consistent with those presented
by Stigler and are the opposite of what would be predicted by the kinky
oligopoly demand curve.¹⁹

Although the point has not been previously discussed in detail, it is
significant that the results fail to support the assumption of price rigidity
in oligopoly behavior in spite of a bias which works against this outcome.
This argument is based on the notion that privately owned firms are,
according to the conventional view, reluctant to ask regulators for price
(rate) changes and open the way for rate hearings. If this proposition is
true, and the municipally owned firms are virtually free to manipulate
prices at will, one would expect more price rigidity in the markets with
regulated firms, even if only one firm is regulated. Since the results show
more price flexibility in the duopoly markets, the case against the kinked
oligopoly demand curve emerges even stronger than if this bias did not
exist.


Conclusions 

As in Stigler’s and Simon’s studies, the data from electric utility firms do
not support the kinky oligopoly demand curve hypothesis. The data show
that oligopolists changed prices more often than monopolists and that there
was a marked tendency for oligopolists to follow a competitor’s price
increase; both of these outcomes are inconsistent with the theory.

Empirical evidence concerning the kinky demand curve is still
nonexistent, so perhaps the behavior observed in price wars, which does tend to
support the assumptions of the theory, merely reflects temporary
disequilibrium conditions in imperfectly competitive markets and may not apply
to a general case of oligopoly conduct. In any case, after 30 years of
waiting for proof of the theory, it seems appropriate for economists to
concentrate on tests of this hypothesis and determine its appropriate niche
in economic analysis.

¹⁹ Stigler, p. 221.


Appendix 

Cities and Firms on Which Test Was Conducted


Cities with Competition

Bessemer, Ala.
    Alabama Power Co.
    Bessemer Electric Service

Tarrant City, Ala.
    Alabama Power Co.
    Tarrant City Electrical Dept.

Fort Wayne, Ind.
    Indiana and Michigan Electric Co.
    Ft. Wayne Electric Light and Power Works

Hagerstown, Md.
    Potomac Edison Co.
    Hagerstown Electric Light Plant

Bay City, Mich.
    Consumers Power Co.
    Bay City Electric Light Dept.

Dowagiac, Mich.
    Michigan Gas & Electric Co.
    Dowagiac Board of Public Works

Traverse City, Mich.
    Consumers Power Co.
    Traverse City Light & Power Dept.

Zeeland, Mich.
    Consumers Power Co.
    Zeeland Board of Public Works

Kennet, Mo.
    Arkansas Missouri Power Co.
    Kennet Board of Public Works

Poplar Bluff, Mo.
    Missouri Utilities Co.
    Poplar Bluff Board of Public Works

Trenton, Mo.
    Missouri Public Service Co.
    Trenton Municipal Utilities

Cleveland, Ohio
    Cleveland Electric Illuminating Co.
    Cleveland Dept. of Public Utilities

Columbus, Ohio
    Columbus & South Ohio Electric Co.
    Columbus Division of Electricity

Piqua, Ohio
    The Dayton Power & Light Co.
    Piqua Municipal Light Dept.

Springfield, Ore.
    Pacific Power & Light Co.
    Springfield Utility Board

Sioux Falls, S. Dak.
    Northern States Power Co. of Minn.
    Sioux Falls Light & Power Dept.

Garland, Tex.
    Texas Power & Light Co.
    Garland Municipal Utilities


Cities without Competition

Florence, Ala.
    Florence Electricity Dept.

Scottsboro, Ala.
    Scottsboro Electric Power Bd.

Richmond, Ind.
    Richmond Power & Light

Bristol, Va.
    Bristol Va. Utilities Bd.

Niles, Mich.
    Niles Board of Public Works

Wyandotte, Mich.
    Wyandotte Dept. of Municipal Service

Hillsdale, Mich.
    Hillsdale Board of Public Works

Lansing, Mich.
    Lansing Board of Water & Light

Sturgis, Mich.
    City of Sturgis

Petoskey, Mich.
    Petoskey Municipal Light Plant

Carthage, Mo.
    Carthage Water & Electric Plant

Rolla, Mo.
    Rolla Municipal Utilities

Columbia, Mo.
    Columbia Water & Light Dept.

Marshall, Mo.
    Marshall Municipal Utilities

Springfield, Ill.
    Springfield Water, Light & Power

Anderson, Ind.
    Anderson Municipal Light & Power

Logansport, Ind.
    Logansport Electric Light Dept.

Eugene, Ore.
    Eugene Water & Electric Board

Greenwood, S.C.
    Greenwood Commis. of Public Works

Watertown, S. Dak.
    Watertown Municipal Utilities

Springfield, Mo.
    City of Springfield Utilities

San Antonio, Tex.
    San Antonio Public Service Board





References

Colberg, R. Marshall. “Utility Profits: A Substitute for Property Taxes.” Nat.
Tax J. (October 1955): 382-87.

Federal Power Commission (FPC). National Electric Rate Book. Washington:
Government Printing Office, various.

Federal Power Commission (FPC). Typical Electric Bills. Washington:
Government Printing Office, various.

Hall, R. L., and Hitch, C. J. “Price Theory and Business Behavior.” Oxford Econ.
Papers 2 (May 1939): 12-45.

Mann, Patrick. “Publicly Owned Electric Utility Profits and Resource
Allocation.” Land Econ. 46, no. 4 (November 1970): 478-84.

Primeaux, W. J., Jr. “A Re-Examination of the Monopoly Market Structure for
Electric Utilities.” Paper presented at Brookings conference on “Antitrust and
Regulated Industries,” Washington, October 28, 1971.

Shepherd, William G. “Utility Growth and Profits under Regulation.” In Utility
Regulation: New Directions in Theory and Policy, edited by W. G. Shepherd and
T. G. Gies. New York: Random House, 1966.

Simon, J. L. “A Further Test of the Kinky Oligopoly Demand Curve.” American
Econ. Rev. 59, no. 5 (December 1969): 971-75.

Stigler, G. J. “The Kinky Oligopoly Demand Curve and Rigid Prices.” J.P.E.
55 (October 1947): 432-49. Reprinted in The Organization of Industry.
Homewood, Ill.: Irwin, 1968.

Sweezy, P. M. “Demand under Conditions of Oligopoly.” J.P.E. 47 (August
1939): 568-73.



Voucher Plans, Voting Models, and the 
Efficiency of Local School Finance 

M. L. Eysenbach
University of Utah


Discontent with the present public school system is obvious, but ways to
improve the system are not. The search for new alternatives has led to
serious consideration of the voucher plan proposed by Milton Friedman
(Friedman 1962), and some experimental voucher schemes have been
offered funding by the Office of Economic Opportunity. Public interest
in voucher plans seems likely to continue despite the Supreme Court’s
recent ruling against Ohio’s tuition grants. Economists particularly have
been attracted by voucher plans that increase consumer choice and might
improve production efficiency through greater competition among
schools. But discussion of the efficiency and equity effects of voucher
plans has not been integrated into a political-economy model of voter
behavior. That task is undertaken in this paper. The essential aspects of
Friedman’s voucher plan and a simple voting model are set out in Section
I. The implications of the voucher plan for voting behavior and hence
the size of the school tax support are explored in Section II. In the third
section the model is applied to Robin Barlow’s study of Michigan
households (Barlow 1970). It is shown that a shift to a voucher plan may be
neither efficient nor equitable.

I 

A simple voucher plan such as suggested by Friedman is used for the
analysis in this paper. Attendance laws are unchanged: children must
attend approved schools. Such schools may be run by private or public
bodies, may set their own fees, and may supplement their tuition incomes
in any way they wish. The only tax support any schools receive is voucher
payments toward tuition fees. The total school tax revenue (less
administrative costs) is divided equally among all eligible children. For
each child, parents receive a voucher which may be used in payment
of fees at any approved school. If the school’s fees exceed the voucher,
parents pay the difference directly. If the fees are less than the face value
of the voucher, parents may not receive the difference in cash (and
hence there is no incentive for a school to charge fees less than the
voucher).



The political-economy model is straightforward: a citizen votes for a
particular budget if his marginal cost in increased taxes is less than the
marginal benefits he will receive from the choice of that budget compared
to its alternative. The political process is viewed as a sequence of votes
on increasingly large budgets: when American voters are asked to approve
a total school levy, the implicit alternative is usually a smaller levy
rather than no schools at all.¹ Marginal benefits per dollar of expenditure
are assumed to decrease with increased quantities of schooling per pupil.
It follows that if voters are ranked in terms of their net gains (marginal
benefits less marginal costs) from a budget choice, the actual budget
passed will be the largest one for which the net benefits are positive for
the last voter required for a legal majority.
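The sequential voting rule just described can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the author's model: the five voters, their marginal taxes, and the linear benefit schedule are invented for the example, and the function name is hypothetical.

```python
# Sketch of the sequential-budget voting rule: budgets are voted on in
# increasing order, each voter says yes when the marginal benefit of the
# step exceeds the marginal tax, and the process stops at the first failure.

def largest_passing_budget(levels, marginal_taxes, marginal_benefit, majority=0.5):
    """Return the largest budget level approved by more than `majority` of voters.

    levels           -- budget levels in increasing order (one vote per step)
    marginal_taxes   -- each voter's added tax for any single step
    marginal_benefit -- function (voter_index, level) -> benefit of reaching `level`
    """
    passed = 0
    for level in levels:
        yes_votes = sum(
            1
            for voter, tax in enumerate(marginal_taxes)
            if marginal_benefit(voter, level) > tax
        )
        if yes_votes / len(marginal_taxes) > majority:
            passed = level
        else:
            break  # a larger budget is never reached once a step fails
    return passed


# Hypothetical electorate: taxes and benefits both rise with the voter index,
# and marginal benefits fall as the budget level rises.
MARGINAL_TAX = [60, 80, 100, 120, 140]
BASE_BENEFIT = [100, 150, 200, 250, 300]


def example_benefit(voter, level):
    return BASE_BENEFIT[voter] - 0.25 * level


result = largest_passing_budget(
    [100, 200, 300, 400, 500], MARGINAL_TAX, example_benefit
)
```

In this invented electorate the step to 300 still carries three of five voters while the step to 400 carries only two, so the budget settles at 300: the largest level at which the marginal voter needed for a majority still gains.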

II

A voter’s benefits from school taxes may be divided into “internal”
benefits associated with the schooling of one’s own child, and “external”
benefits associated with the schooling of other people’s children. The
voucher plan will change voting behavior because it changes crucially
the internal benefits which voters receive from their tax dollars. Under
the traditional system school taxes finance services in government schools,
and the internal benefit is the amount a parent would be willing to pay
for the schooling his child receives in these schools. Under the voucher
system school taxes finance vouchers which parents may use in place of
cash toward payment of school fees. The internal benefit is now the
amount a parent would be willing to pay for this nontransferable,
noncashable voucher (not the amount a parent would be willing to pay for
schooling per se). Since a parent may buy schooling directly for cash,
the maximum he would be willing to pay for a voucher is the face value
of the voucher. But the internal benefit need not be that large: the
internal benefit is less than the face value if parents would have preferred
less schooling than that provided by the voucher. Thus, the voucher
follows the usual rules for a nontransferable gift-in-kind: the value of the
gift to the recipient is equal to or less than the market price of the
commodity received.

A simple illustration may clarify this crucial difference between the
two systems. Suppose for some increase in taxes, school expenditures will
be raised by $100 per pupil. Under the traditional system, this $100
per pupil is spent for government school improvements, improvements
which some parents consider worth more than $100, but which others

¹ Several recent school tax ballots in Washington have been explicitly of a sequential
form, e.g., Levy A: X million dollars; Levy B: provided Levy A passes, an additional Y
million dollars; Levy C: provided A and B pass, an additional Z million dollars.




consider a frivolous waste. Under the voucher system, the $100 appears
as an increase in the face value of the voucher. Parents who previously
supplemented their vouchers by $100 or more now receive precisely a
$100 benefit, that is, the saving in their out-of-pocket payments to the
school of their choice. There is no reason for these parents to change the
quality of the schooling they purchase for their children (aside from income
effects if income redistribution results from the increased tax and increased
voucher). Other parents previously supplemented their vouchers by
direct payments of less than $100 or not at all. The increased voucher
now purchases some improvements in the quality of education received
by their children, but we know that these improvements are not worth
$100 to these parents since, prior to the tax and voucher increase, they
did not choose to purchase the same improvements by direct,
out-of-pocket payments.
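The $100 illustration can be restated as bounds on what the voucher increase is worth to a parent. The Python sketch below is one reading of the passage, not a formula from the article: it assumes that saved out-of-pocket payments are worth their cash amount and that any additional schooling bought by the voucher is worth at least zero to the parent. The function name and the sample amounts are hypothetical.

```python
# Sketch: bounds on a parent's internal benefit from a voucher increase,
# given how much the parent previously added out of pocket.

def voucher_increase_benefit(prior_supplement, increase=100):
    """Return (lower, upper) bounds on the internal benefit of the increase.

    Parents who already supplemented by at least `increase` simply save that
    much cash, so both bounds equal `increase`. Other parents save their prior
    supplement and receive extra schooling they had chosen not to buy, so the
    benefit is at least the prior supplement and strictly less than `increase`
    (the upper bound is exclusive in that case).
    """
    if prior_supplement >= increase:
        return (increase, increase)
    return (prior_supplement, increase)


heavy_supplementer = voucher_increase_benefit(150)  # saves exactly the increase
light_supplementer = voucher_increase_benefit(40)   # saves 40, values the rest below 60
non_supplementer = voucher_increase_benefit(0)      # benefit is below the increase
```

The point of the sketch is only the asymmetry: the first group values the extra voucher at exactly its face amount, while the other two value it at something less.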

In addition to the crucial change in internal benefits, the voucher plan
might also change the external benefits from any level of school taxes.
School taxes no longer determine the total expenditures on schooling
since parents may supplement their vouchers. An increase in taxes would
result in no increase in total schooling bought, and hence would yield
no increase in external benefits, whenever parents merely substitute
voucher payments for out-of-pocket payment of school fees. In general
one would expect out-of-pocket payment of school fees mainly by wealthier
parents, while for children of poorer parents, increases in tax level would
result in increases in total schooling under the voucher plan as under the
traditional system. It is poorer children who figure in most arguments
for the external benefits of public schooling; for the subsequent analysis
in this paper, I assume that the external benefits of a given level of school
taxes are unchanged by the voucher plan in both average level and in
distribution by income class of voters.²

To summarize, under the traditional system the internal benefit from
school taxes is the value the parents place on the education of their child.
Under the voucher system the internal benefit is the lesser of two figures:
(1) the value parents place on the education of their child, or (2) the face
(money) value of the voucher. Therefore, if external benefits are
unchanged with the voucher plan, the total benefits from any budget are
less than or equal to the total benefits of that same budget under the
traditional system unless the quality of schooling per dollar of expenditure
improves with a change in the method of financing.³ The relative
productive efficiency of each type of system is a controversy I will not enter
here; for the subsequent analysis I assume no change in the quality of
education per dollar spent.

² Greater parental choice might affect the benefits of compulsory education as
perceived by others. Advocates of uniformity will perceive a decrease in external benefits;
advocates of diversity, an increase.

³ To understand this reduction in the internal benefits from the school tax budget, it
may help to recall that the value of any particular good depends on what substitutes
exist. The voucher plan introduces direct parental financing of improved schooling, a
close substitute to tax-financed schooling.
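The comparison summarized above can be written compactly. The notation is introduced here only for illustration and is not the author's: $v$ is the value parents place on their child's schooling at a given budget, $F$ is the face value of the voucher, $E$ is the external benefit (assumed equal under both systems), and the superscripts $T$ and $V$ mark the traditional and voucher systems. The block assumes the `amsmath` package.

```latex
\begin{align*}
B_{\text{int}}^{T} &= v, &
B_{\text{int}}^{V} &= \min(v, F), \\
B^{V} = \min(v, F) + E &\le v + E = B^{T}.
\end{align*}
```

Equality holds when $v \le F$; the inequality is strict for parents who value their child's schooling above the face value of the voucher.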

The general rule for all voters, “vote yes if increased total (i.e., internal
plus external) benefits exceed marginal taxes,” results in some
economically interesting subordinate decision rules for specific cases. All voters
who choose to supplement their vouchers have marginal internal benefits
exactly equal to the increase in the voucher they receive. For these voters,
the general decision rule can be restated as “vote yes if increased external
benefits exceed marginal net taxes,” where net taxes are the difference
between taxes paid and the face value of the voucher, that is, the subsidy
paid to, or received from, others.⁴ When these voters pay above-average
taxes, they vote yes on a budget if their positive marginal external benefits
exceed their marginal net taxes. When these voters pay below-average
taxes they vote yes on a budget unless their marginal net subsidy is
offset by increases in negative external benefits.⁵ This voting rule for the
individual, a direct comparison of external benefits and net taxes or
subsidies, reminds one of the traditional welfare economist’s tax-subsidy
prescription for efficient resource allocation when externalities are present,
but of course an efficient individual-decision rule does not necessarily
lead to an efficient collective decision in a voting situation.
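The restated rule for voters who supplement their vouchers can be checked against the general rule with a small Python sketch. The function names and dollar figures are hypothetical; the only substantive assumption, taken from the text, is that a supplementing voter's marginal internal benefit equals the increase in the voucher.

```python
# Sketch: the general voting rule and its restatement for supplementing voters.

def votes_yes(marginal_internal, marginal_external, marginal_tax):
    """General rule: yes if total marginal benefits exceed marginal taxes."""
    return marginal_internal + marginal_external > marginal_tax


def supplementer_votes_yes(marginal_external, marginal_tax, voucher_increase):
    """Restated rule: yes if marginal external benefits exceed marginal net taxes.

    Net taxes are taxes paid less the face value of the voucher, so the
    marginal net tax is negative (a net subsidy) for below-average taxpayers.
    """
    marginal_net_tax = marginal_tax - voucher_increase
    return marginal_external > marginal_net_tax


# (marginal external benefit, marginal tax, voucher increase), all invented.
CASES = [
    (50, 140, 100),   # above-average taxpayer, external benefit above net tax of 40
    (30, 140, 100),   # above-average taxpayer, external benefit below net tax of 40
    (0, 60, 100),     # below-average taxpayer with a net subsidy of 40
    (-50, 60, 100),   # negative external benefit large enough to offset the subsidy
]

results = [supplementer_votes_yes(ext, tax, inc) for ext, tax, inc in CASES]

# The two rules agree when the internal benefit equals the voucher increase.
rules_agree = all(
    votes_yes(inc, ext, tax) == supplementer_votes_yes(ext, tax, inc)
    for ext, tax, inc in CASES
)
```

The four cases correspond to the situations described in the paragraph: above-average taxpayers vote yes only when external benefits cover their net taxes, and below-average taxpayers vote yes unless negative external benefits outweigh their net subsidy.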

Nor does that decision rule apply to all voters. Taxpayers who do not
supplement their vouchers have marginal internal benefits less than the
increased face value of the voucher. They would vote for a tax increase
only if the marginal internal plus marginal external benefits exceeded
their increased taxes; it would be necessary but not sufficient for marginal
external benefits to exceed marginal net taxes. For these taxpayers an
increase in school budget means an increase in the schooling provided
their children, as well as an increase in the schooling of others: under
the voucher system as under the traditional system, a voting choice
analogous to a tie-in sale. In contrast, those who value the education
of their own children highly can, with the voucher system, separate the
choice of superior quality for their own children from the choice of
minimum quality of education required for all children. However, this
change in the constraints governing the choice of schooling for some
children, the achievement of “marginal conditions” in this one area of
resource allocation, in no way implies that the chosen budget is Pareto
efficient. The voucher plan schools are firmly in the world of second
best.

⁴ Net taxes in the traditional system are the difference between taxes paid and the per
pupil cost of schooling received in government schools.

⁵ Negative external benefits might arise if, for example, some parents viewed increased
education of others as a source of increased job competition.



III


The exact size of the budget passed under the voucher plan would depend
on the configuration of benefits and taxes. In this section one particular
configuration, that found in Robin Barlow’s study of Michigan
households, is analyzed. Barlow’s figures indicated that local school taxes rose
with a household’s income, but benefits from public schools rose even
more rapidly. Hence, voters ranked in terms of increasing net gains were
also ranked in terms of increasing income; the poorer voters were net
losers and the richer voters were net gainers at the level of budget actually
passed with the traditional school system.⁶

Figure 1 is a diagrammatic presentation of the Barlow situation with
school taxes and benefits rising with income level. On the vertical axis
are the marginal costs and benefits of increases in the tax budget, and
on the horizontal axis are voters ranked by income class. Budgets X and
Y are additions of $100 per pupil to successively larger total budgets, for
example, moves from $400 to $500 per pupil and from $500 to $600 per
pupil, respectively. A single marginal tax line is shown since the marginal
tax burden of each voter does not change as the total budget increases.
For simplicity linear functions are shown so that the crucial median
voter is also an average taxpayer. If the legal majority is other than
50 percent, or taxes are asymmetrically distributed about the median, the
crucial-voter and average-taxpayer positions would not coincide, and
the voucher level would be above or below the level of taxes paid by the
crucial voter. Either of these conditions would shift the relative political
weight of the group of “richer” voters, that is, those paying
above-average taxes, but would not alter the basic exposition of the difference
between the voucher plan and the traditional-schools plan.

The traditional school budget with marginal internal plus external
benefits depicted by budget Y in figure 1 would just pass with a 50 percent
majority rule. All voters to the right of B, the 50 percent mark, vote
yes, and those to the left vote no; budget X would pass easily with all
voters to the right of A voting yes.

Would this budget Y pass if the school district adopted the voucher
plan? The answer depends on the pattern of internal and external benefits
which would together yield Barlow's pattern of total benefits rising with
a voter’s income. Further discussion requires that each type of benefit
be a nonrandom function of a voter’s income; for simplicity, linear
functions will be used.

At one extreme, if schooling has no external benefits, the school tax
budget will fall to zero. The richer voters, who pay above-average taxes,

⁶ Unless unanimity is required to pass any budget. Complications introduced by varying
family size will be ignored.



[Fig. 1. Voter distribution of marginal costs and benefits, budgets X and Y;
curve label: marginal internal plus external benefits]


would vote no. These richer voters pay taxes which exceed the face value
of the voucher, since the voucher is equal only to the average tax; they
always find it cheaper to pay school fees for their own children directly
rather than through the tax system. With no external benefits, their net
taxes necessarily exceed their external benefits at any positive level of
tax budget. At the initial Y level of the school budget, the poorer voters,
to the left of the 50 percent mark, also vote no. They value their benefits
from schooling at less than their taxes even though the face value of the
voucher, and the cost of schooling, exceeds their taxes: the poor would
prefer a lower level of both taxes and services. At a lower budget level,
marginal benefits exceed marginal taxes for some poorer voters, but the
richer 50 percent of the electorate still votes no. Hence, no school taxes
can be passed. (Obviously, if below-average taxpayers were numerous
enough by themselves to constitute a legal majority, some budget could
pass.)




At the other extreme the voucher budget is as large as the traditional
budget. With Barlow's configuration of total benefits, the necessary and
sufficient condition is that no richer voter's external benefits fall below
his net taxes. If external benefits rise at least as rapidly as taxes, then
external benefits exceed net taxes for all the richer voters at the budget
level passed with traditional schools; these richer voters vote yes, and the
same budget passes under either plan.

Interesting voting changes take place when external benefits exceed
net taxes for some richer voters as they do in figure 1. The internal
benefits of the increased voucher of budget Y cease to rise with income
class to the right of point C since the increased voucher merely substitutes
for previous out-of-pocket payments by these parents. To the right of C
total benefits now rise only to the extent that external benefits rise with
income class. The marginal taxes of budget choice Y now exceed the
marginal internal plus external benefits for all voters to the right of
point D. This dividing point is also characterized by net taxes exceeding
external benefits. Voters to the right of point D vote no under the voucher
plan, and budget Y is defeated.

A lower total budget, such as budget X in figure 1, would pass with
the voucher plan. At the lower level of total school budget we have
higher marginal benefits for a $100 increase in school taxes per pupil;
as shown, both the internal and external benefits are assumed to be higher
for every income class. At the lower budget level, the higher marginal
schooling benefits exceed the school taxes of some poorer voters (those
between points A and B). The higher external benefits exceed the net
taxes of more of the richer voters (i.e., those between points D and E).
A new majority coalition is formed of the middle-income voters (between
A and E), while the very poor and very rich still vote no.

We can now ask who might gain and who might lose from a change
to this voucher plan. The essential features of the change are: (1) the
reduction, if any, in tax support, and (2) the freedom to purchase, without
penalty, any quality of education above the voucher level.’’

One extreme case, the unchanged tax budget, is Pareto efficient: the
poor are no worse off, and the rich may now buy the more costly schooling
they desire for their own children.* At the other extreme, with no external
benefits, there would be unanimous consent to the voucher plan with a
slightly smaller tax budget, but as the tax budget falls the more prosperous


’’ Under the traditional American system, if one wishes to purchase education in a
nongovernment school, one must forfeit all internal benefits from the tax-supported
school budget.

* Even this change would not be Pareto optimal if the traditional system permitted
private schools: since all children are entitled to a voucher, the tax-supported expenditure
per pupil must fall if the total budget is unchanged, since that total must now be spread
over all students including former private school students.




poor become net losers. The zero budget actually passed with a majority
rule is not Pareto efficient compared to the original budget.

The in-between case illustrated in figure 1 also is not Pareto efficient.
Unambiguous gainers are the very poor and the very rich. The very
poor, those to the left of point A, would prefer a still smaller budget than
passes under the voucher system, but their losses are reduced. The very
rich, to the right of E, can now buy the better quality of education they
want for their children. They would be better off even if the budget
level did not fall, but in addition, with a smaller budget, the difference
between the net taxes they pay and the smaller external benefits they
receive is reduced.

Unambiguous losers are the lower group of above-average taxpayers
(between points B and C). Under either system they would prefer a
larger budget than budget Y, since even at the larger budget their
marginal benefits exceed their marginal taxes; every reduction in budget
means a net loss to them. This loss is not offset by the freedom to buy
a better quality of education for their own children because the original
budget provided more schooling than they wanted for their own children.

Three groups occupy an ambiguous position. The richer voters between
C and D, like the unambiguous losers, would prefer a larger school budget
than is passed under either system, and hence every reduction in budget
creates a net loss. But members of this group, unlike the unambiguous
losers, are willing to pay for better education for their own children and
hence gain from the freedom to supplement under the voucher plan.
Their net gain or loss from the change to a voucher plan is uncertain.
The more prosperous poor (between points A and B) are the second
ambiguous group. They would prefer a smaller budget than passes
under the traditional system (they vote no on budget choice Y under
either plan) but would prefer a larger budget than passes under the
voucher plan. Whether they are better off with “overproduction” or
“underproduction” cannot be determined from qualitative data alone.
Similarly, those richer voters between D and E find the budget X with
the voucher system too large and budget Y too small. Although they gain
from the freedom to supplement, their net gain from the change of plan
plus change in budget size is uncertain.

Obviously, within the limitations of this model, there is no case for
the voucher system on the basis of Pareto efficiency. Nor is the equity case
any clearer, for the only certain gainers are the extremes of very rich and
very poor.

We have analyzed here the political economy of a voucher plan only
within Barlow’s configuration of taxes and total schooling benefits.
Alternative configurations of either benefits or taxes would yield different
results. The benefits of schooling may differ from Barlow’s pattern, and
the change in benefits as the result of a voucher plan will vary with the




specifics of that plan, especially with provisions which limit parental
freedom to supplement vouchers. Nevertheless, the results of this study
show that the political economy of both simple and complex voucher
plans requires more careful consideration by both their advocates and
their opponents.


References 

Barlow, Robin. “Efficiency Aspects of Local School Financing.” J.P.E. 78, no. 5
(September/October 1970): 1028-40.

Friedman, Milton. Capitalism and Freedom. Chicago: Univ. Chicago Press, 1962.




Employer Costs and Discrimination:
The Case of Baseball

James Gwartney and Charles Haworth

Florida State University


Economic theory implies that employers who discriminate are at a
competitive disadvantage relative to firms that follow a less discriminatory
policy. The low discriminator is able to reduce per unit cost because of
his willingness to hire a disproportionate share of low-cost (relative to
productivity) factors of production against which other firms discriminate.
Ceteris paribus, the profits of the low discriminator will be greater, and his
share of the market will tend to increase.* This theoretical proposition
is difficult to test in the real world because of the lack of available
indicators of discriminatory preferences and our inability to adjust for
violations of ceteris paribus conditions between firms of an industry.

Analysis of desegregation in major league baseball provides a unique
opportunity for researchers because in this industry it is possible to
quantify the impact of a discriminatory policy on important revenue
and cost variables.* In 1947, professional baseball teams were placed
in an unusual position. A pool of readily available, proven baseball talent
existed in the Negro leagues.^ The contracts of most of these players
could be purchased at a fraction of the cost that would be necessary to
obtain comparable white players. Negro clubs were usually paid between
$1,000 and $3,000 compensation for black players capable of transferring


The authors would like to thank David Beggs for his research assistance and two
anonymous referees for their helpful comments on an earlier draft of the manuscript.
' See Becker (1957, chap. 3) for a complete statement of the theory. Becker concludes:
“In general, firms with discrimination coefficients less than the median discrimination
coefficient are profitable and tend to expand relative to other firms” (p. 36).

‘ See Pascal and Rapping (1970) and Scully (1971) for evidence on the existence of
employment discrimination in major league baseball. These two papers, utilizing
primarily data from the sixties, focused on earnings and productivity differences between
players according to color. Our paper focuses on the transition period (1947-59) and is
a quantitative study of the impact of differing responses by individual firms to a
desegregation of the industry. We are attempting to evaluate the payoffs (or costs) of being
an innovator and/or a low discriminator in an industry that previously had been
completely segregated.

* An indication of the talent available in the Negro leagues is the fact that six of the
12 “Rookies of the Year” selected during the 1947-53 period formerly played in the
Negro leagues. During the 1950-59 period, eight of 20 major league “Most Valuable
Player” (MVP) winners had played in the Negro leagues. (See n. 12 below.)


directly to the major leagues. The highest price paid for a contract of a
player from the Negro leagues was $15,000 (Peterson 1970). In contrast,
the cost of player development for major league teams during the period
1945-50 was probably over $100,000 per major league player.*

Two inputs, white and/or black players, could be utilized in an effort
to win games (an output variable), but the price of an equally productive
black player was much lower. Once the color line was broken, ceteris
paribus, firms had an incentive to substitute the less expensive blacks for
the higher-priced white players.

There is considerable evidence that baseball executives were aware
of the potential. As early as 1944, Bill Veeck tried to buy the sinking
franchise of the Philadelphia Phillies and stock the club with stars from
the Negro leagues. In his autobiography Veeck (1962) states: “With
Satchel Paige, Roy Campanella, Luke Easter, Monte Irvin, and countless
others in action and available, I had not the slightest doubt that in
1944, a war year, the Phillies would have leaped from seventh place
to the pennant.”

Veeck’s plan to buy the Phillies was foiled. After a conference with the
baseball commissioner, the owners sold the team to someone else for
about half the price that Veeck was willing to pay. But 4 years later Veeck
did gain control of the Cleveland Indian franchise. Paced by established
players from the Negro leagues such as Paige, Easter, and Larry Doby,
the Indians and Yankees dominated the American League in the late
forties and early fifties.’

Other baseball men were also aware of the potential of black baseball
players. John McGraw, in the early days of organized baseball, once
tried to play a Negro, referring to him as an Indian (see Peterson 1970).
Branch Rickey, the first baseball owner who was willing to break the
color line, is reported to have felt that the employment of Negroes was
necessary to restore his Brooklyn team that had been ravaged by war
and old age.

* Data on player development costs for the New York Yankees and Chicago Cubs are
available from the Celler Committee Report (U.S., Congress, House 1951). The Yankees
estimated a development cost per player of $201,000, and the Cubs, $131,000. These
figures include all of the losses of the farm system, which would be appropriate only if
the whole purpose of the farm system was to develop major league players. Since there
were some other advantages to farm systems such as retention of territorial rights, this is
probably an overestimate. If no farm club losses are used (and some should be) the cost
figure per player would be: New York Yankees, $180,000, and Chicago Cubs, $90,000.
During this period bonus payments were made almost exclusively to white players. The
payments also are indicative of the cost of acquiring white player talent. A bonus is a
payment to a free agent to sign a contract with a team. Between 1947 and 1958 over 100
white free agents were paid bonuses exceeding $25,000. The probability of any one of
the bonus recipients becoming a major league player was one in three. These data are
from Pascal and Rapping (1970) and various editions of the Sporting News.

* Between 1948 and 1956 the Indians won the American League pennant twice and
finished second five times.





TABLE 1

Cumulative Number of Black Player Years, and the
Won-Lost Record of Major League Baseball Teams, 1952-56

                        Cumulative No. of        Games Won    Ranking,
                        "Black Player Years"     1952-56      Won-Lost Record
Team*                   1947-56     1952-56      (%)          1952-56

Brooklyn (N)              37          22         0.630           2
New York (N)              25          18         0.527           6
Cleveland (A)             20          12         0.619           3
Boston (N)                18          18         0.548           5
Chicago (A)               14          12         0.571           4
Chicago (N)               12          12         0.439          12
Cincinnati (N)             8           8         0.489          10
Philadelphia (A)           8           8         0.394          14
Pittsburgh (N)             3           3         0.351          15
Saint Louis (A)            3           3         0.387          16
Saint Louis (N)            2           2         0.502           9
New York (A)               2           2         0.638           1
Washington (A)             1           1         0.432          13
Philadelphia (N)           0           0         0.509           8
Boston (A)                 0           0         0.516           7
Detroit (A)                0           0         0.440          11

Average (top 8)           17.8        13.8       0.528           7.0
Average (lowest 8)         1.4         1.4       0.472          10.0
Average (top 5)           22.8        16.4       0.579           4.0

* National League teams are denoted by (N) and American League by (A).


In the 1940s, there were 16 major league baseball teams. The sport had
sizable monopoly power, protected by a judicial interpretation providing
baseball with exemption from antitrust prosecution. So there was little
fear of a new innovative competitor, utilizing black players, entering
and reducing the share of the market (and profits) of the existing firms.
But, if an existing team was willing to employ blacks, it should have been
able to increase its wins (output) per unit cost.
Just how large was the potential competitive advantage that could
accrue to a “low discriminator”? The ranking of teams according to the
cumulative number of blacks on the team each year (“black player
years”) clearly illustrates the contribution of black players to team
success (see table 1).* By 1956 the Brooklyn Dodgers had accumulated
37 “black player years,” 22 of them during the 1952-56 period. They
were followed by New York (National League), Cleveland (American),
and Boston (National). During the 1947-56 period, the eight teams with
the greatest representation of blacks averaged 17.8 “cumulative black


* An additional “black player year” was assigned to a team for every additional black
player on the team’s annual official roster during the time period of the 25-player roster
limitation of the league. The number of “cumulative black player years” is the number
of blacks on the team’s roster each year, summed over a time period. Note that a player
who is on the team’s official roster for X years would contribute X times to the team’s
“cumulative black player years.” The listing of Negro players from 1947 to 1959 by
Bardolph (1959) was utilized to classify players on the basis of race.









TABLE 2

Importance of Black Players as a Determinant of Games Won by a Team
during the Period Immediately Following the Fall of the Color Barrier

Equation No. and
Dependent Variable        Intercept     X1         X2         X3        R²     (df)

(1) Games won per           20.4       0.89        ...       0.105      .42    (93)
    year (1950-55)                    (4.23)                (5.84)
(2) Games won per           15.6        ...        3.75      0.111      .40    (93)
    year (1950-55)                                (4.00)    (6.25)
(3) Games won per           26.8       0.32        ...       0.094      .32   (157)
    year (1950-59)                    (3.63)                (7.39)
(4) Games won per           22.5        ...        2.05      0.101      .34   (157)
    year (1950-59)                                (4.38)    (8.08)

Note: X1 = cumulative black player years; X2 = number of black players during the tth year;
X3 = percentage of games won during the immediately preceding 4-year period (1946-49).


player years.” The comparable figure for the other eight teams was 1.4.
The top eight teams in terms of “black player years” won 5.6 percent
more of their games than did the eight teams with the least
representation of black players during the 1952-56 period.
The evidence is even more impressive for the top five. The five teams
that employed the most black players through 1956 comprised five
of the top six teams according to won-lost percentage during the 1952-56
period. Only the New York Yankees were able to attain this stature with
relatively few black players. The five teams with the most black players
won 58 percent of their games during the 1952-56 period. By contrast,
the other 11 teams won only 46 percent.
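
The group averages quoted above can be rechecked from table 1. What follows is a minimal Python sketch, assuming the rows as transcribed from table 1 and simple unweighted means (the original does not state how its averages were weighted). The names `TABLE_1` and `mean_win_share` are illustrative only and do not come from the paper.

```python
# Rows transcribed from table 1, in table order (ranked by 1947-56 black player years):
# (team, cumulative black player years 1947-56, share of games won 1952-56)
TABLE_1 = [
    ("Brooklyn (N)", 37, 0.630),
    ("New York (N)", 25, 0.527),
    ("Cleveland (A)", 20, 0.619),
    ("Boston (N)", 18, 0.548),
    ("Chicago (A)", 14, 0.571),
    ("Chicago (N)", 12, 0.439),
    ("Cincinnati (N)", 8, 0.489),
    ("Philadelphia (A)", 8, 0.394),
    ("Pittsburgh (N)", 3, 0.351),
    ("Saint Louis (A)", 3, 0.387),
    ("Saint Louis (N)", 2, 0.502),
    ("New York (A)", 2, 0.638),
    ("Washington (A)", 1, 0.432),
    ("Philadelphia (N)", 0, 0.509),
    ("Boston (A)", 0, 0.516),
    ("Detroit (A)", 0, 0.440),
]


def mean_win_share(rows):
    """Unweighted mean of the win shares (an assumption; the paper may have weighted by games)."""
    return sum(share for _, _, share in rows) / len(rows)


def mean_player_years(rows):
    """Unweighted mean of cumulative black player years, 1947-56."""
    return sum(years for _, years, _ in rows) / len(rows)


top8, low8 = TABLE_1[:8], TABLE_1[8:]
top5, other11 = TABLE_1[:5], TABLE_1[5:]

if __name__ == "__main__":
    for label, rows in [("top 8", top8), ("lowest 8", low8),
                        ("top 5", top5), ("other 11", other11)]:
        print(f"{label:9s} player years {mean_player_years(rows):5.2f}  "
              f"win share {mean_win_share(rows):.3f}")
```

Under these assumptions the unweighted means fall within about 0.001 of the group averages printed in table 1, and the gap between the top five and the other 11 teams reproduces the 58 versus 46 percent comparison in the text.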

Table 2, utilizing regression analysis, investigates the possibility that
the teams with the most black players may have won more games during
the early fifties because they started with better player personnel at the
beginning of the period. The number of games won by a team during a
season is the dependent variable. Equations (1) and (3) show that even
after adjustment for initial team quality (X3), the cumulative number of
“black player years” had a highly significant positive impact on the
number of games won during the tth year.

Equations (2) and (4) (table 2) consider the same two time periods,
but the number of black players on the team during the tth year was
substituted for the cumulative variable. Again, the evidence shows that
the number of black players was a significant determinant of the games
won by a team for both time periods. For the 1950-55 period, the results
suggest that inclusion of an additional black player on a major league
team, on average, resulted in an additional 3.75 wins per year, after
adjustment for the initial team performance at the beginning of the







period.’ For the longer 1950-59 period, each additional black player is
estimated to have meant two additional wins per year for his team.*
The importance of black players to team success is not surprising
given their high productivity, on average, relative to whites. Rosenblatt
(1967) has previously shown that the batting averages of black players
substantially exceeded those of whites.* Table 3 presents an even more
dramatic measure of productivity differences according to color. The
slugging averages of players according to color and position are presented
for 1950 and 1955.‘“ In each case, the slugging average of blacks,
indicating both their ability to get on base and to hit for extra bases,
substantially exceeded that of other players even when grouped by position.
Both the batting and slugging average data highlight the potential of the
available black playing talent during this era.

Thus far we have emphasized the contribution of high-productivity
and less expensive black players to team success as measured by games
won. However, if consumer discrimination against teams playing blacks
was intense, team attendance, and therefore revenues, might suffer even
while the team was winning. Table 4 presents evidence that strongly
contradicts this hypothesis. If consumer discrimination was strong, we


’’ To appreciate the importance of 3.75 games, it should be noted that in the period
1930-60, 3.75 games would have made the difference between winning a pennant and
being runner-up 30 percent of the time and the difference between second and third
place over 30 percent of the time.

* Annual games won was also regressed on “black player years” (X2) and percentage
of games won during the immediately preceding 4-year period (X3) for 1960-69. The
number of black players per team remained significant at the 90 percent level, although
the t-ratio (1.80) was considerably smaller for the sixties than for the fifties (eqq. [2]
and [4]). The size of the regression coefficient during the sixties was 0.77, less than
one-half the comparable figure for the fifties. Analysis of the partial coefficients of
determination for the number of black players per year (X2) suggests that the contribution of this
variable to the explanation of the dependent variable was substantially smaller during
the sixties. The adjusted partial coefficients of determination for X2 in eqq. (2) and (4)
were .131 and .096, compared to .007 for the same equation during the sixties.

* The batting averages of Negroes and whites for 1953 through 1959, as indicated by
Rosenblatt, were:

          Negro Batting    White Batting
Year      Average          Average          Difference

1953        .288             .257             .031
1954        .262             .244             .018
1955        .271             .251             .020
1956        .270             .246             .022
1957        .264             .252             .012
1958        .274             .249             .025
1959        .265             .246             .019


Rosenblatt found that the differential continued to persist into the 1960s.

‘" A player’s slugging average is his total bases on hits divided by his number of
at-bats. Ceteris paribus, it will be higher for players who hit more home runs and other
extra base hits.
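
The definition in this note can be written out directly. The sketch below is a minimal Python illustration; the function name, its argument names, and the sample batting line are hypothetical and are not taken from the paper's data.

```python
def slugging_average(singles, doubles, triples, home_runs, at_bats):
    """Total bases on hits divided by at-bats, as defined in the note above."""
    if at_bats <= 0:
        raise ValueError("at_bats must be positive")
    # A single counts one base, a double two, a triple three, a home run four.
    total_bases = singles + 2 * doubles + 3 * triples + 4 * home_runs
    return total_bases / at_bats


if __name__ == "__main__":
    # Hypothetical batting line, for illustration only.
    print(round(slugging_average(100, 20, 5, 25, 500), 3))
```

Because extra base hits carry larger weights, two players with the same batting average can have very different slugging averages, which is why the authors treat it as the more demanding measure.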









TABLE 3

Slugging Average of All Players and of Black Players
by Position in 1950 and 1955

                           Slugging Average
                      1950                 1955
Position          Black     All        Black     All

All               .496      .403       .440      .385
OF-1B             .492      .426       .453      .427
2B, 3B, SS        .461      .372       .421      .358
Catchers          .550      .387       .582      .380


would expect a negative relationship between home team attendance
and the number of black players on the team. Equations (1) and (4)
(table 4) show that for both the 1950-55 and 1950-59 periods there was
a highly significant positive relationship between team attendance and
number of black players on a team, even after adjustment for the team’s
average attendance during the preceding 4-year base period (1946-49).
On average, each additional black player on a team was associated with
between 55,000 and 60,000 additional annual home-team admissions
during the 1950s. Undoubtedly, teams with black players had
substantially higher attendance, at least partially because they won more
games (see table 1). Equations (2) and (5) (table 4) estimate that team
attendance was increased 12,000-15,000 by each additional game won
during the 1950s. Given our previous estimates that, on average during
the fifties, each additional team “black player year” was associated with
between two and four additional wins, the positive impact of black
players on attendance is not surprising. An additional black player meant
additional wins, and the additional games won meant higher attendance.
Equations (3) and (6) (table 4) estimate the impact of black players on
team attendance, after adjustment for both base-period attendance and
annual games won. The number of black players is still positively
associated with attendance, although the size of the black player coefficient,
as expected, was reduced by the inclusion of the “games won” variable.
It was insignificant for the 1950-55 period but was positive and significant
when the entire 1950-59 period was considered (eq. [3]). Certainly
these results indicate that black players not only helped win games but
also brought customers to the ball park.
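
The chain of reasoning in this paragraph (more black players, more wins, more admissions) can be checked with simple arithmetic. The following Python sketch is a back-of-the-envelope decomposition that is not part of the original paper: it assumes the coefficients as transcribed from tables 2 and 4, and it assumes that coefficients from separate regressions may be combined, which is only approximately valid. The dictionary keys and the function name are illustrative.

```python
# Coefficients as transcribed from table 2 (wins per additional black player)
# and table 4 (attendance per win, and the attendance effect of a black player
# with games won held constant). Combining them is an assumption of this sketch.
PERIODS = {
    "1950-55": {
        "wins_per_player": 3.75,       # table 2, eq. (2)
        "attendance_per_win": 11369,   # table 4, eq. (6)
        "direct_attendance": 15658,    # table 4, eq. (6)
        "total_reported": 58327,       # table 4, eq. (4)
    },
    "1950-59": {
        "wins_per_player": 2.05,       # table 2, eq. (4)
        "attendance_per_win": 13772,   # table 4, eq. (3)
        "direct_attendance": 28702,    # table 4, eq. (3)
        "total_reported": 55696,       # table 4, eq. (1)
    },
}


def implied_total_attendance(p):
    """Direct effect plus the indirect effect that runs through extra wins."""
    indirect = p["wins_per_player"] * p["attendance_per_win"]
    return p["direct_attendance"] + indirect


if __name__ == "__main__":
    for period, p in PERIODS.items():
        implied = implied_total_attendance(p)
        print(f"{period}: implied {implied:,.0f} vs. reported {p['total_reported']:,}")
```

On these transcribed figures the implied totals come to roughly 58,300 and 56,900 admissions per additional black player, close to the 58,327 and 55,696 estimated directly in equations (4) and (1), which is consistent with the authors' reading that most of the attendance effect works through games won.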

In addition to their impact on team performance, two other factors
may have contributed to the ability of black players to attract fans to the
ball park. The first might be referred to as the “superstar” phenomenon.
Outstanding players are thought to exert an additional impact on
attendance because customers like to observe players of exceptional
ability, even if they play for a “losing” team. The fact that blacks were
overrepresented among superstars may, at least partially, explain their





TABLE 4

Impact of Games Won and Black Players on Annual Team
Home Attendance for 1950-55 and 1950-59

Dependent Variable: Annual Team Home Attendance

Independent                        1950-59                            1950-55
Variables                 (1)        (2)        (3)          (4)        (5)        (6)

Team black players      55,696       ...      28,702       58,327       ...      15,658
  (annual)               (4.2)                 (2.4)        (2.5)                 (0.7)
Games won                 ...      15,193     13,772         ...      11,973     11,369
                                    (8.2)      (7.2)                   (5.7)      (5.0)
Average attendance,      0.45       0.22       0.24         0.53       0.33       0.33
  1946-49                (7.8)      (3.7)      (4.1)        (7.1)      (4.3)      (4.3)
Y-intercept            366,521   -404,195   -373,860      268,706   -336,808   -310,389
R²                        .33        .46        .49          .40        .52        .52
df                        157        157        156           93         93         92


positive impact on attendance. Second, black players may have
attracted black customers who previously did not attend major league
games. While no data on the racial composition of customers during the
fifties are available, substantial attendance at Negro league games
immediately after World War II and the rapid decline of the Negro
leagues once the major league color bar was eliminated suggest that
many black baseball fans switched their allegiance.**

Given their influence on home team attendance, what impact did
black players have on team revenues during the fifties? Assuming that

' * There is no precise definition of “superstar” available, but to illustrate this point
two measures are suggested. At the end of each season an individual is voted by baseball
writers as the outstanding performer in his league. During the period 1950-59, of the
20 most valuable players (MVP) selected, eight were black (40 percent). As an alternative
measure of star status, one could use the All-Star Team Selection. Each year the
outstanding players at each position, approximately 12 percent of all players, are chosen.
The table below shows that blacks were dramatically overrepresented on these teams
during the early 1950s.


          Blacks as %          Blacks as %
          of All Players       of All-Stars
Year      (1)                  (2)                 (2)/(1)

1950      [illegible]           8.0                 4.7
1955       7.7                 15.0                 [illegible]
1960      [illegible]          20.0                 1.9


' ‘ Many of the Negro league teams shared stadium occupancy with major league
teams. Peterson (1970) reports that crowds of 12,000-15,000 were not uncommon in
1946 for Negro league baseball games. Over 45,000 fans were present for the East-West
All-Star game during that year. By contrast, just 2 years later, attendance of less than
2,000 was common. This sharp reduction in attendance resulted in the demise of several
teams. For example, desegregation of the Dodgers and Giants coincided with the demise
of two New York entries (Newark Eagles and New York Black Yankees) in the Negro
National League.












the net income per admission to a major league game was $1.00,* our
attendance estimates suggest that the black-white differential revenue
product was, on average, greater than $55,000 per black player. The
potential financial gains available to low discriminators in this industry
were large indeed. For example, our estimates suggest that a low
discriminator, willing to employ as few as four black players (among a
25-player team) could, on average, have increased his revenues by more
than $200,000 annually, while simultaneously reducing the cost of
acquiring major league players. The profit motive surely served to hasten
the desegregation of baseball.
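
The revenue figure in this paragraph is a product of three numbers, and the arithmetic is easy to reproduce. The Python sketch below assumes the attendance coefficient of 55,696 admissions per black player from table 4 and the authors' working assumption of $1.00 net income per admission; the function name and parameters are illustrative, and the linear scaling across four players is an assumption of the sketch rather than a result in the paper.

```python
def added_annual_revenue(n_players, admissions_per_player, net_income_per_admission=1.00):
    """Added revenue if each player adds the same number of admissions (linear assumption)."""
    if n_players < 0:
        raise ValueError("n_players must be non-negative")
    return n_players * admissions_per_player * net_income_per_admission


if __name__ == "__main__":
    # Table 4, eq. (1): about 55,696 additional admissions per black player, 1950-59.
    base = added_annual_revenue(4, 55696)
    # Sensitivity check at the $1.05 average income per admission reported for 1950.
    alt = added_annual_revenue(4, 55696, net_income_per_admission=1.05)
    print(f"four players at $1.00: ${base:,.0f}")
    print(f"four players at $1.05: ${alt:,.0f}")
```

At $1.00 per admission four players imply a little over $222,000 a year, which is the basis for the statement that revenues would rise by more than $200,000.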


Conclusions 

As is implied by economic theory, the empirical evidence shows that
“low discriminators” (i.e., teams willing to employ black players) obtained
a competitive advantage in major league baseball relative to other teams.
They were able to win more games, acquire quality players at a lower
cost, and increase annual revenue from admissions. These gains offered
an incentive for teams to desegregate.

However, given the estimated contribution of black players to team
success and financial gain, the results are not particularly encouraging
as to the expected rate of decline in discriminatory barriers. In the late
1940s and early 1950s, proven-quality baseball players were readily
available from the Negro leagues. Yet nearly 5 years after the fall of the
color bar, less than one-half of the teams had desegregated.** Perhaps
major league baseball is unrepresentative of other industries because of
the cartel power that the league has over the behavior of individual
firms. Additional evidence on other industries will be necessary before
general conclusions can be drawn about time requirements necessary
to change social attributes and reduce institutional barriers. But in the
case of baseball these factors did not change rapidly even when the
opportunity for substantial financial gain was present.

References 

Bardolph, Richard. The Negro Vanguard. New York: Rinehart, 1959.

Becker, Gary. The Economics of Discrimination. Chicago: Univ. Chicago Press,
1957.

Pascal, A. H., and Rapping, L. A. “Racial Discrimination in Organized
Baseball.” Xeroxed. 1970.

' * In 1950, after admission taxes, visitor share, and league share, the average income
per admission was $1.05 (U.S., Congress, House 1951, p. 1619). This figure does not
include any revenue from parking, concessions, radio-TV rights, and road receipts.

One might argue that discriminatory barriers were reduced slowly in major league
baseball because of the local monopoly possessed by each team. However, it should be
recognized that despite the possession of monopoly power, the incentive to reduce costs
and therefore obtain additional financial gain still remains.




Peterson, R. W. Only the Ball Was White. Englewood Cliffs, N.J.: Prentice-Hall,
1970.

Rosenblatt, A. “Negroes in Baseball: The Failure of Success.” Trans-Action
(September 1967), pp. 51-53.

Scully, Gerald W. “The Economics of Discrimination in Professional Sports:
The Case of Baseball.” Paper prepared for a conference on Government and
the Sports Business, December 6 and 7, 1971.

U.S., Congress, House, Subcommittee on the Study of Monopoly Power.
Organized Baseball. 82d Cong., 1st sess., 1951, ser. 1, pt. 6.

Veeck, Bill. Veeck as in Wreck. New York: Putnam, 1962.




Friedman versus Tobin: A Comment 


T. Norman Van Cott
West Georgia College

Gary Santoni
Kansas State Teachers College


In a recent issue of this Journal Milton Friedman (1972) replied to five
critics of his work. This short comment is directed only to Friedman’s
reply to James Tobin. This particular exchange was primarily concerned
with putting the debate between the monetarists and neo-Keynesians
into an IS-LM framework. Friedman, in disputing Tobin’s contention
that the argument reduces to one of determining the slope of the LM
schedule, claims that the debate really centers around two other factors.
First, “saving” and “investment” should be interpreted more broadly.
Second, the monetary effect of any fiscal action will ultimately be
predominant (i.e., an LM schedule continually shifting through time will
“swamp” a once-and-for-all shift in the IS curve).

In this short comment we want to point out that Friedman makes a
small mistake when discussing his first point in terms of the IS-LM
paradigm. While the validity of his conclusion is not affected,
manipulation of the model as Friedman suggests will always yield the right answer
for the wrong reason. Were it not one of Friedman’s major points in his
exchange with Tobin, this comment would not be justified.

In referring to his first point Friedman (pp. 915-16) writes, “The
monetarist view is that ‘saving’ and ‘investment’ have to be interpreted
much more broadly than neo-Keynesians tend to interpret it, that the
categories of spending affected by changes in interest rates are far broader
than the business capital formation, housing construction, and inventory
accumulation to which the neo-Keynesians tend to restrict ‘investment.’
Hence, even a fairly substantial tax increase will produce only a minor
shift in the IS curve.” Friedman goes on to point out that, regardless
of the slope of the LM, a “minor shift” in the IS results in a minor change
in income.

Friedman is incorrect in claiming that a broadened interpretation of
saving and investment results in smaller shifts in the IS schedule in
response to a change in taxes. This is not true for a change in taxes or
any other change arising in the expenditure sector. The only effect of a


broadened interpretation of saving and investment is to make the IS 
schedule “flatter.” The leftward shift in the schedule in response to 
higher taxes always equals the tax increase times the relevant multiplier 
Perhaps the simplest way to incorporate a broadened interpretation of 
saving and investment into the IS-LM paradigm is to introduce the 
interest rate as an argument in the consumption function Assume that 
consumption and the interest rate vary inversely as do neo-Keynesian 
investment and the rate of interest If we assume taxes arc of the lump¬ 
sum vanety, the equation for the IS curve reduces to 


$$Y = \frac{a - bT + G_0 + g_0 + g_1 i + c_0 i}{1 - b} \tag{1}$$


where $Y$ is output, $a$ is the constant term in the consumption function,
$b$ is the marginal propensity to consume, $T$ is lump-sum taxes, $G_0$ is
government spending, $g_0$ is the constant term in the investment function,
and $g_1$ and $c_0$ show the responsiveness of investment and consumption,
respectively, to interest rate changes. Presumably, for the monetarists
$g_1$ is greater in absolute value than it is for the neo-Keynesians, and
$c_0$ is equal to zero for the neo-Keynesians and less than zero for the
monetarists.

With reference to the interest rate axis, the slope of IS is 


$$\frac{dY}{di} = \frac{g_1 + c_0}{1 - b} \tag{2}$$


That is, the more responsive investment and consumption are to the
interest rate, the “flatter” the IS curve is.

The shift in the IS curve in response to higher taxes is 


$$dY = -\frac{b}{1 - b}\, dT \tag{3}$$


This is independent of $g_1$ and $c_0$.

In terms of the IS-LM diagram (fig. 1) we have the following. Assume
one IS schedule is the monetarists’ and the other is the neo-Keynesian
schedule. From an initial equilibrium at point A, a rise in taxes shifts
both IS schedules leftward by the same amount. Note that the decline in
income is smaller for the monetarists, but not because of a smaller shift
in the IS curve.
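
The argument can be checked numerically. The following is a minimal sketch, assuming illustrative parameter values and a hypothetical linear LM schedule of the form i = lm_slope * Y (neither appears in the comment itself). It computes the horizontal shift of IS from equation (1) at a fixed interest rate, and then the change in equilibrium income, for a neo-Keynesian and a monetarist parameterization.

```python
def is_income(i, T, g1, c0, a=100.0, b=0.8, G0=200.0, g0=150.0):
    """Income on the IS curve at interest rate i, following equation (1)."""
    return (a - b * T + G0 + g0 + (g1 + c0) * i) / (1.0 - b)


def equilibrium_income(T, g1, c0, lm_slope=0.01, b=0.8):
    """Income where IS meets a hypothetical LM schedule i = lm_slope * Y."""
    autonomous = is_income(0.0, T, g1, c0, b=b)   # IS income at i = 0
    k = (g1 + c0) / (1.0 - b)                     # dY/di along IS, equation (2)
    # Solve Y = autonomous + k * i together with i = lm_slope * Y.
    return autonomous / (1.0 - k * lm_slope)


# Illustrative interest responses (assumed values, not estimates).
neo_keynesian = {"g1": -10.0, "c0": 0.0}
monetarist = {"g1": -30.0, "c0": -20.0}

for name, p in (("neo-Keynesian", neo_keynesian), ("monetarist", monetarist)):
    # Horizontal shift: change in IS income at a fixed interest rate.
    shift = is_income(5.0, 110.0, **p) - is_income(5.0, 100.0, **p)
    # Change in equilibrium income once the interest rate adjusts along LM.
    change = equilibrium_income(110.0, **p) - equilibrium_income(100.0, **p)
    print(f"{name}: horizontal IS shift = {shift:.2f}, income change = {change:.2f}")
```

Under these assumed values the horizontal shift is the same for both parameterizations, as equation (3) requires, while the fall in equilibrium income is smaller for the flatter (monetarist) schedule.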

One might object that Friedman is really referring to a vertical shift
in the IS curve, in which case he is correct. However, a reading of the
section of his paper to which this comment is directed indicates that
Friedman refers to shifting IS and LM schedules 10 times, with each
shift described as a leftward or rightward shift.



[Fig. 1]


Reference 

Friedman, Milton. “Comments on the Critics.” J.P.E. 80 (September/October
1972): 906-50.



Miscellany 


The Economics of Brushing Teeth 


The ever-growing literature on human capital has long recognized that
the scope of the theory extends well beyond the traditional analysis of
schooling and on-the-job training. Migration, maintenance of health,
crime and punishment, even marriage and suicide, are all decisions which
can usefully be considered from the human capital point of view. Yet
economists have ignored the analysis of an important class of activities
which can and should be brought within the purview of the theory. A
prime example of this class is brushing teeth.¹

The conventional analysis of toothbrushing has centered around two
basic models. The “bad taste in one’s mouth” model is based on the notion
that each person has a “taste for brushing,” and the fact that brushing
frequencies differ is “explained” by differences in tastes. Since any
pattern of human behavior can be rationalized by such implicit theorizing,
this model is devoid of empirically testable predictions, and hence
uninteresting.

The “mother told me so” theory is based on differences in cultural
upbringing. Here it is argued, for example, that thrice-a-day brushers
brush three times daily because their mothers forced them to do so as
children. Of course, this is hardly a complete explanation. Like most
psychological theories, it leaves open the question of why mothers should
want their children to brush after every meal. But it does at least have one
testable implication: that individuals from higher social classes will brush
more frequently.

In these pages I describe a new model which is firmly grounded in
economic theory and which generates a large number of empirically
testable hypotheses. I then show that the predictions of the model are
supported by the data.

EDITOR’S NOTE: This paper derives from the Princeton oral tradition.
I wish to thank my dentist for filling in some important gaps in the analysis, and my
colleague, Michael Rothschild, for insightful kibitzing. Support for this research is
graciously solicited.

¹ The analysis to follow can also be applied to such important problems as combing
hair, washing hands, and cutting fingernails, as I hope to show in a series of future papers.






The basic assumption is common to all human capital theory: that
individuals seek to maximize their incomes. It follows immediately that
each individual does whatever amount of toothbrushing will maximize
his income. The “mother told me so” model can be considered as a
special case where the offspring only does as he or she is told, but the
mother’s decisions are governed by income maximization for the child.
Thus, offspring will behave as if they maximized income.

An example will illustrate the usefulness of the model. Consider the
toothbrushing decisions of chefs and waiters working in the same
establishments. Since chefs generally come from higher socioeconomic strata,
the “mother told me so” model predicts that they will brush more
frequently than waiters. In fact, it has been shown that the reverse is true
(Barnard and Smith 1941). Of course, the human capital model predicts
precisely this behavior. On the benefits side, chefs are rarely seen by
customers and work on straight salary. Waiters, by contrast, are in
constant touch with the public and rely on tips for most of their income.
Bad breath and/or yellow teeth could have deleterious effects on their
earnings. On the cost side, since wages for chefs are higher, the opportunity
cost of brushing is correspondingly higher. Thus, the theory predicts
unambiguously that chefs will brush less. It is instructive to compare this
rather tight theoretical deduction with Barnard and Smith’s glib
attribution of the observed differences to the different hygiene standards in the
birthplaces of the individuals. (The chefs were born mostly in France,
while the waiters were largely Brooklynites.)


I. Review of the Literature 

A substantial literature on dental hygienics exists. It is ironic that
economists are almost completely unaware of these studies, despite the fact
that most economists brush their teeth.

The best empirical study was conducted by a team of researchers at
the University of Chicago Medical Center in 1967. They compared
toothbrushing habits of a scientifically selected sample of 27 sets of twins who
had appeared in Wrigley’s chewing gum commercials with a random
sample of 54 longshoremen. The twins brushed their teeth an average of
3.17 times per day, while the longshoremen brushed only 0.76 times daily.
The difference was significant at the 1 percent level. As noneconomists,
the doctors advanced two possible explanations for this finding: either
twins had a higher “taste for brushing” than nontwins, or the Wrigley
Company deliberately set out to hire people with clean teeth. Further
study, they concluded, would be needed to discriminate between these
two hypotheses (Baker, Dooley, and Spock 1968). The human capital
viewpoint makes the true explanation clear enough. Earnings of models
depend strongly on the whiteness of their teeth. On the other hand, no
direct connection has ever been established between the income of
longshoremen and the quality of their breath.

Another recent contribution was a survey of professors in a leading
Eastern university. It was found that assistant professors brushed 2.14
times daily on average, while associate professors brushed only 1.89 times
and full professors only 1.47 times daily. The author, a sociologist,
mistakenly attributed this finding to the fact that the higher-ranking
professors were older and that hygiene standards in America had advanced
steadily over time (Persons 1971). To a human capital theorist, of course,
this pattern is exactly what would be expected from the higher wages
received in the higher professorial ranks, and from the fact that younger
professors, looking for promotions, cannot afford to have bad breath.

II. A Theoretical Model of Toothbrushing

Let $w$ be the wage rate of an individual, let $J$ be an index of his job, and
let $B$ be the time spent brushing his teeth. With no loss of generality, I
can reorder the jobs so that jobs with higher $J$ are the jobs where clean
teeth are more important. The assumed wage function is therefore

$$w = w(J, B), \quad w_B \ge 0, \quad w_{BJ} = w_{JB} \ge 0. \tag{1}$$

Since jobs have been reordered, there is no a priori presumption about
the sign of $w_J$. It is also assumed that $w(\cdot)$ is continuous, twice
differentiable, and semistrictly quasi-concave in the nonnegative orthant.

Each individual is assumed to maximize his income,

$$Y = w(J, B)(T - B) + P, \tag{2}$$

where $T$ is the fixed amount of time per period available for working or
brushing,² and $P$ is the (exogenously determined) amount of unearned
income.³ That is, each individual selects a value of $B$ to maximize (2).
The necessary condition for a maximum is⁴

$$w_B(J, B)(T - B) - w(J, B) = 0. \tag{3}$$

Several important implications follow from (3). First, since both $w$ and
$w_B$ are presumptively positive, (3) implies that $T - B$ must be positive.
In words, the theory predicts that no person will spend every waking
hour brushing his teeth, an empirically testable proposition not derivable
from either the “bad taste” or “mother told me” models.

² It is assumed, for simplicity, that these are the only possible uses of time. The model
can easily be extended to accommodate an arbitrary number of uses of time, as is not
shown in an appendix.

³ A more general model would allow for the possibility that cleaner teeth can lead
to a larger inheritance, that is, $P(B)$ with $P'(B) > 0$. For evidence of this, see “Toothpaste
Heir Disinherited for Having Bad Breath,” Wall Street Journal, April 1, 1972, p. 1.

⁴ Since $w$ is assumed semistrictly quasi-concave, this is also sufficient for a weak
maximum.




Second, (3) can be rewritten

$$\frac{B}{T - B} = \frac{B w_B}{w}. \tag{4}$$


In words, the ratio of brushing to nonbrushing time is equated to the
partial elasticity of the wage with respect to brushing time. So individuals
in jobs where wages are highly sensitive to brushing will devote more time
to brushing than will others, as indicated in the verbal discussion. Also,
for any two jobs with equal $w_B$’s but unequal $w$’s, (3) implies that the
higher-wage person will brush less due to his greater opportunity cost.
Finally, consider the important case where (1) is linear in $B$ (though
possibly nonlinear in $J$):

$$w = \alpha(J) + \beta B, \quad \alpha \ge 0, \quad \beta \ge 0. \tag{1'}$$

Substituting into (3) and solving yields

$$B = \frac{T}{2} - \frac{\alpha}{2\beta}. \tag{5}$$


In jobs where brushing is immaterial to success, $\beta = 0$, so (5) calls for a
corner maximum with $B = 0$. Thus, we have a second strong prediction
from the model: such persons will never brush. At the other extreme, as
the ratio $\alpha/\beta$ approaches zero, (5) implies $B \to T/2$. In words, individuals
whose wages depend almost exclusively on the whiteness of their teeth
(M.C.’s of television quiz shows are a good example) will spend
approximately half their lives brushing. Again, no sociological theory can
generate predictions as strong as this.
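
The linear case can be sketched in a few lines. This is an illustration only, assuming the linear wage function (1') with hypothetical parameter values; the function names are not from the paper. It applies equation (5) and falls back to the corner solution B = 0 when the interior solution would be negative or when brushing does not raise the wage.

```python
def income(B, alpha, beta, T, P=0.0):
    """Income from equation (2) with the linear wage w = alpha + beta * B."""
    return (alpha + beta * B) * (T - B) + P


def optimal_brushing(alpha, beta, T):
    """Income-maximizing brushing time on 0 <= B <= T for the linear wage.

    Unearned income P does not affect the choice of B, so it is omitted.
    """
    if beta <= 0:
        return 0.0                                # brushing is immaterial: corner at B = 0
    interior = T / 2.0 - alpha / (2.0 * beta)     # equation (5)
    return max(0.0, interior)                     # negative interior value means B = 0


# Hypothetical values: 16 waking hours, three kinds of job.
for label, alpha, beta in (("teeth immaterial", 10.0, 0.0),
                           ("mixed job", 4.0, 1.0),
                           ("quiz show M.C.", 0.0, 1.0)):
    print(label, optimal_brushing(alpha, beta, T=16.0))
```

With alpha = 0 the sketch returns T/2, the half-a-lifetime prediction in the text; with beta = 0 it returns zero.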


III. A Regression Model

The implications of the model can be put to an empirical test thanks to a
recent cross-section study of American adults in the civilian labor force
conducted by the Federal Brushing Institute. In its Survey of Brushing, the
institute collected data on toothbrushing frequency and many
socioeconomic characteristics of 17,684 adults in 1972. From these data, the
following regression model was formulated:

$$\text{NBRUSH} = a_0 + a_1 \text{AGE} + a_2 \text{WAGE} + a_3 \text{NTEETH} + a_4 S + a_5 \text{EXP} + a_6 \text{FDUM} + a_7 Y + u. \tag{6}$$

The dependent variable is the number of times teeth were brushed during
the year. AGE is included as a proxy for the number of years remaining
before the individual’s teeth fall out. Viewing brushing as a human
investment clearly implies that $a_1 < 0$. WAGE, of course, measures the
opportunity cost of time; so $a_2 < 0$. NTEETH is the number of teeth in
the person’s mouth. Since brushing time is nearly independent of the
number of teeth brushed, having more teeth should certainly encourage
more brushing. S and EXP are, respectively, years of schooling and work
experience. They are included because this is a human capital model;
although there are no a priori expectations about the signs of $a_4$ and $a_5$,
both should have high t-ratios. FDUM is a dummy for persons who live
in an area with fluoridated water supply, included since there is some
substitution in the production function for good teeth between brushing
and fluoridating the water. Finally, Y is nonlabor income, which enables
us to estimate the income effect on toothbrushing frequency.

Since I have argued above that WAGE should depend on NBRUSH,
equation (6) was estimated by the instrumental-variables technique.
Denture wearers were included in the sample, but 189 people with no
teeth at all were omitted from the analysis. The empirical results are
reported below, with standard errors in parentheses:

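$$
\begin{aligned}
\text{NBRUSH} ={}& \underset{(0.63)}{2.04} - \underset{(0.001)}{0.006}\,\text{AGE} - \underset{(0.001)}{0.096}\,\text{WAGE} + \underset{(0.009)}{0.054}\,\text{NTEETH} \\
& + \underset{(0.0002)}{0.0043}\,S - \underset{(0.0001)}{0.0022}\,\text{EXP} - \underset{(0.027)}{0.146}\,\text{FDUM} + \underset{(0.0002)}{0.0006}\,Y, \\
& R^2 = .79, \quad \sigma = 0.056.
\end{aligned}
$$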

By any standards the results are very good. The $R^2$ is very high for
cross-section work, indicating that the data have been successfully mined.
All the variables suggested by the theoretical model are highly significant
and, wherever the theory implied a priori sign restrictions, they are
satisfied.
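
The significance claim can be checked directly from the reported figures. The sketch below simply divides each reported coefficient by its reported standard error to obtain a t-ratio; the dictionary layout and the label CONSTANT are conveniences introduced here, not part of the paper.

```python
# (coefficient, standard error) pairs as reported in the estimated equation.
estimates = {
    "CONSTANT": (2.04, 0.63),
    "AGE": (-0.006, 0.001),
    "WAGE": (-0.096, 0.001),
    "NTEETH": (0.054, 0.009),
    "S": (0.0043, 0.0002),
    "EXP": (-0.0022, 0.0001),
    "FDUM": (-0.146, 0.027),
    "Y": (0.0006, 0.0002),
}

# t-ratio = coefficient divided by its standard error.
t_ratios = {name: coef / se for name, (coef, se) in estimates.items()}

for name, t in t_ratios.items():
    print(f"{name:8s} t = {t:8.2f}")
```

Every ratio exceeds 2 in absolute value, and the signs on AGE and WAGE are negative, in line with the a priori restrictions stated in the text.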

In summary, the survey data strikingly confirm the predictions of the
theoretical model of toothbrushing presented here. Of course, this is only
one of many possible tests of the theory. But it does point out the usefulness
of human capital concepts in understanding dental hygiene. Hopefully,
these results will stimulate renewed interest in such questions on the part
of economists.

Alan S. Blinder

Princeton University


References 

Baker, M. D.; Dooley, C.; and Spock, B. “Brushing by Longshoremen and
Twins: A Case Study.” Q. J. Orthodontics 3 (1968): 377^2.
Barnard, C., and Smith, L. “Brushing Proclivities of Restaurant Employees in
New York City.” Rev. Periodontics and Dentistics 7 (1941): 1-2.
Persons, T. “Dental Hygiene and Age: A Sociological View.” J. Dental Soc. 11
(1971): 1-243.



Proxies and Dummies 


Let us remember the unfortunate econometrician who, in one of the
major functions of his system, had to use a proxy for risk and a dummy for
sex.

Fritz Machlup

New York University





The Cambridge-Cambridge Controversy
in the Theory of Capital; A View from
New Haven: A Review Article¹


Joseph E. Stiglitz

Yale University


Economics has long been plagued with controversies: the
Keynesian-monetarist controversy, the investment function controversy (is the
elasticity of substitution unity?), the liquidity preference-loanable funds
controversy. Geoffrey Harcourt has extended his survey article from the
Journal of Economic Literature (1969) into a book dealing with one of the
latest of these so-called controversies, that between Cambridge, England,
and Cambridge, Massachusetts, concerning capital theory. The book is
more balanced and more complete than the original survey article and
includes some good pieces of exposition.

The problems that I find with the book are basically problems I find
with the Cambridge (U.K.) theory of which he is a partisan on one side
(as, I suppose, I am on the other), and so, rather than focus on any errors
and confusions which are peculiar to Harcourt, I prefer to focus on three
of the major issues involved in the dispute and to suggest, in doing so,
where Harcourt (and the Cambridge [U.K.] theorists) have gone astray.

I. The Determination of Savings and the Rate of Interest

We all know that different individuals save different proportions of their
income, depending on their age, number of children, income, the riskiness
of their job, etc. Neoclassical economists have tended to emphasize the
demographic characteristics (the life-cycle hypothesis), or, alternatively,
when they wish a simplified exposition for students, they simply assume
savings is a constant fraction of income. The Cambridge economists have
emphasized alternatively the role of institutions (corporations) in
determining savings, and class behavior (whether the individual is a “worker”
or a pure “capitalist”). One would have thought that there would be an
easy way to resolve such a controversy: which hypothesis provides a more
accurate prediction of savings? Unfortunately, the hypotheses are not so
easy to verify. For instance, direct evidence on savings of corporations
may be misleading, for it is not unreasonable to assume that individuals
can see, at least partially, through the corporate veil, and that corporate
savings are a substitute for individual savings. (There appears to be some
recent econometric support in favor of this hypothesis. Indeed, given our
tax structure, it is to be expected that investment is largely financed by
retained earnings.) Flow-of-funds analysis can provide relatively little
information about the “real structure” of the economy.

I wish to acknowledge the helpful comments on an earlier draft of C. Bliss, J. Mirrlees,
R. Solow, D. Cass, and G. Harcourt (although, not surprisingly, there wasn't unanimity
on how the draft should be revised). Financial support from the National Science
Foundation and from the Ford Foundation is gratefully acknowledged.

¹ G. C. Harcourt, Some Cambridge Controversies in the Theory of Capital (New York:
Cambridge University Press, 1972).

The question arises, however: Why the excitement over one specification
of the savings behavior or another? The answer lies in the role that savings
behavior plays in the Cambridge (U.K.) theory in the determination of
the rate of interest in long-run equilibrium. In the simplest form, if
savings are a given fraction of profits, $s_p$, and all of wages are consumed,
the rate of interest, $r$, is equal to the rate of growth, $n$, divided by $s_p$:

$$r = n / s_p. \tag{1}$$

It is also true that with profit-maximizing competitive firms, in long-run
equilibrium where all relative prices are constant, the rate of interest is
equal to the own rate of return of every capital good² (the marginal
productivity of every capital good in terms of itself):

$$r = MPK_i. \tag{2}$$

The controversy arises in the peculiar theory of causation: the Cambridge
(U.K.) economists argue that the rate of profit is determined by the rate of
growth and $s_p$, and that this in turn determines the own marginal
productivity of capital goods. The reason for this view is that one can first
solve equation (1) for $r$, without knowing anything about the production
function or the factor supplies, and having solved for $r$, one can solve the
second equation for the marginal productivity of each capital good and
the capital goods-labor ratios.

There are three objections to this view. First, there is a confusion
between “causation” in a temporal sense and “causation” in the formal
analysis of the structure of a set of equilibrium conditions. It is only in
the temporal sense that causation has any economic significance. At any
moment, there is a given vector of capital goods and of labor. Under the
extremely simplified models conventionally used, these endowments
determine the marginal productivity of the different capital goods and
the rate of interest. Given the savings behavior, this determines the change
in the stocks of capital goods; eventually the economy converges to a
state where the rate of interest is equal to the rate of growth divided by
the savings propensity; still, at each moment, it is the “capital
goods-labor ratios” which determine the rates of return on the different capital
goods.

² I.e., the value of the marginal product of the capital good divided by its price. (I am
assuming differentiability, so the marginal products are defined. See below.)

Second, in more general models, there is a simultaneity of the
determination of the values of the variables of interest in the short- and
long-run equilibrium.³

Third, the Cambridge (U.K.) approach cannot be extended to the
determination of the relative factor prices if there are more than two
factors; for example, if labor is not homogeneous, we have to have a
theory which determines the relative prices of the different kinds of
laborers. The marginal-productivity theory provides an explanation of
all relative prices at the same time; the Cambridge theory has nothing to
say except that, somehow, however the factor prices of the different kinds
of labor are determined, the average wage must satisfy the “Cambridge”
equation.
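
The simultaneity point in the second objection can be illustrated with a small sketch. It assumes the modified savings relation given in the footnote to that objection, nK = s_p rK + s_w(Y − rK), and solves it for r at a given output-capital ratio; the function name and all numerical values are hypothetical. It is an arithmetic illustration only and does not model the Pasinetti two-class case.

```python
def steady_state_interest(n, s_p, s_w=0.0, output_capital_ratio=None):
    """Solve n*K = s_p*r*K + s_w*(Y - r*K) for the interest rate r.

    With s_w = 0 this is the simple Cambridge equation r = n / s_p.
    With s_w > 0, r depends on Y/K, which comes from the production side.
    """
    if s_w == 0.0:
        return n / s_p
    if output_capital_ratio is None:
        raise ValueError("Y/K is needed once there is saving out of wages")
    # Dividing through by K: n = (s_p - s_w) * r + s_w * (Y / K).
    return (n - s_w * output_capital_ratio) / (s_p - s_w)


# Hypothetical values: 3 percent growth, half of profits saved.
print(steady_state_interest(0.03, 0.5))
# With some saving out of wages, r changes with the output-capital ratio.
print(steady_state_interest(0.03, 0.5, s_w=0.05, output_capital_ratio=0.3))
print(steady_state_interest(0.03, 0.5, s_w=0.05, output_capital_ratio=0.4))
```

In the first call r is pinned down by n and s_p alone; in the other two it cannot be computed without information from the production side, which is the sense in which the two equations must be solved together.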


II. Reswitching of Techniques

In comparing economies in balanced growth, one set of techniques may
be chosen at a high and a low interest rate, and another set of techniques
at an intervening interest rate. This phenomenon, called “reswitching,”
is the extension to a whole economy of the well-known phenomenon of
multiple internal rates of return.⁴ All that this implies is that the weak
qualitative assumptions we conventionally make in economics (that is,
convexity of the technology, with its implications of diminishing returns)
do not have any strong implications for comparisons of economies in
steady state; that is, reswitching establishes that the derivation of simple
comparative dynamics propositions requires stronger assumptions than
those required for the existence of competitive equilibrium and the
derivation of qualitative properties concerning economies with given
initial endowments. (In steady-state analysis, on the contrary, we let the
initial capital stock be whatever it has to be to sustain the given steady
state.) The conventional economic assumptions do enable us to make
qualitative statements like “a decrease of a unit in the level of consumption
today, $c^0$, will allow an increase in consumption tomorrow, $c^1$, of $dc^1/dc^0$,
keeping consumption at all other dates constant; the one-period
consumption rate of interest is equal to $dc^1/dc^0$ and decreases (more accurately,
does not increase) with successive increases in consumption today.” (If
the technology is not differentiable, we must make the usual modifications
involving left- and right-handed derivatives.)⁵

³ For instance, if there are some savings out of wages, we replace eq. (1) by
$nK = I = s_p rK + s_w(Y - rK)$ (where $K$ is the capital stock, $I$ is investment, $Y$ is output,
$s_w$ is the savings propensity out of wages), in which case eqq. (1) and (2) must be solved
simultaneously. Pasinetti (1962) has shown that if there are two classes, workers and
capitalists (who do no work), then even though there is some savings out of wages,
eq. (1) still holds. But the result is not robust: if some capitalists save more than others,
in the Pasinetti model $r$ may be determined by the single capitalist with the highest
savings rate; on the other hand, if workers save too much, then the pure capitalists
disappear (in a relative sense), and eq. (1) no longer obtains. More generally, savings
today will depend on expectations about the future, and these in turn will depend on
the values of the relevant variables this period, which in turn depend on the level of
savings.

⁴ Assume we have a single consumption commodity, produced by labor and machines.
Machines last $T$ years and require 3 years to construct. Machine A requires $a_1$ units of
labor the first year, $a_2$ the second, $a_3$ the third; machine B requires $b_1$, $b_2$, and $b_3$ in the
first, second, and third years. Which machine will be used depends simply on the sign
of $(a_1 - b_1)(1 + r)^2 + (a_2 - b_2)(1 + r) + (a_3 - b_3)$. If $a_1 > b_1$, $a_2 < b_2$,
$a_3 > b_3$, then there may be two values of $r$, say $r_1^*$ and $r_2^*$, such that for $r \le r_1^*$ and
$r \ge r_2^*$ machine B is employed; for $r_1^* \le r \le r_2^*$, A is employed, and at $r_1^*$ and $r_2^*$ the
two machines are equally profitable. In an earlier terminology, we would have said
that this is an instance in which A is not an unambiguously more or less roundabout
method of production than B; in present terminology, we say that A is not unambiguously
more capital intensive than B. Note that consumption per man in a stationary state
using A will be more or less than that in a stationary state using B as $a_1 + a_2 + a_3 \lessgtr
b_1 + b_2 + b_3$. See Samuelson (1966) and the other articles in the Q.J.E. symposium
on reswitching.
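
The two-machine example in the footnote on multiple internal rates of return can be worked through numerically. The sketch below is an illustration under assumed labor requirements (the vectors a and b are hypothetical, chosen so that the first-year, second-year, and third-year inequalities of the footnote hold); it reads the footnote's expression as the compounded labor cost of machine A minus that of machine B, so that A is chosen where the expression is negative.

```python
import math


def cost_gap(r, a, b):
    """(a1-b1)(1+r)^2 + (a2-b2)(1+r) + (a3-b3): cost of A minus cost of B."""
    x = 1.0 + r
    return (a[0] - b[0]) * x ** 2 + (a[1] - b[1]) * x + (a[2] - b[2])


def cheaper_machine(r, a, b):
    """Return 'A' where A is strictly cheaper, otherwise 'B' (ties go to B)."""
    return "A" if cost_gap(r, a, b) < 0 else "B"


def switch_points(a, b):
    """Positive interest rates at which the two machines are equally costly."""
    d1, d2, d3 = (ai - bi for ai, bi in zip(a, b))
    disc = d2 * d2 - 4.0 * d1 * d3
    if d1 == 0 or disc < 0:
        return []                      # no pair of real switch points
    roots = sorted((-d2 + s * math.sqrt(disc)) / (2.0 * d1) for s in (-1.0, 1.0))
    return [x - 1.0 for x in roots if x > 1.0]   # roots are values of 1 + r


# Hypothetical labor inputs by construction year (a1 > b1, a2 < b2, a3 > b3).
a = (2.0, 0.4, 2.0)
b = (1.0, 3.0, 0.35)

print(switch_points(a, b))
for r in (0.05, 0.30, 0.80):
    print(r, cheaper_machine(r, a, b))
```

With these assumed inputs, machine B is chosen at both the low and the high interest rate and machine A in between, which is the reswitching pattern the footnote describes.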

A comparison between the role of the “reswitching phenomenon” in
capital theory and “Giffen’s paradox” in consumer theory may be
instructive. Competitive-equilibrium theory generally assumes convexity
of preference sets, but to make qualitative propositions, even weak ones
such as that demand curves are downward sloping, requires stronger
assumptions. Giffen’s paradox was important in the development of
demand theory; the resolution of the change in demand into income and
substitution effects is, most of us would agree, insightful. The reswitching
phenomenon may eventually play a similar role in capital theory. But I
doubt it, for two reasons.

1. Steady states are of limited interest in themselves; even the best of
well-run economies never have a choice of steady states. They have a
choice of consumption paths beginning with present initial conditions;
conventional assumptions do allow us to make qualitative statements
about such paths (economies may, of course, eventually go to one steady
state or another).

Moreover, it is easy to develop, using steady-state analysis, all manner
of paradoxes.⁶ For instance, the opening of free trade may actually lower
steady-state consumption (this does not contradict the classical
propositions concerning gains to trade).⁷ One can show that of all the
feasible steady states in a life-cycle model, the one which maximizes
steady-state utility is not sustainable by a competitive equilibrium
(without appropriate lump-sum transfers), and conversely. (This does
not contradict classical welfare propositions.)⁸

⁵ That is, if $(\partial c^1/\partial c^0)^-$ and $(\partial c^1/\partial c^0)^+$ are the left- and right-hand derivatives, then
the one-period rate of interest lies between $(\partial c^1/\partial c^0)^-$ and $(\partial c^1/\partial c^0)^+$.

⁶ Elsewhere (Stiglitz 1973b) I have discussed the concept of recurrence, that in the
process of economic growth one technique is used, then another, and then the economy
returns to the earlier technique. Recurrence can occur in technologies in which
reswitching does not occur, and need not occur in technologies in which reswitching does
occur.

2. Though Giffen’s paradox is irrelevant for most commodities, there
are a few, perhaps potatoes and cheap wine, which have upward-sloping
demand curves. There is thus some empirical basis for an interest in
Giffen’s paradox.⁹ So far, no one has provided us with a corresponding
empirical basis for an interest in reswitching. The reason, I suspect, is
that the conditions under which it can be ruled out are very weak; for
instance, reswitching cannot occur if, in only one industry in the economy,
it is possible, by an increase in labor, either to reduce the input of some
other factor, keeping other factors and output fixed, or to increase output,
keeping all other factors fixed.¹⁰ For reswitching to occur not only must
there be essentially no substitutability in every industry, but also there
must be large changes in the relative price for different capital goods, for
it is the change in the relative price of capital goods which, in effect, leads
one process to be more “capital intensive” at one interest rate (so the
cost of using it rises more with an increase in the interest rate than the
cost of using the other process) while it is less capital intensive than the
other process at other interest rates. The question is, Are the kinds of
changes of interest rates experienced in a growth process likely to be
associated with such large changes in relative prices? Unfortunately, this
is not a question which can really be answered, since most of the changes
in relative prices which we observe have more to do with technical
change, ignored in the discussion of reswitching, than with changes in
interest rates; perhaps this simply testifies to the irrelevance of the
reswitching controversy.

Thus, although reswitching has no implications for the validity of
neoclassical distribution theory or for qualitative statements concerning
the consequences of a given economy’s increasing its level of consumption,
it does mean that statements such as “the reason that one economy has a
lower interest rate than another is that the former economy has a higher
capital labor ratio” may be seriously questioned, for it is possible that a
given economy have a lower interest rate than another and a higher
interest rate than a third, but that the two comparison economies be
identical in all physical respects. But by the same token statements like
“sales of the commodity were high because the price was low” may be
questioned, because it is possible for the higher sales to be associated with
higher prices as well. (It should be noted, however, that in comparing
economies at the same time, which can trade capital goods with one
another so that they have the same relative prices [approximately], these
problems cannot arise.)

⁷ See Stiglitz (1970).

⁸ Although the possibility, in the absence of money, of oversaving (rate of interest
less than rate of growth) in a competitive equilibrium is an example of a competitive
equilibrium which is not Pareto optimal. See, e.g., Samuelson (1958), Cass and Yaari
(1967), and Diamond (1965).

⁹ It is perhaps worth noting that the occurrence of Giffen’s paradox depends on the
level of aggregation chosen; wine is not likely to be Giffen, “cheap” wine is.

¹⁰ Not only must there be this limited amount of substitutability, it must be “smooth
substitutability”; i.e., the directional derivative must be defined. This is still much
weaker than differentiability. For this and other “nonreswitching theorems,” see Stiglitz
(1973b).

Some of the confusion arises because in dynamic analysis, just as in
static analysis, there is a distinction between macroeconomic and
microeconomic models. Although we all recognize that there should be a close
relationship between the two, the models which are appropriate for
answering one class of questions may not be appropriate for answering
another. Thus, to answer the kinds of questions conventionally raised in
general equilibrium analysis, we never need to talk about an aggregate
savings function; we need only impose fairly weak restrictions on each
individual’s preferences. In macroeconomics, we want fairly precise
qualitative (and indeed quantitative) answers to particular questions,
such as the effect of a change in some policy. This necessitates making
stronger assumptions. The criticism of neoclassical growth theory is
really an assertion that the restrictions embodied in neoclassical
macroeconomic models do not necessarily follow from the microeconomic
(disaggregative) models from which they should be derived. But
the same criticism could be leveled against the static macroeconomic
models.

Thus, to put the matter another way, if you accept (as most participants
on both sides of the controversy do) the von Neumann
(Sraffa-Samuelson-Solow, et al.) dual price-interest rate inequalities and the assumption
that at any prices the cost-minimizing techniques are chosen, you have
accepted the full content of the (microeconomic) neoclassical analysis of
steady states. Reswitching says something about what that does or does
not imply for comparisons of steady states, but no more.


III. Aggregate Capital

The first major attack of the Cambridge (U.K.) economists was not
concerned with the saving hypothesis or the “reswitching phenomenon”
(indeed, the latter was considered more of a curiosum, or a “perversity,”
than a serious economic problem) but with the existence of an aggregate
capital stock. But like the reswitching phenomenon, this is now recognized
to be a red herring, and for very much the same reasons. Neoclassical
distribution theory nowhere requires the use of aggregates, and to make
qualitative statements of the kind discussed above requires only convexity
of the technology.*

Wicksell (a neoclassicist!) was the first to discover the difficulties one
encounters in making comparative dynamic statements involving the
aggregate value of capital stock, consumption, and interest rates of
economies in long-run equilibrium; he discovered, for instance, that
changes in the consumption per unit of change in the value of the capital
stock might not be equal to the rate of interest, because of changes in the
relative price of capital goods. I have already argued that from a
theoretical point of view, the analysis of steady states is not of much
economic significance and may be misleading, and that the basic qualitative
properties of economies out of steady state do not in any way depend
on the ability to form aggregates. From a practical point of view,
economists are always dealing with aggregates: one person’s labor is aggregated
with another, one piece of land is aggregated with another, one kind of
steel is aggregated with another, even though they all have different
properties. The conditions under which these aggregates can be formed,
that is, under which the aggregates act as if they were homogeneous
factors of production, are very restrictive; nonetheless, I believe that,
under most circumstances and for most problems, the errors introduced
as a consequence of aggregation of the kind involved in standard
macroanalysis are not too important. Even so, we must always be on our
guard for situations in which this is not true. The question is, Do the
problems associated with the accumulation of capital in growth processes
represent one area in which properly formulated aggregates (e.g., using
chain indices) are likely to lead to serious error? This, I suggest, remains
a moot question.
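
Wicksell's difficulty with the value of capital can be made concrete with a short derivation. What follows is only a sketch under assumptions that are not in the text: a stationary economy (zero growth) with capital goods $K_i$ valued at prices $p_i$ in terms of consumption, and the steady-state relation $dC = r \sum_i p_i\, dK_i$ between neighboring steady states, which is taken as given here. The symbols $C$, $V$, $K_i$, and $p_i$ are introduced purely for illustration.

```latex
\begin{align*}
V &= \sum_i p_i K_i
  && \text{(aggregate value of capital)} \\
dV &= \sum_i p_i\, dK_i + \sum_i K_i\, dp_i
  && \text{(real change plus revaluation)} \\
dC &= r \sum_i p_i\, dK_i
  && \text{(assumed steady-state relation, zero growth)} \\
\frac{dC}{dV} &= r \cdot
  \frac{\sum_i p_i\, dK_i}{\sum_i p_i\, dK_i + \sum_i K_i\, dp_i}
\end{align*}
```

On these assumptions, $dC/dV$ equals $r$ only when the revaluation term $\sum_i K_i\, dp_i$ vanishes. That is one way to read the statement that changes in the relative price of capital goods drive a wedge between the interest rate and the change in consumption per unit change in the value of capital.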

* In an attempt to deemphasize the role of aggregate capital, Solow (see, e.g., Solow
1963, p. 98) has emphasized an alternative “aggregate”: the consumption rate of
return (aggregating over different consumption goods rather than over different capital
goods). It can be shown, for instance, that if A and B are two technologies which are
both viable at a given rate of interest r* (i.e., they both make zero profits and all other
technologies make losses, at the given rate of interest and the associated prices), and if
the economy is in steady state in A and wishes to “switch” to steady state B, the present
discounted value, at the rate of interest r*, of the change in consumption (from that on
the path remaining at A) is zero, provided that the economy fully utilizes its resources
at every date. This result does not in any way depend on the ability to form aggregate
capital stocks, or on the possibility of reswitching. Nor does the fact that in the steady
state A there is more of a particular capital good than in steady state B imply that in
the transition some of the capital good must become redundant, as Pasinetti (1969)
seems to have suggested. (The conditions under which a full-employment transition is
possible are far more subtle than that.)

** The other comparative dynamic propositions which have been shown not to be
generally valid are: (a) economies with higher interest rates have lower values of
steady-state consumption; (b) economies with higher interest rates have lower aggregate
capital-labor ratios. The “perverse” case can occur even without reswitching, and,
indeed, it can be shown that it may occur even in a mild modification of Wicksell’s model.
See Stiglitz (1973a).




There are other issues in capital and growth theory which are not
discussed in this book, and these omissions are perhaps as revealing as the
topics which are discussed. For instance, as I have noted above, most of
the analysis concerns steady-state growth paths. The interesting and
difficult questions concerning disequilibrium dynamics, expectation
formation, and adjustment processes cannot be properly analyzed in that
context.* Second, the models analyzed all ignore the role of the government
and, in particular, of the monetary authorities in determining the
growth path of the economy. Finally, the vast amount of work in normative
capital theory (for example, the characterization of efficient paths
and the analysis of optimal growth paths) is completely ignored. Even
if one doubts the validity of the neoclassical models as a description of how
competitive economies develop, the models may prove to be of some use
in planned economies.

It is difficult to leave this subject without asking, How can we explain
the Cambridge-Cambridge controversy? This is a question which a
sociologist of knowledge might perhaps be in the best position to answer.
Although “personalities” clearly have something to do with the dispute,
the fact that the controversy has received such widespread interest
suggests that there is more to it than just that. I suggest that there are
four reasons, all well illustrated in this book, why a seemingly dry
subject like capital theory is the subject of such controversy.

1. Harcourt and the Cambridge (U.K.) economists keep insisting that
adherence to the alternative approaches to capital theory is a matter of
ideology. As Harcourt puts it: “It is my strong impression that if one were
to be told whether an economist was fundamentally sympathetic or
hostile to basic capitalist institutions, especially private property and the
rights to income streams, or whether he were a hawk or a dove in his
views on the Vietnam War, one could predict with a considerable degree
of accuracy . . . which side he would be on in the present controversies.”
Without examining the empirical validity of the statement,** one can
see that it illustrates the common confusion in the Cambridge (U.K.)
approach between correlation and causation. For Harcourt and the
Cambridge (U.K.) economists, one cannot separate ideology from
economic analysis, just as one cannot separate the determinants of the
rewards to factors from the analysis of the personal distribution of income
and the patterns of expenditure of that income. Many other economists
would argue that ideology plays a far less important role in their economic
analysis than Harcourt would suggest. It may have some limited influence
on the questions we ask, but not on how we go about answering them;
for instance, the F-test is not ideological, nor is the logical consistency of
the assumptions of neoclassical analysis an ideological issue.

* For instance, the differences in stability properties of the Harrod and Solow models
and the questions of stability of economies with heterogeneous capital goods (see Hahn
1966) are never discussed. The question of whether “savings determines investment or
investment determines savings” is also a question which cannot be properly analyzed in
a steady-state model with full employment, for in that context, an increase in savings
must necessarily be associated with an increase in investment.

** As I note below, one of the chief characteristics of the Cambridge (U.K.)
“methodology” is a heavy reliance on “casual empiricism” and a reluctance to subject
alternative hypotheses to statistical verification.

2. There has been a remarkable absence of any attempt at empirical
verification of any of the underlying hypotheses at any but the most
casual level by advocates of the Cambridge (U.K.) approach. Certainly
some of the underlying hypotheses should be testable (e.g., the alternative
savings hypotheses). In fact, on many of the issues, the economic data we
have may not be adequate to say that one approach is much better than
another; yet a proper account of the alternative approaches would have
made clear what tests should be performed.

3. What makes the matter more difficult is not only the absence of any
attempt at empirical verification with any but the crudest of “casual
empiricism” but a fundamental misunderstanding of the role of economic
theory. A model which is appropriate for illuminating one class of problems
may not be adequate for illuminating another: we know that
Newtonian physics is only a first approximation, that gravity does not
quite vary with the square of the distance, etc. Yet for most of the
questions we wish to discuss in elementary physics courses, it is perfectly
adequate. Similarly, an aggregative growth model may do reasonably
well in explaining long-term movements in certain macroeconomic
variables.

4. There is a well-known propensity of individuals to dislike what they
don’t or can’t understand. This book, as well as the writings of the
other Cambridge economists, makes perfectly clear that they do not
understand neoclassical capital theory. A modern treatise on capital
theory which makes no reference to Malinvaud, to Debreu, or to
Samuelson’s consumption loan model is indeed striking. To a neoclassical
economist, (corn, money, . . .) interest rates are just intertemporal prices.
If $p_c(t + 1)$ is the price today of corn delivered at time $t + 1$, then the corn
interest rate is just $[p_c(t + 1)]/[p_c(t)] - 1$. The relationship between
prices (interest rates) and marginal products, in a competitive economy, is
fundamentally the same in the capital theoretic models as it is in the
static models. One can object to the competitive assumption, one can say
that firms use markup pricing (without explaining how the markups are
determined, except vague references to competitive forces), but these are
not controversies in capital theory. Indeed, most economists would argue
that the competitive profit-maximizing model is a more appropriate
description for the behavior of the economy in the long run than it is in
the short run; that is, there is more competition in the long run than in
the short, and in the long run, firms that happen to maximize profits will
do better than firms that don’t (the Schumpeterian approach). Thus, the
appropriateness of the marginal-productivity theory as a theory of
distribution of income among factors is completely unrelated to any of
the controversies concerning double switching, savings behavior, or the
aggregation of capital.

(There is one important difference between the “static” model and the
intertemporal model arising from the almost complete absence of futures
markets. This means that an analysis of expectation formation is crucial
in the description of dynamic economies. To their credit, the Cambridge
[U.K.] economists have stressed the role of expectations and stochastic
disturbances, but unfortunately the role of expectation formation is
obscured by focusing on “Golden Ages” [steady-state analysis] as the
Cambridge economists, as well as most others, have done.)

Yet it appears that it is the confused attempt to discredit the marginal-
productivity interpretation of the interest rate which imbues the topics
of capital theory with their ideological interest to the devotees of
Cambridge (U.K.) doctrine.

Even though Harcourt’s book will not lead to a resolution of the issues,
or even to a resolution of what are the real issues, by setting out in perhaps
as complete and intelligible a manner as is possible one side’s view of
the debate, Harcourt may have performed a service.


References

Cass, D., and Yaari, M. E. “Individual Saving, Aggregate Capital Accumulation,
and Efficient Growth.” In Essays on the Theory of Economic Growth, edited by
K. Shell. Cambridge, Mass.: M.I.T. Press, 1967.

Diamond, P. A. “National Debt in a Neoclassical Growth Model.” A.E.R. 55,
no. 5, pt. 1 (December 1965): 1126-50.

Hahn, F. H. “Equilibrium Dynamics with Heterogeneous Capital Goods.”
Q.J.E. 80, no. 4 (November 1966): 633-46.

Harcourt, G. C. “Some Cambridge Controversies in the Theory of Capital.”
J. Econ. Literature 7, no. 2 (June 1969): 369-405.

Pasinetti, L. L. “Rate of Profit and Income Distribution in Relation to the Rate
of Economic Growth.” Rev. Econ. Studies 29 (October 1962): 267-79.

---. “Switches of Technique and the ‘Rate of Return’ in Capital Theory.”
Econ. J. 79 (September 1969): 508-31.

Samuelson, P. A. “An Exact Consumption-Loan Model of Interest with or
without the Social Contrivance of Money.” J.P.E. 66, no. 6 (December 1958):
467-82.

---. “A Summing Up.” Q.J.E. 80, no. 4 (November 1966): 568-83.

Solow, R. M. Capital Theory and the Rate of Return. Amsterdam: North-Holland,
1963.

Stiglitz, J. E. “Factor Price Equalization in a Dynamic Economy.” J.P.E. 78,
no. 3 (May/June 1970): 456-88.

---. “The Badly Behaved Economy with the Well Behaved Production Function.”
In Models of Economic Growth, edited by J. Mirrlees and N. Stern. Proceedings
of the International Economic Association conference, Jerusalem, 1970.
London: Macmillan, 1973. (a)

---. “Recurrence of Techniques in a Dynamic Economy.” In Models of
Economic Growth, edited by J. Mirrlees and N. Stern. Proceedings of the
International Economic Association conference, Jerusalem, 1970. London:
Macmillan, 1973. (b)








Journal of Political Economy 


Volume 82 Number 2, Part II March/April 1974 


Prefatory Note

The High Value of Human Time: Population Equilibrium
Theodore W. Schultz

A Theory of Marriage: Part II
Gary S. Becker

Comment: The Economics of Nonmonetary Variables
William J. Goode

The United States Marriage Market
Alan Freiden

Comment
T. Dudley Wallace

Benefits of Women’s Education within Marriage
Lee Benham

Comment
Finis Welch

Family Investments in Human Capital: Earnings of Women
Jacob Mincer and Solomon Polachek

Comment
Otis Dudley Duncan

Home Investments in Children
Arleen Leibowitz

Comment
Frank P. Stafford

Effects of Child-Care Programs on Women’s Work Effort
James J. Heckman

Comment
Sherwin Rosen

Economics of Postwar Fertility in Japan: Differentials and Trends
Masanori Hashimoto

Comment
Gary R. Saxonhouse

Household and Economy: Toward a New Theory of Population
and Economic Growth
Marc Nerlove

Comment
Zvi Griliches

Combined References



National Bureau of Economic Research 
1974 


Officers: Arthur F. Burns, Honorary Chairman; Walter W. Heller, Chairman;
J. Wilson Newman, Vice Chairman; John R. Meyer, President; Thomas D. Flynn,
Treasurer; Douglas H. Eldridge, Vice President-Executive Secretary; Victor R.
Fuchs, Vice President-Research and Co-director, NBER-West; Edwin Kuh, Director,
Computer Research Center; Hal B. Lary, Vice President-Research; Robert E. Lipsey,
Vice President-Research; Sherman J. Maisel, Co-director, NBER-West; Geoffrey
H. Moore, Vice President-Research; Edward K. Smith, Vice President.

Directors at Large: Atherton Bean, International Multifoods Corporation; Joseph
A. Beirne, Communications Workers of America; Arthur F. Burns, Board of Governors
of the Federal Reserve System; Wallace J. Campbell, Foundation for Cooperative
Housing; Erwin D. Canham, Christian Science Monitor; Emilio G. Collado,
Exxon Corporation; Solomon Fabricant, New York University; Eugene P. Foley,
Montrose Securities, Inc.; Eli Goldston, Eastern Gas and Fuel Associates; David L.
Grove, International Business Machines Corporation; Walter W. Heller, University
of Minnesota; Vivian W. Henderson, Clark College; John R. Meyer, Harvard
University; J. Irwin Miller, Cummins Engine Company, Inc.; Geoffrey H. Moore,
National Bureau of Economic Research; J. Wilson Newman, Dun & Bradstreet, Inc.;
James J. O’Leary, United States Trust Company of New York; Alice M. Rivlin,
Brookings Institution; Robert V. Roosa, Brown Brothers Harriman & Co.; Boris
Shishkin, Washington, D.C.; Arnold M. Soloway, Jamaicaway Tower, Boston,
Massachusetts; Lazare Teper, International Ladies’ Garment Workers’ Union;
Donald B. Woodward, Riverside, Connecticut; Theodore O. Yntema, Oakland
University.

Directors by University Appointment: Moses Abramovitz, Stanford; Gardner
Ackley, Michigan; Charles H. Berry, Princeton; Francis M. Boddy, Minnesota;
Otto Eckstein, Harvard; Walter D. Fisher, Northwestern; R. A. Gordon, California;
Robert J. Lampman, Wisconsin; Maurice W. Lee, North Carolina; Almarin
Phillips, Pennsylvania; Lloyd G. Reynolds, Yale; Robert M. Solow, Massachusetts
Institute of Technology; Henri Theil, Chicago; William S. Vickrey, Columbia;
Thomas A. Wilson, Toronto.

Directors by Appointment of Other Organizations: Eugene A. Birnbaum,
American Management Association; Thomas D. Flynn, American Institute of Certified
Public Accountants; Nathaniel Goldfinger, American Federation of Labor and Congress
of Industrial Organizations; Harold G. Halcrow, American Agricultural Economics
Association; Walter E. Hoadley, American Finance Association; Philip M. Klutznick,
Committee for Economic Development; Roy E. Moor, National Association of Business
Economists; Douglass C. North, Economic History Association; Willard L. Thorp,
American Economic Association; W. Allen Wallis, American Statistical Association;
Robert M. Will, Canadian Economics Association.

Directors Emeriti: Percival F. Brundage; Frank W. Fetter; Gottfried Haberler;
Albert J. Hettinger, Jr.; George B. Roberts; Murray Shields; Joseph H. Willits.

Senior Research Staff: Gary S. Becker; Charlotte Boschan; Phillip Cagan;
Stanley Diller; Solomon Fabricant; Milton Friedman; Victor R. Fuchs;
J. Royce Ginn; Raymond W. Goldsmith; Michael Gort; Michael Grossman;
F. Thomas Juster; John F. Kain; John W. Kendrick; Irving B. Kravis; Edwin
Kuh; William M. Landes; Hal B. Lary; Robert E. Lipsey; Benoit B.
Mandelbrot; John R. Meyer; Robert T. Michael; Jacob Mincer; Ilse Mintz;
Geoffrey H. Moore; M. Ishaq Nadiri; Nancy Ruggles; Richard Ruggles;
Anna J. Schwartz; Robert P. Shay; Edward K. Smith; George J. Stigler;
Victor Zarnowitz.


Other Conference 

Relation of the National Bureau Directors to
Publications Reporting Conference Proceedings

Since the present volume is a record of conference proceedings, it has been
exempted from the rules governing submission of manuscripts to, and critical
review by, the Board of Directors of the National Bureau. [Resolution adopted
July 6, 1948, as revised November 21, 1949, and April 20, 1968]


The current interest in population economics at the National Bureau of Economic
Research has been encouraged by a four-year grant from the Ford Foundation.
The National Bureau wishes to acknowledge with thanks the Ford Foundation’s
support of this research program and the two population conferences. The
Population Council’s joint sponsorship of these conferences is also gratefully
acknowledged.



Prefatory Note 


All papers, except the first, appearing in this, the second Supplement to
the Journal of Political Economy devoted to the economics of fertility, were
presented at a conference held June 4 and 5, 1973. The searching and
critical discussion that followed the invited comment on each of these
papers helped the authors appreciably in the revisions that most of them
made prior to publication.

The National Bureau of Economic Research and the Population Council
are the sponsors and provided the funds for both the 1972 conference, the
proceedings of which appeared in the March/April 1973 Supplement,
“New Economic Approaches to Fertility,” and the 1973 conference, from
which this Supplement evolved. In planning the 1973 program, I had the
advice of Gary S. Becker, Victor Fuchs, Robert T. Michael, Marc
Nerlove, T. Paul Schultz, and Robert J. Willis.

Once again I apologize to those who wished to attend the conference
but for whom there was no room because of my decision to keep the
number of participants small. Others who attended and took part in the
conference, in addition to the authors and discussants listed in this
Supplement, were Mary Jean Bowman, Harley Browning, Tomas Frejka,
Victor Fuchs, H. Gregg Lewis, Robert T. Michael, Warren Sanderson,
T. Paul Schultz, Joseph J. Spengler, Boone A. Turchi, and Robert J.
Willis.

The favorable response to the combined list of all references, as they
were presented in the first Supplement, has led Mrs. Virginia K. Thurner
and me to repeat the combined listing of all references in this Supplement.
As was true last time, the editorial quality of this Supplement owes
much to the competent work of Mrs. Thurner.

Theodore W Schultz 


July 25, 1973




The High Value of Human Time: 
Population Equilibrium 


Theodore W. Schultz 

University of Chicago


In the continuing economic dialogue between theory and data about
fertility, we see once again that fertility is influenced by the differences in
the opportunity costs of children associated with the stock of human
capital of mothers. The quality of the sacrifices that parents make in
acquiring the satisfactions they derive from children influences the quality
of the children. But quality is expensive. On these and other important
points, the core of the studies in this volume provides additional evidence
supporting the first set of studies (T. W. Schultz 1973r). Mincer and
Polachek evaluate the on-the-job training that women forego on leaving
their jobs to have and to care for their children; Heckman examines the
effects of child-care programs on the labor-force work efforts of women.
Leibowitz, using a unique sample of children with data on them spanning
virtually four decades, provides important new evidence on home
investment in children. These studies are complemented by Becker’s
extensions of his theory of marriage and by Freiden’s application of the
theory to the U.S. marriage market. Benham advances a hypothesis for
interpreting the effects of wives’ education on the earnings of their
husbands.

Turning to the marked decline in fertility in Japan, Hashimoto shows
that much of it is explained by the economic variables that are featured
in the household approach to fertility. But the sociologists who participated
in the conference (Browning, Duncan, and Goode) challenged us
repeatedly with the argument that we are not bold enough in extending
and applying our economic theory to fertility.

Admittedly, except for the Taiwan study by T. Paul Schultz (1973),
we have been shy in extending our theory to fertility behavior in low-income
countries. Then, too, with the exception of Nerlove’s paper in this
volume, we have been timid in developing a framework for analyzing the
interactions between the micro behavior of households and the changing
state of the economy.* Griliches’s comment advances the inquiry, and the
searching discussion that followed at the conference indicated clearly the
importance of this neglected problem. Although boldness is called for in
solving this problem, it alone will not suffice.

I am indebted to Gary S. Becker, Marc Nerlove, and T. Paul Schultz for their helpful
comments on an early draft of this paper.

As the title of my paper implies, I shall examine issues that are related
to Nerlove’s analysis. The latter part of his paper opens an important new
vista. Since his approach to the high price of human time parallels my
recent work, as he notes in his paper, and since it has been in the forefront
of my thinking, my aim is to supplement this part of his paper. I shall first
consider two widely different population equilibrium concepts; in support
of the second concept I shall extend the explanation of the secular
increases in the value of human time. I will then take a critical look at the
usefulness of the household model in analyzing fertility in countries
characterized by high birth and death rates and by substantial increases
in national income in a context where the level of the economic value of
human time is very low and not increasing much.

I

The idea of a population equilibrium is here viewed strictly as an analytical
concept to guide economic thinking in deriving testable propositions
pertaining to fertility. As an analytical device there is no presumption
that any population ever has arrived or will arrive at a precise equilibrium.
In this respect the concept of a population equilibrium is on a par with
the many other equilibrium concepts that abound in economic analysis.

Two very different concepts of a population equilibrium can now be
formulated. They may be viewed as types at the two extremes with respect
to the state of the economy. The first rests on the proposition that the
equilibrium is basically a consequence of increases in the price of the
services of natural resources relative to capital and labor (wages). The
underlying proposition of the second concept is that the equilibrium is
determined predominantly by increases in the price of human time
relative to that of materials. The per capita income implication of the
first is subsistence and that of the second a high standard of living.

* For concern about aspects of this problem, see Yoram Ben-Porath (1973), T. Paul
Schultz (1973), and Marc Nerlove and T. Paul Schultz (1970).

The first concept, as it was envisaged by the early English economists,
has long been a standard part of economics. It assumes that the supply of
land is fixed and that diminishing returns gradually increase the price of
food as a consequence of population growth. Gains in productivity from
capital are exhausted by this process. This concept can, of course, be
extended to encompass the results of the recent macrosystem models that
purport to show the limits of the earth in accommodating population
growth. These models are not restricted solely by the availability of land
to produce food, since they also include the physical limits set by the
availability of minerals, energy, and space for people. The fertility
behavior of people in these models is crudely Malthusian; population growth
stops (suddenly) as a consequence of the inevitable food, energy, and
space crisis. Within the Ricardian framework, this concept is a logical
conception of a population equilibrium. It is a dismal view of human
behavior that has long been an important idea in social thought.

The foundation of the second concept is the high price of human time
relative to the price of the services of material factors and goods. The
concept rests on the proposition that the state of the economy is such that
the economic role of the services of natural resources and of intermediate
material products is small relative to the role of the services of human
agents in production and in consumption, in the sense that the value of
the contribution of materials to human satisfactions is small compared
with that of human agents. In the context of such an economy, the
opportunity cost of bearing children is high and the investment in their
human capital is large. The welfare implications of this concept are
unmistakably optimistic because the gains in productivity from the
accumulation of human and nonhuman capital are transformed into high
standards of living supported by high per capita income. Advances in
useful knowledge, embodied in human and nonhuman capital, have
gradually destroyed the assumption of the fixed supply of the “original
properties of the soil.” In the process, it is the scarcity of human time and
its high value that dominate, and it is the “fixed supply of human time”
consisting of 24 hours per day and of a man’s lifetime that becomes the
critical factor in analyzing the economic behavior of people, including
their fertility.

Empirically, there is an abundance of evidence which shows that the 
price of human time accounts for most of the costs in a modem economy 
The upward tendency of real wages and salancs, including fringe benefits, 
of earnings foregone by mature students, and of the value of the time of 
housewives relative to the price of materials is well documented ^ 
Economic theory implies, and we observe, that matenal goods are 
substituted for human time by firms and by households. Received theory, 

² Evidence, for example, on long-term changes in wages and salaries relative to rent
paid for the services of farmland in the United States shows that the total real
compensation per hour at work of all manufacturing-production workers increased between
1929 and 1970 more than four times as much as did the rent on farm real estate per acre,
similarly adjusted, under the assumption that rents tended to parallel the changes in the
price of farm real estate per acre (T. W. Schultz 1972^).




however, is silent on the effects of the high and rising price of human time
on pure consumption, although consumption obviously entails time. In
my thinking, the ultimate economic limit of affluence (economic growth)
is not in the scarcity of material goods but in the scarcity of human time
for consumption.³

Theoretically, the critical postulate assumes that there is a dynamic
process that determines the increases in the price of human time relative
to the price of the services of the nonhuman factors and that this process
tends toward an equilibrium. The dynamic part is the economic key to
the following four issues: (1) the relative increase in investment in human
capital augmenting the quality of human beings; (2) the relatively high
price of all labor time-intensive goods and other sources of labor-intensive
satisfaction, including children, thus leading to the substitution of quality
for numbers of children; (3) the relatively cheap material goods that are
not labor-intensive; and (4) the scarcity of the time for consumption,
setting the ultimate limit to the satisfactions that can be derived from
materials provided by economic growth.

Although it is obvious that the economic value of human time is high
in the affluent countries that have a modern economy, it is not obvious
why these economies have developed the demand for and supply of human
abilities that have such a high value in terms of earnings and satisfactions
that people derive from them. I shall elaborate on these developments
(see T. W. Schultz 1972A, 1973a) and examine some of their basic aspects.

My approach to the persistent secular increase in the economic value
of human time consists of a highly simplified framework to get at the
supply and demand developments that appear to be determining the rise
of the price of human time in the context of the modernizing processes.
The developments explaining the increases in the supply of the quality
attributes of human agents are fairly clear, whereas the developments
underlying the increases in the demand for these quality attributes are
still not clear. Recent advances in economic analysis (summarized in
T. W. Schultz 1972a and 19734) provide the major parts of the theory
for determining the supply of these quality attributes. They treat the
useful abilities that people acquire as forms of human capital. The
investment in these abilities is taken to be in response to favorable
investment opportunities, and thus the increases in the supply depend on
current expenditures (sacrifices) made by individuals, by families, and by
public bodies on education, health, job training, the search for
information, and geographical migration to take advantage of better jobs or of
better consumption opportunities. These expenditures (sacrifices) are

³ The approach outlined in this paragraph and a considerable part of the argument
that follows appear in my Woody Thompson lecture to the Midwest Economics
Association, “Explanation and Interpretations of the Increasing Value of Human Time”
(T. W. Schultz 1973a).











presumably made deliberately with an eye to future satisfactions and
earnings. The theory of the allocation of time and the household
production model are of special importance in analyzing the incentives and
responses of people in acquiring education and job training, in enhancing
their health, in searching for information, and in altering their fertility,
including the substitution of quality for numbers of children. Thus, these
supply responses to the increases in the economic incentives associated with
modernization are not hard to comprehend. The human capital literature
abounds with studies dealing with aspects of these supply responses.

But these human capital studies have not explained the secular
increases in the demand for these quality attributes of human agents. The
clue to this unresolved puzzle is concealed in two basic factual issues. The
first of these is that diminishing returns to capital have not occurred
generally, despite the vast accumulation of capital in the advanced
economies. The second is the relatively high rate at which the formation
of human capital has occurred. Of the two, the first is fundamental, and
the resolution of it provides a solution for the second. In my thinking, the
key to both is in that part of the economic process that increases the stock
of useful knowledge.⁴ It is the acquisition, adoption, and efficient
utilization of this knowledge that have provided the decisive new sources of
investment opportunities that have maintained the growth process and have
kept the returns to capital from diminishing over time. Furthermore,
these additions to the stock of knowledge have been relatively more
favorable in increasing the investment opportunities in the quality
attributes of human agents than in the quality components of material
agents of production. The investment incentives that are revealed by the
inequalities in these investment opportunities, as they occur over time,
are the mainspring in this process.

In an all-inclusive view of these investment opportunities, the
knowledge-producing sector must also be included. It is not a trivial sector in
modern countries, nor is it exogenous. Research is an organized activity
that requires specific, expensive, scarce resources. Although research is
costly, recent studies, most of them devoted to analyzing the rates of
return to investment in organized agricultural research, show very high
social rates of return.

With respect to this investment process, economists could have been 
spared much aimless wandering had they perceived the implications of 

⁴ Simon Kuznets in his Nobel Prize lecture, which appeared in the June 1973 American
Economic Review under the title “Modern Economic Growth: Findings and Reflections”
(Kuznets 1973), also attributes a major role to the additions in knowledge in this context.
He argues that the last two centuries have been periods during which there has occurred
“enormous accumulation in the contribution to the stock of useful knowledge by basic
and applied research” (p. 251).




the concept of capital as Marshall sketched it for us. His predecessors had
formulated the concept of the “state of the productive arts,” and they
then proceeded to develop the core of economic theory under the
assumption that these arts remained constant. It was an ingenious
simplification, and their theory was in general relevant to a wide array of
problems of their day. But industrialization undermined this simplifying
assumption, and Marshall saw it clearly and cogently. In his treatment
of the agents of production, he extended the concept of labor to include
work with our hands and our heads. It should be noted with care that his
concept of capital “consists in great part of knowledge and organization,
and of this some part is private property and the other part is not.
Knowledge is our most powerful engine of production. Organization aids
knowledge. The distinction between public and private property in
knowledge and organization is of great and growing importance: in some
respects of more importance than that between public and private
property in material things.”⁵ In not seeing the implications of Marshall’s
remarkable insights, economists have wandered for years in the wilderness
of capital confined to material goods.

Thus, in a nutshell, the persistent increase in the demand for the
high-quality services of human agents is a function of the additions to the stock
of useful knowledge.⁶ The complexities of the additions to this knowledge
have been much greater in recent, modern economic growth than during
early, relatively simple industrialization. The rate at which the stock of
useful knowledge has increased has also been higher than the rate at
which it grew during the early stages of industrialization.

This approach has broad integrative power in that it provides a unifying
principle for a consistent explanation of the allocation of investment
resources encompassing both human and nonhuman capital as
modernization proceeds. From it we derive important empirical implications that
can be tested against data. It implies that the value of human time
increases relative to the cost of investment resources.⁷ It implies that the
relative share of national income accruing to labor increases over time.⁸
It implies that there is a special premium for the allocative ability of both

⁵ Marshall (1930, bk. 4, chap. 1, pp. 130-39); the italics are mine.

⁶ The argument in support of this summary statement appears in previous works
(T. W. Schultz 1972b, 1973a). It is anticipated in chaps. 1 and 2 in my Investment in
Human Capital: The Role of Education and Research. Chapter 12 treats the “Allocation
of Resources to Research.”

⁷ A simplified approach to this implication is to treat the cost of investment resources
as constant under the assumptions that the “normal” long-term real rate of interest
remains constant and that, from an increasing amount of capital embodied in human
beings, people derive earnings and satisfactions commensurate with the going rate of
interest.

⁸ As the earnings from the increasing stock of human capital rise relative to income
acquired from property assets.




males and females in managing firms⁹ and households and in allocating
their own time, including investments in themselves. It also implies that
as the value of the time of mothers increases, fertility declines.¹⁰ These
implications are derived from the dynamic process, not from a model of
the economy that has arrived at a general equilibrium.

The concept of a general economic equilibrium in this context is
necessary, however, as an analytical guide. It is an assumed economic
state toward which this modernization process tends. Given this state,
there are no inequalities among investment opportunities. The high price
of human time is stable in the sense that it is no longer increasing relative
to other factor service prices. There is no incentive to make additional
investments in human capital or in the knowledge-producing sector, as a
consequence of the completion of the modernization process, and advances
in knowledge no longer augment the productivity of human time within
firms and households; presumably, virtually all of the value added in
production is contributed by the input of human time. The basic economic
constraint that determines the upper limits of modernization (economic
growth) in this equilibrium model is the increasing scarcity of human time
for consumption. The underlying logic can be put very simply:
modernization increases the consumption stream; consumption requires human
time; advances in knowledge, whether they are embodied in material
capital or in human capital, are severely limited in the extent to which
they can alleviate the scarcity of human time for consumption.

For the purpose at hand, the bearing and rearing of children is a very
labor-intensive activity; the satisfactions that parents derive from their
children are a large part of their “standard of living”; and the process of
enjoying these satisfactions requires much time, the economic value of
which, in this context, is very high.

II

Turning to fertility behavior in the low-income countries, the household
model as it now stands has not been developed to treat the particular
classes of circumstances that constrain the household in these countries.
These are countries in which illiteracy abounds, human time is cheap,
and the income opportunities that women have outside the home are
mainly not jobs in the labor market. Furthermore, infant mortality is high,
life expectancy at birth is low, debilitation during the adult years is


⁹ As a consequence of the dynamics of the economy, the premium for allocative ability,
distinguished from the ability of people to do useful work, is substantial (see Welch 1970;
Fane 1972; Huffman 1972; Khaldi 1972).

¹⁰ Nerlove's subtle and incisive treatment of the interactions between the high value of
time and fertility lays the groundwork for new thinking with respect to this important
implication.




substantial for reasons of inadequate nutrition and endemic diseases, and
the availability of modern contraceptive techniques, including
information about them, is, in general, wanting. These classes of circumstances
are not as yet at home in the household model.

The difficulty here is not that economic theory is pointless in explaining
fertility behavior in the low-income countries. On the contrary, in
principle basic economic thinking is fully as applicable to the poor as it is
to the rich countries. As a case in point, I have long argued (T. W. Schultz
1964) that the theory of the firm is analytically as powerful in the
allocation of resources of poor, small, illiterate farmers in the less-developed
countries as it is in determining the allocative efficiency of farmers, say,
in Iowa. The usefulness of this theory is now widely recognized because
of many recent successful applications. The same argument holds for a
fully developed theory of the household. As yet, however, the part of this
theory that has been applied to the United States, Israel, and Japan is a
special and narrow part of what I envisage as a general theory of the
household. Once the additional parts of this theory have been formulated,
its usefulness in analyzing household activities in low-income countries
will not be in doubt, assuming that it is also extended to treat the effects
of economy-wide dynamic development.

Households in low-income countries perform, in fact, a substantially
larger economic role than they do in high-income countries. The value of
home production is not only large relative to the total family income, it is
also produced predominantly by family labor and only in small part by
purchased inputs, because in low-income countries the purchased material
goods that households can acquire are very high in price relative to the
economic value of the time of members of the household. With regard to
the costs of children, children are labor-intensive during their infancy
regardless of the country’s level of income. In low-income countries,
however, the mother’s time is cheap in a context where health services,
nutrition, and education for the children are dear. The satisfactions and
producer services that parents derive from their children are in large
measure from an assured number of children to provide help for household
work and for family endeavors consisting mostly of farm work and to
provide food and shelter for the parents during their old age and only in
small measure from human capital that enhances the acquired quality of
children. These particular economic constraints on the household are not
treated in the part of the theory that has been applied in these fertility
studies restricted to rich countries.

As an overview, the interactions between the changes in the economy
and the economic role of the household including fertility are probably
more important in most of these developing countries than in the advanced
countries. With respect to which of the two concepts of a population
equilibrium is the more relevant in guiding our economic thinking in




explaining the fertility behavior in the low-income countries, my view
is that it is the second concept. My reason for opting for this concept is
that the now-popular doomsday literature is not a valid characterization
of the direction toward which the economy of such countries is moving.
The crude Malthusian view of fertility does not apply; these countries are
not headed toward a population equilibrium that is being imposed on
them by diminishing returns from natural resources. Per capita income is
in general not falling in these countries. Measured in terms of living
conditions, there are appreciable gains, as is evident from improvements
in health and from longer life expectancy. Moreover, birth rates are
falling and substantially so in an increasing number of low-income
countries. Although they are far from arriving at the second type of
population equilibrium, they are moving toward it, and it is therefore
analytically the relevant concept.

Earlier (1973c), I expressed my concern that our estimates of the price
and income effects on fertility may be subject to considerable error for
reasons of changes in the economy over time, changes that are not taken
into account in these studies. The changes that are taking place in the
low-income countries appear to me to be even more important than those
in the developed countries, and if this is true, it will be imperative to
extend our economic approaches in ways that will make these changes
an integral part of the theory and its application. The most pervasive
change is in the improvement of health. Estimates of life expectancy from
time of birth are rising at a high rate relative to the further rise in the rich
countries. The decline in infant mortality must have important fertility
implications. Along with somewhat better nutrition, the marked decline
in debilitating diseases during adulthood must have some price and income
effects on household activities, including fertility. The fact that in some
parts of poor countries, in the Punjab of India, for example, many girls
are now enrolled in schools is an important change in this context. Then,
too, there is some progress in increasing the supply of information about
modern contraceptive techniques and in subsidizing the supply of
contraceptives. While all of these changes are occurring, the quality of the
labor force is slowly rising as is the value of human time.

One measure of the fruitfulness of the 1972 and 1973 conferences on
the economics of fertility is in the agenda of unfinished research that the
papers, comments, and discussions produced. There is a couplet by Robert
Frost bearing on research. Let me paraphrase it:

We sit around the circle and suppose 

The Secret sits in the middle and knows 



A Theory of Marriage: Part II 


Gary S. Becker

University of Chicago and National Bureau of Economic Research


1. Introduction

In “A Theory of Marriage: Part I,” published in the July/August 1973
Journal of Political Economy, I presented an analysis of the “marriage
market.” This paper extends the analysis in that paper to include caring
between mates, polygamous marital arrangements, genetic selection
related to assortive mating, and separation, divorce, and remarriage. Its
purpose is both to enrich the discussion in Part I and to show the power
of this approach in handling different kinds of marital behavior.

In Part I (1973) I offered a simplified model of marriage that relies
on two basic assumptions: (1) each person tries to find a mate who
maximizes his or her well-being, with well-being measured by the
consumption of household-produced commodities; and (2) the “marriage
market” is assumed to be in equilibrium, in the sense that no person could
change mates and become better off. I argued that the gain from marriage
compared to remaining single for any two persons is positively related to
their incomes, the relative difference in their wage rates, and the level of
nonmarket-productivity-augmenting variables, such as education or
beauty.

The optimal association between mates with respect to different traits,
such as ability, education, race, income, and height, was analyzed. I
showed that positive associations, matings of likes, are usually optimal,
although with respect to some traits mating of unlikes is optimal, for
example, with wage rates. The division of the total “output” produced

I am indebted for helpful comments to Isaac Ehrlich, Robert T. Michael, Richard
Posner, T. W. Schultz, George J. Stigler, and members of the Workshop in Applications
of Economics at the University of Chicago. This research has been supported by a grant
from the Ford Foundation to the National Bureau of Economic Research for the study
of the economics of population. This paper is not an official NBER publication, since it
has not been reviewed by the NBER Board of Directors.



by mates is not assumed to be given a priori, but is determined by the
nature of the marriage-market equilibrium.

The simplified analyses in Part I have now been extended in several
directions. The effect of “love” and caring between mates on the nature
of equilibrium in the marriage market is considered. Polygamy is
discussed, and especially the relation between its incidence and the degree
of inequality among men and the inequality in the number of men and
women. The implications of different sorting patterns for inequality in
family resources and genetic natural selection are explored. The
assumption of complete information about all potential mates is dropped and I
consider the search for information through dating, coeducational
schools, “trial” marriages, and other ways. This search is put in a
life-cycle context that includes marriage, having children, sometimes
separation and divorce, remarriage, and so forth.

2. Love, Caring, and Marriage 

In Part I, I ignored “love,” that cause of marriage glorified in the
American culture. At an abstract level, love and other emotional attachments,
such as sexual activity or frequent close contact with a particular person,
can be considered particular nonmarketable household commodities, and
nothing much need be added to the analysis, in the earlier paper, of the
demand for commodities. That is, if an important set of commodities
produced by households results from “love,” the sorting of mates that
maximizes total commodity output over all marriages is partly
determined by the sorting that maximizes the output of these commodities.
The whole discussion in Part I (1973) would continue to be relevant.

There is a considerable literature on the effect of different variables,
such as personality, physical appearance, education, or intelligence, on
the likelihood of different persons loving each other. Since I do not have
anything to add to the explanation of whether or why one person would
love another, my discussion concentrates on some effects of love on
marriage. In particular, since loving someone usually involves caring
about what happens to him or her,¹ I concentrate on working out several
implications, for marriage, of “caring.”

An inclusive measure of “what happens” is given by the level of
commodity consumption, and the natural way for an economist to measure
“caring” is through the utility function.² That is, if M cares about F,
M’s utility would depend on the commodity consumption of F as well as
on his own; graphically, M’s indifference curves in figure 1 are negatively


¹ The Random House Dictionary of the English Language includes in its definitions of love,
“affectionate concern for the well-being of others,” and “the profoundly tender or
passionate affection for a person of the opposite sex.”

² This formulation is taken from my paper, “A Theory of Social Interactions” (1969).






Fig. 1


inclined with respect to $Z_m$ and $Z_f$, the commodities consumed by M and
F respectively.³ If M cared as much about F as about himself (I call
this “full” caring), the slopes of all the indifference curves would equal
unity (in absolute value) along the 45° line;⁴ if he cared more about
himself, the slopes would exceed unity, and conversely if he cared more
about F.

Point c in figure 1 represents the allocation of commodities to M and F
that is determined by equilibrium in the marriage market. Only if M
were married to F could he transfer commodities to F, since household
commodities are transferable within but not between households. If the
terms of transfer are measured by the line AB, he moves along AB to
point e: he transfers cd and F receives de. Presumably commodities can be
transferred within a household without loss, so that AB would have a slope
of unity. Then the equilibrium position after the transfer would be on the


³ Since there is only a single aggregate commodity, saying that M’s utility depends
on F’s consumption is equivalent to saying that M’s utility depends on F’s utility
(assuming that F does not care about M). If many commodities $Z_1, \ldots, Z_n$ were
consumed, M’s utility would depend on F’s utility if
$U^m = U^m[Z_{1m}, \ldots, Z_{nm}, g(Z_{1f}, \ldots, Z_{nf})]$,
where g describes the indifference surface of F. Hence
$(\partial U^m/\partial Z_{if})/(\partial U^m/\partial Z_{jf}) = (\partial g/\partial Z_{if})/(\partial g/\partial Z_{jf})$;
this ratio is F’s marginal rate of substitution between $Z_i$ and $Z_j$.

⁴ Full caring might also imply that the indifference curves were straight lines with
a slope of unity, that $Z_f$ was a perfect substitute for $Z_m$.




45° line with full caring, and to the right of this line if M preferred his
own consumption to F’s.
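The transfer along AB described above can be illustrated with a minimal sketch. It assumes, purely for illustration, that M's indifference curves come from a Cobb-Douglas utility function with a hypothetical weight `weight_own` on his own consumption; neither this functional form nor the names below appear in the paper. With a weight of one-half the indifference curves have unit slope along the 45° line, which corresponds to "full" caring.

```python
def post_transfer_allocation(total_output, market_share_m, weight_own):
    """Return (Z_m, Z_f) after any voluntary transfer from M to F.

    Illustrative assumptions (not from the paper):
      * M's utility is U = Z_m**weight_own * Z_f**(1 - weight_own).
      * The transfer line AB has a slope of unity, so Z_m + Z_f = total_output.
      * M may give commodities to F but cannot take any from her.
    """
    # Tangency of M's indifference curve with AB (point e in figure 1).
    desired_m = weight_own * total_output
    # M transfers only if the market allocation (point c) gives him more
    # than he wants to keep; otherwise the market allocation stands.
    z_m = min(market_share_m, desired_m)
    return z_m, total_output - z_m


# Full caring: the final allocation lies on the 45-degree line.
print(post_transfer_allocation(8, 7, 0.5))
# M prefers his own consumption: the allocation lies to the right of that line.
print(post_transfer_allocation(8, 7, 0.75))
# Market allocation already gives F more than M would choose: no transfer.
print(post_transfer_allocation(8, 3, 0.5))
```

Under these assumptions the three calls give equal shares, a split favoring M, and an unchanged market allocation, respectively.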

Most people no doubt find the concept of a market allocation of
commodities to beloved mates strange and unrealistic. And, as we have seen,
caring can strikingly modify the market allocation between married
persons. For example, the final allocation (point e) after the transfer from
M to F has more equal shares than does the market allocation (point c).⁵
Moreover, if F also cared about M, she would modify the market allocation
by transferring resources to M from anywhere in the interval Ae' until she
reached a point generally to the left of e. The market completely
determines the division of output only in the interval e'e: positions in Be
are modified to e, and those in Ae' are modified to e'. Furthermore, if each
fully cares for the other, points e and e' are identical and on the 45° line.
Then the total amount produced by M and F would be shared equally,
regardless of the market-determined division. This concept of caring
between married persons, therefore, does imply sharing (equal sharing
when the caring is full and mutual) and is thus consistent with the
popular belief that persons in love “share.”

Sharing implies that changes in the sex ratio or other variables
considered in section 4 of Part I (1973) would not modify the actual
distribution of output between married M and F (unless the market-mandated
distribution were in the interval e'e). This is another empirical implication
of caring that can be used to determine its importance.

I indicated in the earlier paper that total income would be less than
total output in a marriage if resources were spent “policing” the
market-mandated division of output, whereas total income would exceed total
output if some output were a “family” commodity, that is, were consumed
by both mates. Caring raises total income relative to total output both by
reducing policing costs and by increasing the importance of family
commodities.

Consider first the effect of caring on policing costs. “Policing” reduces
the probability that a mate shirks duties or appropriates more output than
is mandated by the equilibrium in the marriage market.⁷ Caring reduces
the need for policing. M’s incentive to “steal” from his mate F is weaker
if M cares about F because a reduction in F’s consumption also lowers M’s
utility. Indeed, caring often completely eliminates the incentive to
“steal” and thus the need to police. Thus, at point e in the figure, M
has no incentive to “steal” from F because a movement to the right
along AB would lower M’s utility.⁸ Therefore, if M cares about F
sufficiently to transfer commodities to her, F would not need to “police”
M’s consumption.⁹ Consequently, marriages with caring would have
fewer resources spent on “policing” (via allowances or separate
checking accounts?) than other marriages would.

⁵ Provided it were in the interval Ae, M would not modify the market allocation.

⁶ We assume that AB also gives the terms of transfer for F, and that e' is the point of
tangency between AB and her indifference curves.

⁷ Policing is necessary in any partnership or corporation, or, more generally, in any
cooperative activity (see Becker 1971, pp. 122-23; Alchian and Demsetz 1972).

⁸ A fortiori, a movement along any steeper line (the difference between AB and this
line measuring the resources used up in “stealing”) would also lower M’s utility.


M’s income at e exceeds his own consumption because of the utility he
gets from F’s consumption. Indeed, his income is the sum of his and F’s
consumption, and equals OB (or OA), the output produced by M and F.
Similarly, F’s income exceeds her own consumption if she benefits from
M’s consumption. Caring makes family income greater than family
output because some output is jointly consumed. At point e, all of F’s
and part of M’s consumption would be jointly consumed. Since both e
and e' are on the 45° line with mutual and full caring, the combined
incomes of M and F would then be double their combined output: all of
M’s and all of F’s consumption would be jointly consumed.
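The income accounting in this paragraph can be sketched in a few lines. The function and its two weight parameters are hypothetical: each weight stands in for the fraction of the mate's consumption that counts toward a person's own income, which the paper ties to the slope of that person's indifference curve. A weight of 1 corresponds to the tangency with a unit-slope transfer line, and a weight of 0 to no caring.

```python
def incomes_with_caring(z_m, z_f, m_weight_on_f, f_weight_on_m):
    """Return (income of M, income of F) given consumption levels.

    Illustrative assumption: each person's income is own consumption plus
    a weighted share of the mate's consumption, where the weights are
    hypothetical stand-ins for the slopes of the indifference curves.
    """
    income_m = z_m + m_weight_on_f * z_f
    income_f = z_f + f_weight_on_m * z_m
    return income_m, income_f


# Only M cares, and he is at point e: his income equals total output.
print(incomes_with_caring(6, 2, 1, 0))
# Mutual and full caring on the 45-degree line: combined income is
# double the combined output of 8.
print(incomes_with_caring(4, 4, 1, 1))
```

With an output of 8 divided equally, each mate's income is 8 under mutual and full caring, so combined income is 16, twice the output.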

Love and caring between two persons increase their chances of being
married to each other in the optimal sorting. That love and caring cannot
reduce these chances can be seen by assuming that they would be married
to each other in the optimal sorting even if they did not love and care for
each other. Then they must also be married to each other in the optimal
sorting if they do love and care for each other because love raises
commodity output and caring raises their total income by making part of their
output a “family” commodity. Hence, their incomes when there is love
and caring exceed their incomes when there is not. Consider the following
matrix of outputs:


\[
\begin{array}{c|cc}
      & F_1 & F_2 \\ \hline
M_1 & \underset{(3,\,5)}{8} & 4 \\
M_2 & 9 & \underset{(5,\,2)}{7}
\end{array}
\tag{1}
\]

With no caring, this is also the matrix of total incomes,¹¹ and $M_1F_1$ and
$M_2F_2$ would be the optimal sorting if incomes were sufficiently divisible
to obtain, say, the division given in parentheses. With mutual and full
caring between $M_1$ and $F_1$, $m'_{11}$, the income of $M_1$, would equal 8 > 3,


⁹ With mutual and full caring, neither mate would have to “police.” On the other
hand, if each cared more about the other than about himself (or herself), at least one of
them, say M, would want to transfer resources that would not be accepted. Then F
would “police” to prevent undesired transfers from M. This illustrates a rather general
principle, namely, that when the degree of caring becomes sufficiently great, behavior
becomes similar to that when there is no caring.

¹⁰ F’s income equals the sum of her consumption and a fraction of M’s consumption
that is determined by the slope of F’s indifference curve at point c. See the formulation in
section 1 of the Mathematical Appendix.

¹¹ We abstract from other kinds of “family” commodities because they can be analyzed
in exactly the same way that caring is.




and $f'_{11}$, the income of $F_1$, would equal 8 > 5;¹² clearly, $M_1$ would still
be married to $F_1$ in the optimal sorting.
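The claim about matrix (1) can be checked with a brute-force sketch. The helper names are hypothetical, and the blocking test assumes the usual condition that a sorting is upset when some pair not married to each other could produce more together than the sum of their current incomes; the output figures and the divisions (3, 5) and (5, 2) are those given in the text.

```python
from itertools import permutations

# Matrix (1): rows are M1, M2; columns are F1, F2.
OUTPUT = [[8, 4],
          [9, 7]]


def optimal_sorting(output):
    """Return (assignment, total output) for the one-to-one sorting that
    maximizes total output; assignment[i] is the column married to row i."""
    n = len(output)

    def total(assignment):
        return sum(output[i][assignment[i]] for i in range(n))

    best = max(permutations(range(n)), key=total)
    return best, total(best)


def is_blocked(output, assignment, m_income, f_income):
    """True if some pair not married to each other could produce more
    than the sum of their current incomes (assumed blocking condition)."""
    n = len(output)
    return any(output[i][j] > m_income[i] + f_income[j]
               for i in range(n) for j in range(n) if assignment[i] != j)


assignment, total_output = optimal_sorting(OUTPUT)
print(assignment, total_output)

# Division in parentheses: M1 gets 3 and F1 gets 5; M2 gets 5 and F2 gets 2.
print(is_blocked(OUTPUT, assignment, [3, 5], [5, 2]))
# With mutual and full caring between M1 and F1, each has an income of 8.
print(is_blocked(OUTPUT, assignment, [8, 5], [8, 2]))
```

The diagonal sorting yields a total output of 15 against 13 for the alternative, and under the stated assumption neither set of incomes is blocked, so caring leaves $M_1$ and $F_1$ married to each other.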

That love and caring can bring a couple into the optimal sorting is
shown by the following matrix of outputs:


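\[
\begin{array}{c|ccc}
      & F_1 & F_2 & F_3 \\ \hline
M_1 & \text{[illegible]} & 6 & 5 \\
M_2 & \text{[illegible]} & \underset{(6,\,4)}{10} & 4 \\
M_3 & \text{[illegible]} & 3 & \underset{(5,\,5)}{10}
\end{array}
\tag{2}
\]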


Without love and caring the optimal sorting is $M_1F_1$, $M_2F_2$, and $M_3F_3$,
with a set of optimal incomes given in parentheses. If, however, $M_1$ and
$F_2$ were in love and had mutual and full caring, the optimal sorting would
become $M_1F_2$, $M_2F_1$, and $M_3F_3$ because the incomes resulting from this
sorting, $m_{12} = f_{12} = k > 6$,¹³ and, say, $m_{21} = f_{21} = 4\tfrac{1}{2}$, and
$m_{33} = f_{33} = 5$, can block the sorting along the diagonal.

Does caring per se (that is, as distinguished from love) encourage
marriage? For example, couldn’t $M_1$ marry $F_1$ even though he receives
utility from $F_2$’s consumption, and even if he wants to transfer resources
to $F_2$? One incentive to combine marriage and caring is that resources
are more cheaply transferred within households: by assumption, commodities
cannot be transferred between households, and goods and time
presumably also are more readily transferred within households. Moreover,
caring partly results from living together,¹⁴ and some couples marry
partly because they anticipate the effect of living together on their caring.

Since, therefore, caring does encourage (and is encouraged by) marriage,
there is a justification for the economist’s usual assumption that even
a multiperson household has a single well-ordered preference function.
For, if one member of a household (the “head”) cares enough about all
other members to transfer resources to them, this household would act
as if it maximized the “head’s” preference function, even if the preferences
of other members are quite different.¹⁵

Output is generally less divisible between mates in marriages with
caring than in other marriages¹⁶ because caring makes some output a

¹² The output of love raises these incomes even further.

¹³ The difference between k and 6 measures the output of love produced by $M_1$
and $F_2$.

¹⁴ So does negative caring or “hatred.” A significant fraction of all murders and
assaults involve members of the same household (see Ehrlich 1970).

¹⁵ For a proof, see section 1 of the Appendix; further discussions can be found in
Becker (1969).

¹⁶ See the proof in section 2 of the Appendix.




family commodity, which cannot be divided between mates. One implication
of this is that marriages with caring are less likely to be part of the
optimal sorting than marriages without caring that have the same total
income (and thus have a greater total output).¹⁷

Another implication is that the optimal sorting of different traits can
be significantly affected by caring, even if the degree of caring and the
value of a trait are unrelated. Part I (1973) shows that when the division
of output is so restricted that each mate receives a given fraction of the
output of his or her marriage, beneficial traits are always strongly positively
correlated in the optimal sorting. A negative correlation, on the other
hand, is sometimes optimal when output is fully divisible. Caring could
convert what would be an optimal negative correlation into an optimal
positive one because of the restrictions it imposes on the division of
output.

For example, assume that a group of men and women differ only in
wage rates, and that each potential marriage has mutual and full caring,
so that the degree of caring is in this case uncorrelated with the level of
wage rates; then the optimal correlation between wage rates would be
positive, although I showed in Part I (1973) that it is negative when there
is no caring.¹⁸ The (small amount of) evidence in that paper indicating
that wage rates are negatively correlated suggests, therefore, that caring
does not completely determine the choice of marriage mates.
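
The reversal from negative to positive sorting can be illustrated with the two-by-two output matrix used in the footnote to this example (outputs 5, 10, 12, 15). The sketch below is hypothetical and rests on stated assumptions: the second row and column are taken to be the higher-wage man and woman, the function names are illustrative, and "full caring" is modeled by giving each mate an income equal to the whole, indivisible output of his or her marriage.

```python
from itertools import permutations

# Outputs by wage combination: rows are the lower- and higher-wage man,
# columns the lower- and higher-wage woman (labels are an assumption).
OUTPUT = [[5, 10],
          [12, 15]]

def max_output_sorting(output):
    """Sorting that maximizes total output (the relevant criterion
    when output is fully divisible between mates)."""
    n = len(output)
    return max(permutations(range(n)),
               key=lambda p: sum(output[i][p[i]] for i in range(n)))

def stable_with_full_caring(output):
    """Sortings with no blocking pair when each mate's income equals the
    indivisible output of his or her own marriage."""
    n = len(output)
    stable = []
    for p in permutations(range(n)):
        husband_of = {p[i]: i for i in range(n)}
        blocked = any(
            output[i][j] > output[i][p[i]]               # man i gains
            and output[i][j] > output[husband_of[j]][j]  # woman j gains
            for i in range(n) for j in range(n) if p[i] != j)
        if not blocked:
            stable.append(p)
    return stable

divisible = max_output_sorting(OUTPUT)   # man i marries woman divisible[i]
caring = stable_with_full_caring(OUTPUT)
print(divisible, caring)
```

With divisible output the off-diagonal (negative) sorting maximizes total output, whereas with full caring only the diagonal (positive) sorting survives, because the higher-wage man and woman each prefer 15 to what they would receive with a lower-wage mate.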


3. Polygamy 

Although monogamous unions predominate in the world today, some
societies still practice polygamy, and it was common at one time. What
determines the incidence of polygamous unions in societies that permit
them, and why have they declined in importance over time?

I argued in Part I (1973) that polyandrists (women with several
husbands) have been much less common than polygynists (men with
several wives) because the father’s identity is doubtful under polyandry.
Todas of India did practice polyandry, but their ratio of men to women


¹⁷ See the example discussed in section 2 of the Appendix.

¹⁸ As an example, let the matrix of outputs from different combinations of wage rates be

\[
\begin{array}{c|cc}
 & F_1 & F_2 \\ \hline
M_1 & \underset{(5,\,5)}{5} & \underset{(10,\,10)}{10} \\
M_2 & \underset{(12,\,12)}{12} & \underset{(15,\,15)}{15}
\end{array}
\]

If outputs were fully divisible, the optimal sorting would be $M_1F_2$ and $M_2F_1$, since
that maximizes the combined output over all marriages. With mutual and full caring in
all marriages, the income of each mate equals the output in his or her marriage; these
incomes are given in parentheses. Clearly, the optimal sorting would now be $M_1F_1$
and $M_2F_2$.




was much above one, largely due to female infanticide.¹⁹ They mitigated
the effects of uncertainty about the father by usually having brothers (or
other close relatives) marry the same woman.

I showed in Part I (1973) that if all men and all women were identical,
if the number of men equaled the number of women, and if there were
diminishing returns from adding an additional spouse to a household,
then a monogamous sorting would be optimal, and therefore would
maximize the total output of commodities over all marriages.²⁰ If the
plausible assumption of diminishing returns is maintained, inequality in
various traits among men or in the number of men and women would be
needed to explain polygyny.

An excess of women over men has often encouraged the spread of
polygyny, with the most obvious examples resulting from wartime deaths
of men. Thus, almost all the male population in Paraguay were killed
during a war with Argentina, Brazil, and Uruguay in the nineteenth
century,²¹ and apparently polygyny spread afterward.

Yet polygyny has occurred even without an excess of women; indeed,
the Mormons practiced polygyny on a sizable scale with a slight excess of
men.²² Then inequality among men is crucial.
If the “productivity” of men differs, a polygynous sorting could be
optimal, even with constant returns to scale and an equal number of men
and women. Total output over all marriages could be greater if a second
wife to an able man added more to output than she would add as a first
wife to a less able one. Diminishing marginal products of men or women
within each household do not rule out that a woman could have a higher
marginal product as a second wife in a more productive household than as
the sole wife in a less productive household.

Consider, for example, two identical women who would produce 5
units of output if single, and two different men who would each produce
8 and 15 units, respectively, if single. Let the married outputs be 14
and 27 when each man has one wife, and 18 and 35 when each has two.²³
Clearly, total output is greater if the abler man takes two wives and the

¹⁹ See Rivers (1906). Whether the infanticide caused polyandry, or the reverse, is not
clear.

²⁰ An optimal sorting has the property that persons not married to each other could
not, by marrying, make some better off without making others worse off. I show in Part I
(1973) that an optimal sorting maximizes total output of commodities.

²¹ After the war, males were only 13 percent of the total population of Paraguay (see
Encyclopaedia Britannica, 1973 ed., s.v. “Paraguay”). I owe this reference to T. W. Schultz.

²² See Young (1954, p. 124). The number of women can exceed the number of
men, even with an equal number at each age, if women marry earlier than men and if
widowed women remarry. The number of women married at any time would exceed the
number of men married because women would be married longer (to different men;
they would be sequentially polyandrous!). This apparently was important in Sub-Saharan
Africa, where polygyny was common (see Dorjahn 1959).

²³ These numbers imply diminishing marginal products, since 18 − 14 = 4 < 6,
and 35 − 27 = 8 < 12.




other remains single than if they both take one wife: 35 + 8 = 43 > 14 +
27 = 41. If the abler man received, say, 21 units and each wife received,
say, 7 units, no one would have any incentive to change mates.
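
The arithmetic of this example can be reproduced with a short enumeration. The sketch below is illustrative only: the dictionary layout and names are hypothetical, and the list of deviations examined is a small, non-exhaustive selection. It computes total output for every feasible allocation of the two women and then checks that, at the suggested division (21 to the abler man, 7 to each wife, and the single less able man keeping his own output of 8), several natural deviations would produce less than the participants already receive.

```python
from itertools import product

SINGLE_WOMAN = 5
# Output of each man's household with 0, 1, or 2 wives (numbers from the text).
HOUSEHOLD = {"less_able": (8, 14, 18), "abler": (15, 27, 35)}

def total_output(wives_less_able, wives_abler, n_women=2):
    """Total output when the two men have the given numbers of wives
    and any unmarried women remain single."""
    single_women = n_women - wives_less_able - wives_abler
    return (HOUSEHOLD["less_able"][wives_less_able]
            + HOUSEHOLD["abler"][wives_abler]
            + single_women * SINGLE_WOMAN)

# Enumerate every feasible allocation of the two women.
allocations = {(a, b): total_output(a, b)
               for a, b in product(range(3), repeat=2) if a + b <= 2}
best = max(allocations, key=allocations.get)

# Division suggested in the text; the less able man stays single.
income = {"abler": 21, "wife": 7, "less_able": 8}
# Output a deviating group could produce minus what its members now receive.
deviation_gain = {
    "wife marries less able man":
        HOUSEHOLD["less_able"][1] - (income["wife"] + income["less_able"]),
    "abler man keeps only one wife":
        HOUSEHOLD["abler"][1] - (income["abler"] + income["wife"]),
    "wife becomes single": SINGLE_WOMAN - income["wife"],
    "abler man becomes single": HOUSEHOLD["abler"][0] - income["abler"],
}
print(best, allocations[best], allocations[(1, 1)], deviation_gain)
```

Every entry of `deviation_gain` is negative, which is the sense in which no one has an incentive to change mates at this division.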

Our analysis implies generally that polygyny would be more frequent
among more productive men (such as those with large farms, high
positions, and great strength), an implication strongly supported by the
evidence on polygyny. For example, only about 10-20 percent of the
Mormons had more than one wife,²⁴ and they were the more successful
and prominent ones. Although 40 percent of the married men in a sample
of the Xavante Indians of Brazil were polygynous, “it was the chief and
the heads of clans who enjoyed the highest degree of polygyny” (Salzano,
Neel, and Maybury-Lewis 1967, p. 473). About 35 percent of the married
men in Sub-Saharan Africa were polygynous (Dorjahn 1959, pp. 98-105),
and they were generally the wealthier men. Fewer than 10 percent of the
married men in Arab countries were polygynous, and they were the more
successful, especially in agriculture (Goode 1963, pp. 101-4).

I do not have a satisfactory explanation of why polygyny has declined
over time in those parts of the world where it was once more common.²⁵
The declines in income inequality and the importance of agriculture
presumably have been partly responsible. Perhaps the sex ratio has
become less favorable, but that seems unlikely, wartime destruction aside.
Perhaps monogamous societies have superior genetic and even cultural
natural selection (see the next section). But since more successful men are
more likely to be polygynous, they are more likely to have relatively many
children.²⁶ If the factors responsible for success are “inherited,” selection
over time toward the “abler” might be stronger in polygynous than in
monogamous societies. I have even heard the argument that Mormons
are unusually successful in the United States because of their polygynous
past! However, if the wives of polygynous males were not as able, on the
average, as the wives of equally able monogamous males, selection could
be less favorable in polygynous societies.

The decline in polygyny is usually “explained” by religious and
legislative strictures against polygyny that are supposedly motivated by a
desire to prevent the exploitation of women. But the laws that prevent
men from taking more than one wife no more benefit women than the

²⁴ Young (1954, p. 441) says that “in some communities it ran as high as 20-25 percent
of the male heads of families,” but Arrington (1958, p. 238) says about 10 percent of all
Mormon families were polygynous.

²⁵ Polygyny was more common in Islamic and African societies than in Western and
Asian ones, although in China and Japan concubines had some of the rights and
obligations of wives (see Goode 1963, chap. 5).

²⁶ Salzano, Neel, and Maybury-Lewis (1967, p. 486) found evidence among the
Xavante Indians of “similar means but significantly greater variance for number of
surviving offspring for males whose reproduction is completed than for similar females.”
This indicates that polygynous males (the more successful ones) have more children than
other males.




laws in South Africa that restrict the ratio of black to white workers (see
Wilson 1972, p. 8) benefit blacks. Surely, laws against polygyny reduce
the “demand” for women, and thereby reduce their share of total
household output and increase the share of men.²⁷


4. Assortive Mating, Inequality, and Natural Selection

I pointed out in Part I (1973) that positive assortive mating of different
traits reduces the variation in these traits between children in the same
family (and this is one benefit of such mating). Positive assortive mating
also, however, increases the inequality in traits, and thus in commodity
income, between families. Note that the effects on inequality in commodity
and money incomes may be very different; indeed, if wage rates,
unlike most other traits, are negatively sorted (as argued in Part I),
assortive mating would reduce the inequality in money earnings and
increase that in commodity income.

Positive sorting of inherited traits, like intelligence, race, or height, also
increases the inequality in these traits among children in different
families, and increases the correlation between the traits of parents and
children (see proofs in Cavalli-Sforza and Bodmer [1971, chap. 9]).
Moreover, positive sorting, even of noninherited traits such as education,
often has the same effect because, for example, educated parents are
effective producers of “education-readiness” in their children (see
Leibowitz [1972] and the papers by her and Benham in this volume).
The result is an increase in the correlation between the commodity
incomes of parents and children, and thereby an increase in the inequality
in commodity income among families spanning several generations. That
is, positive assortive mating has primary responsibility for noncompeting
groups and the general importance of the family in determining economic
and social position that is so relevant for discussions of investment in
human capital and occupational position.

Since positive assortive mating increases aggregate commodity income
over all families, the level of and inequality in commodity income are
affected in different ways. Probably outlawing polygyny has reduced the

²⁷ An alternative interpretation of the religious and legislative strictures against
polygyny is that they are an early and major example of discrimination against women,
of a similar mold to the restrictions on their employment in certain occupations, such as
the priesthood, or on their ownership of property. This hypothesis has been well stated by
(of all people!) George Bernard Shaw: “Polygamy when tried under modern democratic
conditions as by the Mormons, is wrecked by the revolt of the mass of inferior men
who are condemned to celibacy by it; for the maternal instinct leads a woman to prefer
a tenth share in a first rate man to the exclusive possession of a third rate.” See his
“Maxims for Revolutionists” appended to Man and Superman (Shaw 1930, p. 220).
Shaw was preoccupied with celibacy; he has three other maxims on celibacy, one being
“any marriage system which condemns a majority of the population to celibacy will be
violently wrecked on the pretext that it outrages morality” (1930, p. 220).




inequality in commodity income among men at the price of reducing
aggregate commodity income. Perhaps other restrictions on mating
patterns that reduce inequality would be tolerated, but that does not
seem likely at present.

Since positive assortive mating increases the between-family variance,
it increases the potential for genetic natural selection, by a well-known
theorem in population genetics.²⁸ The actual amount of selection depends
also on the inheritability of traits, and the relation between the levels of
the traits of mates and the number of their surviving children (called
“fitness” by geneticists). For example, given the degree of inheritability
of intelligence, and a positive (or negative) relation between number of
children and average intelligence of parents, the rate of increase (or
decrease) per generation in the average intelligence of a population would
be directly related to the degree of positive assortive mating by
intelligence.

Moreover, the degree of assortive mating is not independent of
inheritability or of the relation between number of children and parental
traits. For example, the “cost” of higher-“quality” children may be lower
to more-intelligent parents, and this affects the number (as well as
quality) of children desired.²⁹ In a subsequent paper I expect to treat
more systematically the interaction between the degree of assortive mating
and other determinants of the direction and rate of genetic selection.


5. Life-Cycle Marital Patterns

Life-cycle dimensions of marital decisions (for instance, when to marry,
how long to stay married, when to remarry if divorced or widowed, or
how long to stay remarried) have received little attention in my earlier
paper or thus far in this one. These are intriguing but difficult questions,
and only the broad strokes of an analysis can be sketched at this time.
A separate paper in the not-too-distant future will develop a more
detailed empirical as well as theoretical analysis.

A convenient, if artificial, way to categorize the decision to marry is to
say that a person first decides when to enter the marriage market and then
searches for an appropriate mate.³⁰ The age of entry would be earlier

²⁸ This theorem was proved by Fisher (1958, pp. 37-38) and called “the fundamental
theorem of natural selection.” For a more recent and extensive discussion, see Cavalli-Sforza
and Bodmer (1971, sec. 6.7).

²⁹ For a discussion of the interaction between the quantity and quality of children, see
Becker and Lewis (1973).

³⁰ This categorization is made in an important paper by Coale and McNeil, “The
Distribution by Age of the Frequency of First Marriage in a Female Cohort” (1973).
They show that the frequency distribution of the age at first marriage can be closely fitted
in a variety of environments by the convolution of a normal distribution and two or three
exponential distributions. The normal distribution is said to represent the distribution of
age at entry into the marriage market, and the exponential distributions, the time it takes
to find a mate.





the larger the number of children desired, the higher the expected
lifetime income, and the lower the level of education.³¹

Once in the marriage market, a person searches for a mate along the
lines specified in the now rather extensive search literature.³² That is,
he searches until the value to him of any expected improvement in the
mate he can find is no greater than the cost of his time and other inputs
into additional search. Some determinants of benefits and costs are of
special interest in the context of the marriage market.

Search will be longer the greater the benefits expected from additional
search. Since benefits will be greater the longer the expected duration of
marriage, people will search more carefully and marry later when they
expect to be married longer, for example, when divorce is more difficult
or adult death rates are lower. Search may take the form of trial living
together, consensual unions, or simply prolonged dating. Consequently,
when divorce becomes easier, the fraction of persons legally married may
actually increase because of the effect on the age at marriage. Indeed, in
Latin America, where divorce is usually impossible, a relatively small
fraction of the adult population is legally married because consensual
unions are so important (see Kogut 1972), and, in the United States, a
smaller fraction of women have been married in those states having
more-difficult divorce laws (see Freiden [1972] and his paper in this volume).³³

Search would also be longer the more variable potential mates were
because then the expected gain from additional “sampling” would be
greater. Hence, other determinants being the same, marriage should
generally be later in dynamic, mobile, and diversified societies than in
static, homogeneous ones.
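
The stopping rule and the effect of variability can be made concrete with a minimal sketch from the standard sequential-search literature. Everything specific here is an assumption introduced for illustration and is not taken from this paper: the "value" of each potential mate is an independent draw from a uniform distribution on [low, high], each additional draw costs a constant amount `cost`, and there is no discounting. Under these assumptions a searcher stops at the first candidate whose value reaches a reservation value R, defined by equality between the expected improvement from one more draw and its cost.

```python
import math

def reservation_value(cost, low=0.0, high=1.0):
    """Value R at which the expected improvement from one more draw,
    E[max(x - R, 0)] = (high - R)**2 / (2 * (high - low)), equals `cost`."""
    width = high - low
    if not 0.0 < cost <= width / 2.0:
        raise ValueError("cost must lie in (0, (high - low) / 2]")
    return high - math.sqrt(2.0 * cost * width)

def expected_draws(cost, low=0.0, high=1.0):
    """Expected number of candidates sampled until one reaches R."""
    r = reservation_value(cost, low, high)
    return (high - low) / (high - r)

# Cheaper search: higher reservation value and longer expected search.
for cost in (0.005, 0.02, 0.08):
    print(cost, round(reservation_value(cost), 3),
          round(expected_draws(cost), 2))

# More variable candidates at the same cost: longer expected search.
print(round(expected_draws(0.02, low=-0.5, high=1.5), 2))
```

In this sketch the expected number of draws equals the square root of (high − low) / (2 × cost), so it rises both when search is cheaper and when potential mates are more variable, which is the direction of the two effects described above.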

People marry relatively early when they are lucky in their search. They
also marry early, however, when they are unduly pessimistic about their
prospects of attracting someone better (or unduly optimistic about
persons they have already met). Therefore, early marriages contain both
lucky and pessimistic persons, while later marriages contain unlucky and
optimistic ones.

The cost of search differs greatly for different traits: the education,
income, intelligence, family background, perhaps even the health of
persons can be ascertained relatively easily, but their ambition, resiliency
under pressure, or potential for growth are ascertained with much greater
difficulty.³⁴ The optimal allocation of search expenditures implies that
marital decisions would be based on fuller information about more-easily
searched traits than about more-difficult-to-search traits. Presumably,


³¹ For a theoretical and empirical study of these and other variables, see Keeley (1973).
³² The pioneering paper is by Stigler (1961). For more recent developments, see McCall
(1970) and Mortensen (1970).

³³ These results are net of differences in income, relative wages, and the sex ratio.
³⁴ In the terminology of Nelson (1970), education, income, and intelligence are
“search” traits, whereas resiliency and growth potential are “experience” traits.




therefore, an analysis of sorting that assumes perfect information (as in
Part I [1973]) would predict the sorting by more-easily searched traits,
such as education, better than the sorting by more-difficult-to-search
traits, such as resiliency.³⁵

Married persons also must make decisions about marriage: should they
separate or divorce, and if they do, or if widowed, when, if ever, should
they remarry? The incentive to separate is smaller the more important
are investments that are “specific” to a particular marriage.³⁶ The most
obvious and dominant example of marriage-specific investment is children,
although knowledge of the habits and attitudes of one’s mate is also
significant. Since specific investments would grow, at least for quite a
while, with the duration of marriage, the incentive to separate would
tend to decline with duration.

The incentive to separate is greater, on the other hand, the more
convinced a person becomes that the marriage was a “mistake.” This
conviction could result from additional information about one’s mate or
other potential mates. (Some “search” goes on, perhaps subconsciously,
even while one is married!) If the “mistake” is considered large enough
to outweigh the loss in marriage-specific capital, separation and perhaps
divorce will follow.

The analysis in Part I (1973) predicts sorting patterns in a world with
perfect information. Presumably, couples who deviate from these patterns
because they were unlucky in their search are more likely than others to
decide that they made a “mistake” and to separate as additional
information is accumulated during marriage. If they remarry, they should
deviate less from these patterns than in their first marriage. For example,
couples with relatively large differences in education, intelligence, race,
or religion, because they were unlucky searchers, should be more likely to
separate,³⁷ and should have smaller differences when they remarry. In
the subsequent paper referred to earlier, I plan to develop more
systematically the implications of this analysis concerning separation, divorce,
and remarriage, and to test them with several bodies of data.

6. Summary 

The findings from this extension of my earlier paper on “The Theory of
Marriage” (1973) include:

a) An explanation of why persons who care for each other are more
likely to marry each other than are otherwise similar persons who do not.

³⁵ See the discussion in section 3 of the Appendix.

³⁶ The distinction between general and specific investment is well known, and can be
found in Becker (1964, chap. 11). Children, for example, would be a specific investment
if the pleasure received by a parent were smaller when the parent was (permanently)
separated from the children.

³⁷ If they have relatively large differences because they were less efficient searchers,
they may be less likely to separate.




This in turn provides a justification for assuming that each family acts as
if it maximizes a single utility function.

b) An explanation of why polygyny, when permitted, has been more
common among successful men and, more generally, why inequality
among men and differences in the number of men and women have been
important in determining the incidence of polygyny.

c) An analysis of the relation between natural selection over time and
assortive mating, which is relevant, among other things, for understanding
the persistence over several generations of differences in incomes between
different families.

d) An analysis of which marriages are more likely to terminate in
separation and divorce, and of how the assortive mating of those
remarrying differs from the assortive mating in their first marriages.

The discussion in this paper is mainly a series of preliminary reports on
more extensive studies in progress. The fuller studies will permit readers
to gain a more accurate assessment of the value of our economic approach
in understanding marital patterns.


Mathematical Appendix

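1. Formally, M (or F) maximizes his utility function

\[ U_m = U_m(Z_m, Z_f) \tag{A1} \]

subject to the constraints

\[ Z_m^0 - C_m = Z_m, \qquad Z_f^0 + C_m = Z_f, \qquad C_m \ge 0, \tag{A2} \]

where $Z_m^0$ and $Z_f^0$ are the market allocations of output to M and F, and $C_m$ is the
amount transferred by M to F. If $C_m > 0$, these constraints can be reduced to a
single income constraint by substitution from the $Z_f$ into the $Z_m$ equation:

\[ m_{mf} = Z_{mf} = Z_m^0 + Z_f^0 = Z_m + Z_f, \tag{A3} \]

where $Z_{mf}$ is the output produced by M and F, and $m_{mf}$ is M's income.
Maximization of $U_m$ subject to this single income constraint gives

\[ \frac{\partial U_m}{\partial Z_m} = \frac{\partial U_m}{\partial Z_f}. \tag{A4} \]

If $C_m = 0$, $U_m$ is maximized subject to the two constraints $Z_m^0 = Z_m$ and $Z_f^0 = Z_f$.
The equilibrium conditions are $\partial U_m/\partial Z_m = \lambda_m$, $\partial U_m/\partial Z_f = \mu_m$, where $\lambda_m$ and
$\mu_m$ are the marginal utilities of additional $Z_m^0$ and $Z_f^0$, respectively. The income of
M would then be

\[ m_{mf} = Z_m^0 + \frac{\mu_m}{\lambda_m} Z_f^0, \tag{A5} \]

where $\mu_m/\lambda_m$ is the “shadow” price of $Z_f$ to M in terms of $Z_m$.
Since $\mu_m/\lambda_m < 1$ (otherwise $C_m > 0$), $m_{mf} < Z_m^0 + Z_f^0 = Z_{mf}$.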




If $C_m > 0$, the “family” consisting of M and F would act as if it maximized the
single “family” utility function $U_m$ subject to the single family budget constraint
given by (A3), even if F’s utility function were quite different from $U_m$. In effect,
transfers between members eliminate the conflict between different members’
utility functions.
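
A small numerical sketch may help fix ideas. It assumes, purely for illustration, that M's utility has the Cobb-Douglas form $U_m = Z_m^{\alpha} Z_f^{1-\alpha}$; this functional form, the function name `head_allocation`, and the parameter `alpha` are not in the original. With a positive transfer, condition (A4) gives $Z_m = \alpha Z_{mf}$, so final consumption depends only on total output; with no transfer, M's income is evaluated at the shadow price as in (A5).

```python
def head_allocation(z_m0, z_f0, alpha=0.5):
    """M's choice when U_m = Z_m**alpha * Z_f**(1 - alpha) (assumed form).

    z_m0, z_f0: market allocations of output to M and F (both > 0).
    Returns (transfer C_m, final Z_m, final Z_f, M's income m_mf).
    """
    if z_m0 <= 0 or z_f0 <= 0 or not 0 < alpha < 1:
        raise ValueError("allocations must be positive and 0 < alpha < 1")
    total = z_m0 + z_f0
    transfer = z_m0 - alpha * total       # desired C_m implied by (A3)-(A4)
    if transfer > 0:
        # Interior case: consumption depends only on total output.
        return transfer, alpha * total, (1 - alpha) * total, total
    # Corner case C_m = 0: value F's allocation at the shadow price mu/lambda.
    shadow_price = (1 - alpha) / alpha * z_m0 / z_f0
    return 0.0, z_m0, z_f0, z_m0 + shadow_price * z_f0

print(head_allocation(6, 2))   # M transfers to F
print(head_allocation(5, 3))   # different market allocation, same total
print(head_allocation(2, 6))   # no transfer; income below total output
```

In the first two calls the final allocation is the same even though the market allocations differ, which is the sense in which the household behaves as if it maximized a single utility function. In the third call no transfer occurs and M's income falls short of total output.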

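2. Total income in a marriage between M and F is

\[ m_{mf} + f_{mf} = I_{mf} = Z_{mf} + p_m Z_f^0 + p_f Z_m^0, \]

where $I_{mf}$ is the total income in the marriage, $Z_m^0$ and $Z_f^0$ are the outputs
allocated to M and F, $Z_{mf}$ $(= Z_m^0 + Z_f^0)$ is total output, $p_m$ is the shadow price
to M of a unit of $Z_f^0$, and $p_f$ is the shadow price to F of a unit of $Z_m^0$. Their incomes
must be in the intervals

\[ Z_m^0 + p_m Z_f^0 = m_{mf} \le Z_{mf}, \qquad Z_f^0 + p_f Z_m^0 = f_{mf} \le Z_{mf}. \tag{A7} \]

If $p_m = p_f = 0$ (no caring), $m_{mf}$ and $f_{mf}$ can be anywhere between 0 and $Z_{mf}$.
But if $p_m = p_f = 1$ (mutual and full caring), then $m_{mf} = f_{mf} = Z_{mf}$. And
more generally, if $p_m$ and $p_f > 0$, then

\[ Z_m^0 < m_{mf} \le Z_{mf} < I_{mf}, \qquad Z_f^0 < f_{mf} \le Z_{mf} < I_{mf}. \tag{A8} \]

Consider the following matrix of total incomes:

\[
\begin{array}{c|cc}
 & F_1 & F_2 \\ \hline
M_1 & 8 & \underset{(4,\,4)}{8} \\
M_2 & \underset{(3,\,4)}{7} & 7
\end{array}
\tag{A9}
\]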


On the surface, both sortings are equally optimal, but this is not so if only $M_1$
and $F_2$ have a marriage with caring, say full and mutual, so that $m_{12} = f_{12} = 4$.³⁸
The sorting $M_1F_2$ and $M_2F_1$ is not as viable as the sorting $M_1F_1$ and $M_2F_2$
because income is more divisible between $M_1$ and $F_1$ than between $M_1$ and $F_2$.³⁹
For if, say, $m_{11} = 4\tfrac{1}{2}$, $f_{11} = 3\tfrac{1}{2}$, $m_{22} = 4\tfrac{1}{2}$, and $f_{22} = 2\tfrac{1}{2}$, no two persons have
an incentive to change mates and marry each other.⁴⁰ On the other hand, since
$m_{12} = f_{12} = 4$, unless $m_{21} = 3$ and $f_{21} = 4$, either $M_1$ and $F_1$, or $M_2$ and $F_2$
would be better off by marrying each other. If $m_{21} = 3$ and $f_{21} = 4$, $M_1$ and $F_1$,
and $M_2$ and $F_2$ could be just as well off by marrying each other. Therefore, this
sorting is not as viable as the sorting that does not have any marriages with caring.
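
The viability comparison can be verified by direct computation. The sketch below is illustrative and uses hypothetical function names; it assumes, as in the text, that the output of $M_1$ and $F_2$ is 4 and indivisible (each receives 4 under full caring), while the outputs of the other three pairs (8, 7, and 7) are fully divisible. It scans divisions of the output of $M_2$ and $F_1$ on a grid and reports those that leave no blocking pair, and it checks the diagonal sorting at the incomes suggested above.

```python
def stable_divisions(step=0.5):
    """Divisions (m21, f21) of the divisible output 7 under which the sorting
    M1F2, M2F1 has no blocking pair, given m12 = f12 = 4 (full caring)."""
    stable = []
    for k in range(int(7 / step) + 1):
        m21 = k * step
        f21 = 7 - m21
        m1_f1_block = 8 > 4 + f21   # M1 (income 4) and F1 could divide 8
        m2_f2_block = 7 > m21 + 4   # M2 and F2 (income 4) could divide 7
        if not (m1_f1_block or m2_f2_block):
            stable.append((m21, f21))
    return stable

def diagonal_is_stable(m11=4.5, f11=3.5, m22=4.5, f22=2.5):
    """Check the sorting M1F1, M2F2 at the incomes suggested in the text."""
    # M1 and F2 would each receive the indivisible output 4 if they married.
    m1_f2_block = 4 > m11 and 4 > f22
    # M2 and F1 could divide an output of 7 between them.
    m2_f1_block = 7 > m22 + f11
    return not (m1_f2_block or m2_f1_block)

print(stable_divisions(), diagonal_is_stable())
```

Only the single division (3, 4) keeps the sorting with caring free of blocking pairs, whereas the diagonal sorting is supported by a range of divisions, of which the one in the text is an example.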

3. Assume that the gain from marriage of a particular person M is positively
related to the expected values of two traits of his mate, as in $m = g(A_1, A_2)$, with


³⁸ The output between $M_1$ and $F_2$ also equals four, half that between $M_1$ and $F_1$.

³⁹ Or, put differently, the output between $M_1$ and $F_1$ exceeds that between $M_1$ and
$F_2$.

⁴⁰ $F_2$ would prefer to marry $M_1$, but could not induce $M_1$ to do so because $m_{12}$
cannot exceed four, the output produced by $M_1$ and $F_2$ (see eq. [A7]), which is less than
$m_{11} = 4\tfrac{1}{2}$.




$\partial g/\partial A_i \equiv g_i > 0$, $i = 1, 2$. If the marginal costs of search were $c_1$ and $c_2$ for $A_1$
and $A_2$, respectively, equilibrium requires that

\[ \frac{g_1}{g_2} = \frac{c_1}{c_2}. \tag{A10} \]

The lower $c_1$ is relative to $c_2$, the higher generally would be the equilibrium value
of $A_1$ relative to $A_2$, since convexity of the isogain curves is a necessary condition
for an internal maximum.

If $g_1$ and $g_2$ were invariant when search costs changed to all participants in
the marriage market, not an innocuous assumption, then $A_1^{\max}$ and $A_2^{\max}$ would
be the equilibrium values of $A_1$ and $A_2$ to M when everyone had perfect
information about all traits. A reduction in the cost of searching $A_1$, therefore, would
move the equilibrium value of $A_1$ to M closer to $A_1^{\max}$, its value with perfect
information.
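
Condition (A10) can be illustrated with a worked example. The sketch assumes a specific gain function, $g = A_1^{a} A_2^{b}$ with $a + b < 1$, constant marginal search costs, and an objective $g - c_1 A_1 - c_2 A_2$; these choices and the function names are hypothetical and serve only to show the direction of the effect described above.

```python
def equilibrium_traits(c1, c2, a=0.3, b=0.3):
    """Interior maximum of g(A1, A2) - c1*A1 - c2*A2 for the assumed gain
    function g = A1**a * A2**b with a + b < 1."""
    if min(c1, c2, a, b) <= 0 or a + b >= 1:
        raise ValueError("need positive costs and exponents with a + b < 1")
    # First-order conditions g_i = c_i give A1 = a*g/c1 and A2 = b*g/c2;
    # substituting back into g and solving for g yields:
    g = ((a / c1) ** a * (b / c2) ** b) ** (1.0 / (1.0 - a - b))
    return a * g / c1, b * g / c2

def marginal_gain_ratio(a1, a2, a=0.3, b=0.3):
    """g_1 / g_2 for the assumed gain function."""
    return (a / a1) / (b / a2)

a1, a2 = equilibrium_traits(c1=1.0, c2=2.0)
print(round(marginal_gain_ratio(a1, a2), 6), round(a1 / a2, 6))

# Lowering the cost of searching the first trait raises A1 relative to A2.
b1, b2 = equilibrium_traits(c1=0.5, c2=2.0)
print(round(b1 / b2, 6))
```

At the computed optimum the ratio of marginal gains equals $c_1/c_2$, and a reduction in $c_1$ alone raises the equilibrium value of $A_1$ relative to $A_2$.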



Comment: The Economics of 
Nonmonetary Variables 


William J. Goode 

Columbia University


In the history of science researchers have often borrowed theories,
analogies, or metaphors from other fields, usually the better-developed
ones; in economic terms, they invested their human capital by acquiring
new and presumably more-advanced intellectual tools. The most
conspicuous borrowing in nineteenth-century social science was the
unfortunately imaginary set of the developmental sequences of societies,
worked out by anthropology and sociology on the basis of findings from
biological evolution. It is less often that scientists in a relatively developed
field become restive with its constraints and invade another with the
aspiration that their more powerful technical and theoretical tools will
solve problems with which the less-developed field has not adequately
coped. (This is a challenge I have sometimes hurled at physicists who
believe that the findings of sociology are simple-minded.) Over the past
two decades, physicists tried this successfully in their contributions to
molecular biology, and now economists have been expanding their world
by attempting to analyze problems usually brooded over by sociologists,
social psychologists, anthropologists, and political scientists.

In this case, interestingly enough, they will encounter colleagues in
these fields who have themselves been moving toward economic or
quasi-economic analysis of the same phenomena, though to be sure they have not
attempted many social analyses of purely market processes. Especially in
social psychology and sociology, a small group of theorists have for
nearly two decades been working out exchange and allocational problems
with the aid of economic ideas.

They have been hampered by their lack of mastery over the tools of
economics, their failure to use economics explicitly, and their unwillingness
(thus shouldering a burden economists of the past did not wish to
carry) to do much theorizing without the facts. In my own case, I have





been trying to understand social-control systems, of which monetary
controls form one set, by considering how prestige, force and force threat,
and love or affection are accumulated, allocated, or lost. Even in my
monograph on divorce, written two decades ago, I was the first both to
demonstrate rather fully the inverse relationship between class position
and divorce rates (a prediction now made by Becker) and to give an
essentially economic explanation for it (see Goode 1956, 1962).

The differing styles of economics and sociology prevent me from making
an adequate, brief analysis of Becker’s paper. I perceive many tautologies
in the paper, but I know that disturbs an economist less than a sociologist.
Where economists may be content with certain types of summary indexes,
I want to see a large cross-table of percentages. Often I must respond by
saying, in answer to an elegant set of mathematical formulas: “It is a
beautiful flight but it is not reliable for transportation”; that is, it simply
is not true, though that may seem to an economist only a crude answer.
An example of this would be Becker’s formulation that late marriage and
the difficulty of divorce will be correlated. India, for example, has an early
age of marriage but very difficult divorce; in the West the main line of
division is between Catholic and non-Catholic countries, and in general
Catholic countries have almost no divorce but a slightly earlier age at
marriage (except for Ireland) than other Western countries, and so on.
Of course, to an economist, that may seem at best an unimaginative
answer.

In any event, though I am nervous about this invasion, I welcome it.
In these few pages I shall look at only a few minor points in Becker’s
paper to illustrate a somewhat general problem that is often encountered
in some explorations by economists.

A major source of weakness in this bold foray into intellectual fields
such as sociology and political science that have been trying for generations
to create an autonomous noneconomic body of theory is Becker’s failure
to be daring enough in a critical question, that is, whether any
nonmonetary variables actually enter the calculations. In fact, as we see
from the very first stages (Becker 1973, p. 817), both the market and
nonmarket variables are aggregated to explain S, which is the “full income,
the maximum money attainable if the $w_j$ [i.e., wage rates, of the jth
member] are constants.”

Although I believe this monetary emphasis may ultimately be a weakness,
since the noncontractual structure of exchanges in those sectors may
create different behaviors than in the monetary sector,¹ we are left with
no assurance that the broadened economic formulation will be adequate,
since in fact it has not been fully built into the equations or tested. That
is, the nonmonetary variables have not been given due weight, but have
been monetized.

¹ This matter is analyzed at length in my The Celebration of Heroes:
Prestige as a Control System (forthcoming). Meanwhile, see my Explorations
in Social Theory (1973).

That monetary equivalent, so tempting to the economist because of his 
past training, can create both obstacles and factual errors in the analysis.
Perhaps I should add a theoretical point, that since the income figure is
purely hypothetical or imaginary, it has no greater degree of reality than
an apparently subjective factor such as prestige.

To take a minor example, education is worth something on both the 
market and nonmarket exchange systems, and for some purposes a high 
education can be given an equivalent monetary value, but for a man with 
little education, a wife with very much education is not worth as much as 
one would suppose from her money value on the larger market. She is a
less-fit wife for him, by nonmonetary calculations (which would still be
economic), and he would be supported in this low evaluation by his
social circle.

Let us consider at length a more elaborate instance of the problems 
this failure creates, specifically the lines of reasoning as to whether and 
why polygyny occurred and why it has usually been the upper-strata men
who could or did take advantage of this opportunity. The case is
instructive, for the focus on total monetary income leads to factual errors
and obscurities, but a general focus on nonmarket economic analysis
would clarify some of the facts we do know and leave one question
unsolved but at least not obscure.

Becker reasons (correctly, I think) that the total monetary output of
monogamy would be higher than that of either polyandry or polygyny.
Here the facts are more powerful than he supposes, for whether or not
multiple marriage was permitted or encouraged, it has not been common
in any society, except under very special conditions. For most men, in all
societies over time, polygyny was a statistically unlikely delight.

Obviously, a very low ratio of men to women, caused by successful
conquests or by high death rates in war, might permit polygyny for a
while, but such conditions are not likely to continue for long. Late
marriage for men and early marriage for women raise the chances somewhat,
but high mortality among women keeps the total number of woman-years
available to each man rather limited. As a consequence, polygyny has not
been general even where it is approved.

A purely market explanation may yield this aggregate result even if, as
I believe, the reasoning in its favor (see his section 3, above) is
tautological. That is, for most men and women the net market payoff is
higher if they pair together rather than make other types of arrangements.
However, this reasoning fails at the next step, the explanation for the
unequal distribution of women among men even where polygyny is permitted.

At this next step it is necessary to give due weight to strictly
nonmonetary factors. First, Becker argues (see his section 3, above) that
the “total output over all marriages could be greater if a second wife to
an able man added more to output than she would add as a first wife to a
less able one.” Again leaving aside the tautological reasoning, this
hypothetical result, an increase in total output over all marriages, is
essentially empty as a motivating personal or social force in marital
decisions.

As in many family decisions, especially those concerning fertility, the 
individual who decides does not ordinarily concern himself very much
with his or her effects on the total output, or society as a whole. Thus,
whether total output over all marriages is greater does not motivate people 
to enter polygyny. If Becker disclaims this as a motive, it is nevertheless 
obvious from the context that it is viewed as a partial explanation for the 
distribution of wives in a society. 

Nevertheless, if we ask only about higher or lower monetary output of 
individual marriages, under Becker’s assumptions we could expect that 
able men would try to get a second wife, and their chances of success 
would be greater than those of less able men.

If nothing else were involved, we could be satisfied that the unequal
distribution of wives among the total number of husbands is now
explained, at least from the man’s side (from the woman’s side, the matter
is less clear, since under polygyny the husband takes a much larger share
than the wife does). Indeed we might then take a step further and predict
which economic strata of men would be able to reap this advantage, and
even how much they would pay in order to make this investment in a wife,
as compared with alternative investments in other types of capital goods.
Where, for example, as in much of Central Africa, women may earn
money by engaging in small-scale trading, or by tilling their own gardens
for the family, we might even be able to put real monetary values on this
kind of investment. Moreover, we might even give a better explanation for
polyandry than Becker essays, since for the most part such systems occur
where the productivity of a given man is low and he has little to invest in
a wife, so that a set of men (often brothers) pool their assets in order to
enjoy the benefits of one wife.²

However, reasoning from market or monetary total output, whether of
the marriage or of all marriages, would miss the importance of women as
goods, of women as producers of children, of women as links in purchasing
political power, of women as prestige commodities, for monetary reasoning
would not predict that women would be added where their monetary
market output drops or almost stops upon entering marriage. For the
stubborn fact remains that multiple marriage is most common among the
stratum that gets the least monetary economic output from wives, that is,
the most powerful and rich, who are most likely to wall them up and use
them as pets, display them as objects that prove the man’s high position,
use them in power linkages, or even allow them to become parasites who
destroy the family wealth by their extravagance. Indeed, it seems
reasonable to assert that the higher the class position, the greater the
likelihood of some polygynous arrangements, but equally the greater the
likelihood that wives represent net losses, except in purely nonmarket
goods. Like some other monetary goods, they are in part “bought in special
markets”; they are also like such goods in another respect, for instance,
furs, polo ponies, Rolls Royces, in that they can be put to other practical
uses, and in revolutions sometimes are.

² In polyandry these are often brothers, perhaps because this causes less
strife, and alleviates somewhat the problem of producing one’s own
children, which Becker speaks of as a goal in marriage. In any event,
whether or not the husbands are brothers, there are usually rules to decide
who is sociologically defined as father of a given child.

Thus, I am asserting that the reasoning of Becker’s paper sometimes 
goes awry not because it uses economic reasoning in nonmarket areas, but 
because it fails to take note of the powerful nonmarket variables, whose 
effects may run counter to those of income variables, however these latter 
are aggregated. Nor am I as yet sure that market prices or decisions
“prove” the total output is greater, whether of general nonmonetary or
monetary goods, under these arrangements. As Frederick Knight said more
than half a century ago, all suppliers are partially monopolies, and we
know that these markets are rigged.

In a parallel fashion, the reasoning fails in its attempt to explain the 
decline of polygyny, that is, the system, in our time and to some extent 
(although the facts are less clear) over a longer period of time; less
clear because we suppose that never in the history of the recent world have
most nations been polygynous except under rare conditions.

After flirting with a biological explanation, Becker involves himself in a
complex refutation of the claim that women would be better off
economically if polygyny were abolished (see his section 3, above). But his
argument, and those he tries to refute, are essentially irrelevant, because
all of them still seem to refer to the total market output of the household.
It is much more likely that nonmonetary variables shape this decline in
polygyny.

Here, as so often in such ventures into why social structures assume their
sometimes peculiar shape, we have to ask, not merely about the
nonmarket variables, but who decides, who organizes the market. In spite of
Becker’s graceful reference (n. 27) to Shaw’s clever comment, the fact is
that we have no evidence from any time or place that women ever created
a polygynous system. It was men who established them, men whose
command of force and force threat, prestige, and wealth was paramount
in all the societies we know. Doubtless, some women did better
economically under such systems, as some women in the United States would
if they joined with other women to marry one rich and powerful man, but
the system as a whole was not created by women, and wherever new laws,
social movements, revolutionary ideologies, industrialization,
urbanization, or religion permit them a free market choice they move away
from polygyny.³

I shall not presume, especially in the midst of our contemporary
turmoil about who has the right to report what women feel, to know
precisely why women object to this system, although I think Becker’s
earlier, excellent analysis of caring and sharing is relevant. From the
known fact that the birth rate under polygyny is less than under
monogamy, and from the rules about equal sexual rights in polygynous
systems, we must infer that women do not get as much tenderness, affection,
and sexual enjoyment in those systems; again, we can use economic theory to
reason about these variables, but they are not monetary. Men are in a
much more influential position in such systems, and do not have to share
as much of the physical output of the family, as indeed they also
monopolize much of the prestige-esteem output of the family. They have the
power to impose their own prices in such systems. Thus, when external
constraints, including money-market constraints, permit, women decline
to enter such unions, even when they are still permitted.

I suggested earlier that after this critical commentary an important
puzzle would still remain, although it is one that Becker has really not
perceived clearly. It is why some social systems are polygamous, and others
are not. An explanation of a system may not be adequate as an explanation
of how some individuals act within it. The possibility that some men will
benefit from taking more than one wife does not explain why polygyny
occurs here rather than there, and it especially fails if we remember that
even in most such systems the typical marriage is a pairing, one man to
one woman. Why are such systems approved and viewed as an ideal,
though few can attain it?

I do not think that any current explanations work well, though
doubtless when one does it will be some sort of nonmarket economic analysis.
That explanation would have to confront the fact that Western societies,
including Latin America, of course, have not been polygynous in any
historical period; and of course I do not mean only in the statistical or
distributional sense, but in the normative sense, that polygyny was never
a socially approved pattern of marriage. Both Japan and China have been
mainly monogamous, though in both cases wealthy or powerful men
could purchase additional concubines, and in China some of these came
close to being recognized as “secondary wives” with specific rights.
India, similarly, has been mainly monogamous, though again with a
much more open acceptance of concubines. Perhaps most societies outside
those great civilizations (but remember they encompass most of the
world’s population) have been polygamous in the specific but narrow
sense that some form of polygamy was normative, something to which
men and families aspired.

³ See in this connection the relevant sections of my World Revolution and
Family Patterns (1963), esp. chap. 4 on Sub-Saharan Africa.

Whatever those explanatory variables will turn out to be when we
locate them, I rather suppose they will not be monetary, and (more
important) their impact will be seen primarily among the people who
dominated those sociopolitical systems and those marriage markets, that
is, the upper-strata men. Upper-strata men created a set of constraints
that defined the marriage and family market so as to yield great monetary
and nonmonetary advantages to them.

Few good theories in the history of science “fitted the facts” very well, 
and the “facts” at any given time were partly wrong, so that each explorer 
must follow his or her temperament rather than a rule which would 
ground new thinking only on the facts or only on free speculation. I do
suspect, however, that a deeper sense of the crude regularities would not
hinder these elegant flights, but might pose still more interesting puzzles
for the imaginative economist. More fundamentally, I am at least partly
persuaded by the work before us that in several fields we are moving
toward a single structure of hypotheses about social behavior, hypotheses
supported by research in several different fields under different labels, and
that two decades hence we may discover we have created a rather
impressive body of social science, if not “unified,” at least mutually
confirmatory. Becker’s explorations are an important step in that progress.



The United States Marriage Market 


Alan Freiden 

Virginia Polytechnic Institute and State University


I. Introduction 

The empirical implementation of theoretical contributions in the social
sciences usually rests on variables defined and measured by persons other
than the theorist. Such is the case with the theory of marriage. For the
United States, the only relevant demographic quantity measured with
accuracy is two-person, male-female legal marriage. Thus, my decision to
study legal marriage was imposed by the Bureau of the Census and the
inherent difficulties of measuring other forms of communal living. This
does not seem to be restrictive for the United States, where consensual
marriage is often a prelude to legal marriage, but it may be important in
countries such as those in Latin America, where legal and other social
institutions (i.e., divorce laws) differ.¹

That a model of marriage based on rational choice has substantial
explanatory power is demonstrated in this study. Three factors (the ratio
of the sexes, the potential returns to marriage, and the cost of divorce)
explain many of the areal differences in marital behavior in the United
States. It is also evident that the marriage market has important indirect
effects on the reproduction rate.

I thank Gary Becker, H. Gregg Lewis, Marc Nerlove, Margaret Reid, T. W. Schultz,
and T. Dudley Wallace for their many helpful suggestions, not all of which have been
incorporated in this paper. I, of course, am solely responsible for any errors. The work
for this paper was financed in part by grants to the University of Chicago from the
National Institutes of Mental Health and the Rockefeller Foundation.

¹ For an analysis of the three forms of marriage recognized in Brazil, see Kogut (1972).
Cheung (1972) has described the structure of property rights within marriage in China.


II. The Marriage Market

People decide to marry when they expect to enjoy some flow of “real”
income, denoted by Z, which is greater than that which they could receive
if single.² The Z represents an abstract commodity, produced within
households, which is the true source of personal satisfaction. Let

$$Z^{mf} = Z^{mf}_m + Z^{mf}_f$$

represent the yield to marriage, where the right-hand side is the
distribution of this “real” income between the married partners.³ With
$Z^{mf}$ taken as given, this distribution of married real income is
determined by the supply of husbands and wives to the marriage market. The
qualitative results derived holding $Z^{mf}$ constant are not altered when
couples are sorted in a manner which optimizes the total output of Z within
the community.

If unmarried, the ith male would obtain the real income $Z_{mi}$ and the
ith single female would obtain $Z_{fi}$. Either would prefer marriage if and
only if

$$Z_{mi} \le Z^{mf}_m$$

or

$$Z_{fi} \le Z^{mf}_f.$$

Since no male will accept less than his single income, the first male
desiring marriage will be forthcoming at some level of married income,
$Z^{mf}_m$, which is equal to the minimum single income, $Z_{mi}$, taken over
all males. By ordering males with respect to their single incomes, the supply
curve of males in the marriage market is equivalent to the cumulative
distribution of males with respect to the $Z_{mi}$. In figure 1, this
distribution has been drawn as if the maximum single income were undefined;
that is, the supply curve asymptotically approaches the boundary set by the
total number of males (which initially equals the number of females). The
qualitative results derived here do not depend on this particular
distribution. Similarly, the supply curve of females is the cumulative
distribution of the $Z_{fi}$. Now, since

$$Z^{mf}_f = Z^{mf} - Z^{mf}_m,$$

the supply of females is a negatively inclined function of the male marital
income, $Z^{mf}_m$, and can be viewed as the derived demand for husbands by
females.⁴ No women can be found who would offer more than $Z^{mf}$ less
the minimum of $Z_{fi}$ for a husband, so the supply-of-potential-wives curve
begins at

$$Z^{mf} - Z_{fi}\,(\min)$$

and falls to the right.

² This section is based on Becker (1973). For another application of the general model
see Keeley (1973).

³ The superscript mf denotes a married couple. The subscripts m and f refer to single
males and females.

⁴ The analysis is, of course, perfectly symmetrical for the derived demand for wives by
males.



Fig. 1. The marriage market



The intersection, e, of the supply curves in figure 1 gives the number of
men and women marrying as well as the distribution of income within
marriage. The proportion of females married will be $N_{mf}/N_f$ and their
real income will be $Z^{mf}_f = Z^{mf} - Z^{mf}_m$.

Now, consider an increase in the number of men to $N'_m$. This will shift
the equilibrium to $e'$, raising the real income of married women to

$$Z^{mf\prime}_f = Z^{mf} - Z^{mf\prime}_m > Z^{mf}_f = Z^{mf} - Z^{mf}_m.$$

This increase is not directly observable, but the coincident increase in
the number of females married, from $N_{mf}$ to $N'_{mf}$, will raise the
proportion of women married to

$$\frac{N'_{mf}}{N_f} > \frac{N_{mf}}{N_f}.$$







Thus, the model predicts that the proportion of women married should be
positively related to the ratio of men to women.
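
The sex-ratio prediction can be illustrated numerically. What follows is only a minimal sketch under stated assumptions, not part of the original analysis: it assumes a finite list of men and women, a common married output `z_mf`, and made-up single incomes, and the function names are hypothetical. Because both supply curves are cumulative distributions of single incomes, the nth couple forms only if the nth-lowest male and female single incomes sum to no more than `z_mf`.

```python
def marriages(male_single, female_single, z_mf):
    """Count couples formed when every couple produces the same output z_mf.

    Assumption of this sketch: agents enter the market in order of their
    single incomes, as along the cumulative-distribution supply curves.
    """
    males = sorted(male_single)
    females = sorted(female_single)
    n = 0
    for z_m, z_f in zip(males, females):
        # The marginal pair marries only if some split of z_mf leaves both
        # partners at least as well off as when single.
        if z_m + z_f <= z_mf:
            n += 1
        else:
            break
    return n


def proportion_females_married(male_single, female_single, z_mf):
    """Share of all women who marry in the sketch market."""
    return marriages(male_single, female_single, z_mf) / len(female_single)


# Hypothetical single incomes: ten women and, initially, ten men.
women = list(range(1, 11))
men = list(range(1, 11))
more_men = men + men  # doubles the ratio of men to women

print(proportion_females_married(men, women, 10))       # 0.5
print(proportion_females_married(more_men, women, 10))  # 0.6
```

With these illustrative numbers, doubling the number of men raises the proportion of women married from 0.5 to 0.6, which is the direction the model predicts.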

To derive further implications, the nature of the household production
process generating $Z^{mf}$ (and the single real incomes) must be made
explicit. Let

$$Z = g(X, T_m, T_f)$$

represent the household production function for the flows of real income
previously denoted by $Z^{mf}$, $Z_m$, and $Z_f$. The X represents goods and
services purchased in the market, $T_m$ is male time, and $T_f$, female
time. The g is then the household production function which summarizes the
technology available for converting market goods and time into the actual
source of utility, namely, Z. As increasing returns to scale would always
result in a gain to marriage regardless of the number or sex of the
participants, g will be a function such that economies of scale are ruled
out.⁵

A single male would maximize his production of Z subject to the
constraint that $T_f = 0$, that is, Max $Z = g(X, T_m, T_f)$, subject to
(1) $T_m + L_m = T$, $L_m$ = market work, (2) $PX = W_m L_m + V_m$,
P = index of prices, $W_m$ = wage rate, $V_m$ = nonmarket income, and
(3) $T_f = 0$. The maximized value of Z is $Z_m$. Similarly, a single female
would Max $Z = g(X, T_m, T_f)$, subject to (1) $T_f + L_f = T$,
(2) $PX = W_f L_f + V_f$, and (3) $T_m = 0$. The maximized value of Z is
$Z_f$. In either case, one person must substitute his own time to make up for
the absence of the other’s. A married couple would maximize Z without the
third constraint: Max $Z = g(X, T_m, T_f)$, subject to (1) $T_m + L_m = T$,
$T_f + L_f = T$, and (2) $PX = W_m L_m + W_f L_f + V_m + V_f$. The
maximized value of Z is $Z^{mf}$. The gain to marriage is then

$$G = Z^{mf} - (Z_m + Z_f).$$

A more convenient form of this expression is obtained as follows. If
$Z = g(X, T_m, T_f)$, where g is a function of factor inputs, then there
exists a function $c = c(P, W_m, W_f)$, a function of factor prices, which
gives the minimum average cost of producing units of Z in the household
(this is the cost dual of the household production function). For married
couples this function is $c^{mf}(P, W_m, W_f)$; for single males it is
$c_m(P, W_m)$; and so on. The maximized value of Z is then total resources
available divided by this average cost of production. Resources available,
called full income, can be obtained by combining the time and goods
constraints. For a married couple these constraints are (1) $T_m + L_m = T$,
$T_f + L_f = T$, and (2) $PX = W_m L_m + W_f L_f + V_m + V_f$.


⁵ Note that a gain to marriage is not ruled out by decreasing returns to scale.




Let

$$L_m = T - T_m \quad\text{and}\quad L_f = T - T_f;$$

then

$$PX = W_m(T - T_m) + W_f(T - T_f) + V_m + V_f,$$

or

$$PX + W_m T_m + W_f T_f = (W_m + W_f)T + V_m + V_f.$$

The left-hand side is the total monetary equivalent of time and
market-goods expenditures, while the right-hand side is full income. Thus,

$$G = \frac{(W_m + W_f)T + V_m + V_f}{c^{mf}(P, W_m, W_f)} - \left[\frac{W_m T + V_m}{c_m(P, W_m)} + \frac{W_f T + V_f}{c_f(P, W_f)}\right].$$

By totally differentiating G, further implications of the model may be
derived.⁶

First, consider the wage rates. If

$$W_m = W_f, \quad MP_{T_m} = MP_{T_f}, \quad\text{and}\quad V_m = V_f,$$

then

$$Z_m + Z_f = 2Z = Z^{mf}.$$

This means that if male and female time are perfect substitutes in both
household and market production, there is no gain from marriage.
However, in all other cases $Z^{mf}$ will exceed the combined single incomes.
Note that the marginal products must be equal throughout the range of
inputs and not just at a point. In particular, the combined gain would be
greater the larger the difference between $W_m$ and $W_f$. If, as is usually
the case, $W_m > W_f$, then the husband will specialize in market production
while the wife concentrates her time in the home. Therefore, the gains
from marriage and the proportion of females married should be positively
related to the relative wage rate, $W_m/W_f$. However, the effect of equal
proportionate increases in the male and female wage rates is ambiguous.
On the one hand, there is a positive income effect associated with the rise
in full income. But on the other hand, there is a negative
cost-of-production effect due to the elasticity of the cost dual of the
household production function with respect to the wage rates. If this
elasticity is sufficiently large (in absolute value), the net effect of
equiproportionate increases in male and female wages may be negative. Note
that equal percentage increases in the nonlabor incomes $V_m$ and $V_f$ do
not generate a negative cost-of-production effect, so the gains to marriage
are expected to rise with such a change.
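
The wage results can be checked on a concrete case. The sketch below is illustrative only and rests on assumptions that are not in the paper: a hypothetical Cobb-Douglas technology Z = X^a H^(1-a), in which home time H = T_m + T_f is perfectly substitutable between partners, so that the cost dual is c(P, w) = (P/a)^a (w/(1-a))^(1-a) with w the wage of whoever supplies home time. All function names and parameter values are made up, and the code refuses to apply the dual formula when the implied home time would exceed the time available.

```python
def unit_cost(p, w, a):
    """Cost dual of the assumed technology Z = X**a * H**(1 - a)."""
    return (p / a) ** a * (w / (1.0 - a)) ** (1.0 - a)


def household_output(full_income, p, w_home, t_home, a):
    """Full income divided by average cost; valid only at an interior optimum."""
    home_time = (1.0 - a) * full_income / w_home
    if home_time > t_home:
        raise ValueError("home time exceeds the time available")
    return full_income / unit_cost(p, w_home, a)


def gain_to_marriage(w_m, w_f, v_m=0.0, v_f=0.0, p=1.0, t=1.0, a=0.8):
    """G = Z_mf - (Z_m + Z_f) under the assumptions of this sketch."""
    full_m = w_m * t + v_m
    full_f = w_f * t + v_f
    z_m = household_output(full_m, p, w_m, t, a)
    z_f = household_output(full_f, p, w_f, t, a)
    # With perfectly substitutable home time, a couple uses the time of
    # whichever partner has the lower market wage.
    z_mf = household_output(full_m + full_f, p, min(w_m, w_f), t, a)
    return z_mf - (z_m + z_f)


print(gain_to_marriage(10.0, 10.0))  # equal wages
print(gain_to_marriage(12.0, 8.0))   # moderate wage gap
print(gain_to_marriage(14.0, 6.0))   # wider wage gap, same total wage
```

Under these assumptions the gain is zero when the two wages are equal and grows as the wage gap widens with the wage total held fixed, in line with the argument in the text. In this particular technology the cost elasticity with respect to the wage is 1 - a, which is less than one, so it cannot produce the negative net effect of equiproportionate wage increases that the text allows for.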


⁶ This approach is equivalent to a comparative statics analysis of the market-supply
curves, since an increase in the gain to marriage shifts both curves to the right in figure 1.



The manner in which education affects the household production
function is not obvious, so the prediction of this factor’s effects on the
gains to marriage is unclear. For example, consider the following two
ways of looking at education: first as factor-neutral technological change
and then as factor-augmenting. In the first case, education uniformly
reduces the amount of each input required to produce units of Z, that is,

$$Z^{mf} = \frac{(W_m + W_f)T + V_m + V_f}{A_m(E_m)\,A_f(E_f)\,c^{mf}(P, W_m, W_f)},$$

and so on, where E stands for education and the A’s are functions whose
first derivatives are negative. It can be shown that the gain to marriage is
positively related to increases in education in this case. If treated as
factor-augmenting technical change, education improves the efficiency of the
time input independently, that is,


$$Z^{mf} = \frac{(W_m + W_f)T + V_m + V_f}{c^{mf}(P, B_m(E_m)W_m, B_f(E_f)W_f)},$$

where $B_m$ and $B_f$ are functions similar to the A’s. Special conditions must
hold to obtain the same results as above. If education raises the
nonmarket productivity relative to the market productivity of the mate
specializing in household production, or the relative market productivity
of the market specialist, then it increases the gains to marriage. However,
either of these possibilities may prove difficult or impossible to distinguish
empirically.

Until this point, an implicit assumption was made that perfect
information and certainty held for persons in the marriage market. Clearly,
this is not the case. The high divorce rate of persons married 2 years or
less is evidence that uncertainty plays an important role. This divorce
behavior may be interpreted as the correction of mistakes. Then a
lowering of the cost of correcting a mistake (the difficulty of divorce)
should induce people to marry more readily. The effect of the cost of
divorce on the stock of marriages, however, is uncertain since a fall in the
cost of divorce also encourages the dissolution of existing marriages. The
former is likely to dominate the behavior of the young, while the latter is
more relevant for older couples.


III. Empirical Implementation

Three sets of data, namely state, county, and Standard Metropolitan
Statistical Area (SMSA) aggregates, are used to evaluate the model’s
predictions. Since no single set of data is completely satisfactory, this
test rests on evidence accumulated from several sources. The most complete
data are at the state level, but the variables are highly collinear.
Heterogeneity among the observations is increased by using a random sample
of United States counties, but the federal census compilations are
insufficiently detailed, especially with regard to age-specific and female
wage data.

The method used is to estimate an ordinary least-squares regression
with a measure of the proportion of females married (FEMMAR) as a
dependent variable and the relative wage (RELWG), the ratio of the
sexes (SXRT), the cost of divorce (DCOST), and a measure of income as
independent variables.⁷ All observations are weighted by the appropriate
number of females. For the states and SMSAs, equations are estimated for
each of the 5-year age cohorts between ages 15 and 39. The actual
variables used are described below and listed in table 1.
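
The estimation method described here, least squares with each observation weighted by its number of females, can be sketched as follows. This is an illustration under stated assumptions rather than the paper's procedure: the observations and the "true" coefficients are made up, only RELWG and SXRT are included (DCOST and income are omitted for brevity), and the function name is hypothetical. The weighted normal equations are solved directly by Gauss-Jordan elimination.

```python
def weighted_least_squares(rows, y, weights):
    """Minimize sum_i w_i * (y_i - x_i . b)**2 via the normal equations."""
    k = len(rows[0])
    # Build X'WX and X'Wy.
    xtwx = [[sum(w * r[i] * r[j] for r, w in zip(rows, weights))
             for j in range(k)] for i in range(k)]
    xtwy = [sum(w * r[i] * yi for r, yi, w in zip(rows, y, weights))
            for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting on [X'WX | X'Wy].
    aug = [xtwx[i] + [xtwy[i]] for i in range(k)]
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(k):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [aug[i][k] / aug[i][i] for i in range(k)]


# Hypothetical observations: (RELWG, SXRT, number of females).
observations = [
    (1.5, 0.90, 100),
    (2.0, 1.00, 250),
    (1.8, 1.10, 80),
    (2.2, 0.95, 400),
    (1.6, 1.05, 120),
]
true_coefficients = [0.2, 0.1, 0.3]  # made up; not estimates from the paper

rows = [[1.0, relwg, sxrt] for relwg, sxrt, _ in observations]
weights = [n for _, _, n in observations]
# Noise-free FEMMAR values generated from the made-up coefficients.
femmar = [sum(b * x for b, x in zip(true_coefficients, row)) for row in rows]

estimates = weighted_least_squares(rows, femmar, weights)
print([round(b, 6) for b in estimates])
```

Because the illustrative FEMMAR values are generated without noise, the weighted fit recovers the made-up intercept and slopes; with real data the weights would shift the estimates toward the larger units.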

1. FEMMAR. The proportion of females married is the number of
women married, with spouse present, divided by the total number of
women. There is a small error due to the exclusion of married women
having a spouse absent for reasons other than separation, but this is
insignificant. More important, however, is the treatment of widows. On
the average, males die earlier than females (especially married males), so
there will be some widows who should be considered “involuntarily”
single. Investigating a revised figure for females married, one that
included widows, showed little difference in the results, so this factor is
ignored.

2. RELWG. Three alternative measures of male and female wage rates
are available. The best is the annual earnings of full-time wage and salary
workers as estimated by the Social Security Administration. These
figures represent the marginal (and here average) opportunity cost of
time to a prospective labor-market participant. It is likely that these values
approximate the permanent wage rates for workers by state. The relative
wage is simply male earnings divided by female earnings. The mnemonic
for the Social Security wage is RELWGSS. A second measure of the
full-time wage is the United States Census figure for median earnings of
persons working 50-52 weeks in the previous year. Since this is available
for both states and SMSAs, direct comparisons between the results are
feasible. The symbol for a measure of the male-female relative wage based
on census data is RELWGCN. Finally, it is most unfortunate that neither
of these measures is compiled at the county level. The only sex-specific
earnings data are the median annual earnings of persons. This value is
quite sensitive to labor-force participation rates, so it is a biased measure
for the purpose of explaining marriage. Suppose the full-time relative
wage is high; then the gains to marriage are highest with husbands
specializing in market labor and wives in household production. This
implies a lower female labor-force participation rate and lower median
annual female earnings. Therefore, if the model is correct, the observed
annual earnings of females will be negatively correlated with the
proportion of females married. With this caveat, a relative wage based on
annual earnings (RELWGAE) is used for the counties.

⁷ Education cannot be investigated at this time. In aggregate data, the relationship
between income and education is very close, so multicollinearity is a severe problem.
Later, the 1970 Public Use Samples of Basic Records will be available from the Bureau of
the Census for further empirical work.

Notes to table 1:

* Age-specific data.

† $\mathrm{AGE} = \sum_{k=1}^{20} w(k)\,NF(k) \Big/ \sum_{k=1}^{20} NF(k)$,
where NF(k) is the number of females in the kth age group,
k = 1, 2, ..., 20 for ages 0-4, 5-9, ..., 95-99.

 k    w(FEMMAR)   w(BIRRATE)
 1       0            0
 2       0            0
 3       0.2          0.8
 4      15.7         89.1
 5      69.5        258.1
 6      86.2        197.4
 7      88.7        112.7
 8      88.2         56.2
 9      85.9          0.9
10      82.5          0
11      77.0          0
12      69.9          0
13      61.4          0
14      51.6          0
15      65.0          0
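
The AGE index defined in the notes to table 1 is a weighted average of age-specific weights, with the number of females in each age group as the weighting. A minimal sketch of that calculation follows; the function name and the three-group toy data are hypothetical and are not the weights listed in the table.

```python
def age_index(weights, females_by_age):
    """Return sum_k w(k) * NF(k) / sum_k NF(k) for matching age groups."""
    if len(weights) != len(females_by_age):
        raise ValueError("need one weight per age group")
    total_females = sum(females_by_age)
    weighted_sum = sum(w * n for w, n in zip(weights, females_by_age))
    return weighted_sum / total_females


# Toy example with three age groups (illustrative numbers only).
toy_weights = [0.0, 50.0, 100.0]
toy_females = [10, 20, 10]
print(age_index(toy_weights, toy_females))  # 50.0
```

For a county, the inputs would be the 20 age-group weights and the county's count of females in each group, so that the index reflects the age composition of its female population.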

The preceding analysis brings to light an interesting question about the
interpretation of observed wage differentials and also their relation to
education. If formal schooling primarily acts to raise market-earning
potential, then the measured effects of education, holding the market wage
constant, must be picking up something else, that is, on-the-job training.
Suppose husbands and wives specialize as predicted. For a time men are
receiving on-the-job training which raises their market age-earnings
profile above that which would hold in the absence of such training. At
the same time, their wives would experience depreciation of their stocks
of market-specific human capital (but an increase in the nonmarket
component) which lowers their permanent market wage. Therefore, even
the full-time wage is somewhat biased as a predictor of marital behavior.⁸
It is true that a “pseudo wage” estimated from the demand side of the
market would be more appropriate, but this approach will not be followed
at present.

3. SXRT. The ratio of the sexes is simply the number of males in each
age group divided by the number of females. For the counties it is the
number of males 15 years and older divided by the equivalent number of
females. A minor concern with this definition is that males tend to marry
females a few years younger than themselves. Redefining the sex ratio to
take account of this does not affect the results appreciably.

4. DCOST. The measure of the cost of divorce is not a quantitative
variable. It is an index of the ease of divorce legislation as compiled
through questionnaires and other data in 1959.⁹ However, an attempt
was made to achieve cardinality, so the actual values used are probably
at least highly correlated with the true values. Note that it is an index of
the ease or permissiveness of the law and represents the inverse of the cost
of divorce. Table 2 lists the values used. Religion may also be a relevant
psychic cost of divorce for Catholic couples, so a set of regressions
including the percentage of the population Catholic (CATH) in place of
DCOST is estimated for the states.

5. INCOME. The appropriate income measure is full family income,
including labor and nonlabor components. The census value, median
family income, is subject to the same measurement problems mentioned
above for the female wage and is, therefore, unacceptable. An alternative,

⁸ This may be added to Gronau’s list of applications (1972) in which observed female
wages are biased predictors of behavior.

⁹ See Rheinstein (1972) and Broel-Plateris (1961) for details.





TABLE 2

Classification of States by the Easiness of Their Divorce Laws, 1959

Category and State        Easiness Score

Very permissive:
  Nevada                        95
  Florida                       90
  Arkansas                      85
  Idaho                         81
  New Mexico                    77
  Alabama                       76
  California                    74
  Wyoming                       72

Permissive:
  Kentucky                      69
  Georgia                       67
  Arizona                       66
  Oklahoma                      64
  Utah                          63
  Washington                    62
  Mississippi                   60
  South Dakota                  59
  Oregon                        59
  Kansas                        58
  Indiana                       57

Medium:
  Montana                       55
  Tennessee                     54
  Colorado                      53
  Missouri                      52
  Texas                         50
  Iowa                          49
  Nebraska                      49
  Ohio                          48
  North Dakota                  46
  Delaware                      45
  Vermont                       44

Strict:
  Maryland                      42
  Minnesota                     41
  West Virginia                 40
  Maine                         39
  Illinois                      37
  Louisiana                     36
  Virginia                      34
  Wisconsin                     33
  New Hampshire                 32
  Michigan                      29
  Dist. of Columbia             28

Very strict:
  North Carolina                25
  Pennsylvania                  22
  Rhode Island                  19
  Connecticut                   16
  New Jersey                    13
  Massachusetts                  9
  New York                       4
  South Carolina                 1

Source: Broel-Plateris 1961, table 12, p. 149.


on the expenditure rather than the receipt side of family accounts, has
been suggested (Margaret Reid 1962). This is the median value of
owner-occupied housing (HSEVAL). Since a large proportion of urban
dwelling units are not occupied by owners, the median monthly gross
rental (RENT) will also be used for the SMSAs. There are two reasons
for using these measures. First, the observed positive income elasticity of
demand for housing guarantees that a family’s housing expenditure is
positively correlated with income, so HSEVAL and RENT will be
unbiased surrogate measures. Second, these expenditures are not sensitive
to the transitory component of income.¹⁰


IV. Results

There is a problem interpreting the regression results reported below. The
FEMMAR is a limited dependent variable, that is, a proportion bounded
by zero and one (0-100 in percentages). This complicates the evaluation
of elasticities but not the estimation technique, since no estimated values

¹⁰ There may be a bias due to a price component.





of the dependent variable violated these boundary constraints.¹¹ Consider
a given percentage change in a bounded variable. If the variable is
already near its upper bound, then a small change is also a large change
relative to the distance to the upper bound. For example, suppose
FEMMAR is 90. A 1 percent increase to 90.9 is the same as a 9 percent
decrease in the distance between the initial value and the upper bound.
Therefore, elasticities for the older age groups, those with an average
proportion of married women greater than 80 percent, should be interpreted
with this in mind.
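
The arithmetic of this example can be reproduced directly. The sketch below is illustrative only: the function name and the default upper bound of 100 are assumptions made for the example and are not part of the original analysis.

```python
def change_in_distance_to_bound(value, pct_increase, upper=100.0):
    """Return the new value and the percentage fall in the gap to the upper bound.

    value        -- initial level of the bounded variable (e.g., FEMMAR in percent)
    pct_increase -- percentage increase applied to the variable itself
    upper        -- upper bound of the variable (assumed to be 100 here)
    """
    new_value = value * (1.0 + pct_increase / 100.0)
    old_gap = upper - value
    new_gap = upper - new_value
    pct_gap_decrease = 100.0 * (old_gap - new_gap) / old_gap
    return new_value, pct_gap_decrease


# The case in the text: FEMMAR = 90 and a 1 percent increase.
new_value, gap_decrease = change_in_distance_to_bound(90.0, 1.0)
print(new_value, gap_decrease)
```

The same 1 percent increase starting from a value of 50 would close only 1 percent of the gap, which is why the elasticities for the older age groups call for more careful reading.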

Tables 3 and 4 are the ordinary least-squares results from the state and
SMSA data, respectively. Two alternative measures of relative wages
are shown for the states and two of income for the SMSAs. Results for
one set of equations from each data source are summarized in elasticity
form in table 5. The explanatory power of the four independent variables
is quite good, with R² > .5 in all but one case and with the vector of
estimated coefficients significantly different from zero at the 5 percent
level of confidence in every case. However, there are a number of parameter
estimates with large standard errors; this is not unusual, given the
level of aggregation employed here. Two approaches to this problem will
be described later.
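
An elasticity evaluated at the sample means is conventionally the regression coefficient scaled by the ratio of the mean of the regressor to the mean of the dependent variable. The sketch below assumes that convention; the coefficient and the two means are hypothetical values chosen for illustration and are not taken from the tables.

```python
def elasticity_at_means(coef, mean_x, mean_y):
    """Elasticity of y with respect to x at the sample means: b * (x_bar / y_bar)."""
    return coef * mean_x / mean_y


# Hypothetical inputs: a sex-ratio coefficient of 12.0, a mean sex ratio of 0.98,
# and a mean proportion married of 70 percent.
example_elasticity = elasticity_at_means(coef=12.0, mean_x=0.98, mean_y=70.0)
print(round(example_elasticity, 3))
```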

The data support the hypothesis that the proportion of females married
is positively related to the ratio of the sexes. Only one coefficient is not
significantly greater than zero at the 5 percent level.¹² Evidence on the
effect of the cost of divorce is that the incidence of marriage rises as
divorce becomes easier. Table 6 contains the regression results for a
replication of one set of the state equations using the proportion of the
population Catholic (CATH) in place of DCOST. Here a negative sign
on CATH is equivalent to a positive one on DCOST, so the coefficients
(all significant) are consistent with theory.¹³ Note that for all five sets of
regressions (state, SMSA, and state with CATH) the elasticity of marriage
with respect to the cost of divorce declines monotonically with age. The
decline is greater than that which is generated by the effect discussed at
the beginning of this section. This indicates that of the two effects
mentioned earlier, encouraging new marriages and lowering the cost of
dissolving existing ones, the former is more relevant for young couples.
The influence of income is puzzling. Where significant, the coefficients of
both RENT and HSEVAL are negative. This contradicts Silver’s finding
(1965) that the flow of new marriages (i.e., the marriage rate) conforms
to the business cycle. Of course, if the income elasticity of the divorce

¹¹ Elsewhere I have used the logit transformation in this context (1972).

¹² Of course, one coefficient out of 20 will randomly fail the 5 percent t-test when the
null hypothesis is true.

¹³ Multicollinearity prohibits including CATH along with DCOST and HSEVAL in
the same equation.




[Tables 3 and 4: ordinary least-squares results for the state and SMSA data. The table bodies are not recoverable from the source.]








TABLE 5

Summary of Regression Results

Elasticities at Means

Independent                              Age
Variables             15-19    20-24    25-29    30-34    35-39

State Data
RELWGCN, RELWGSS*     -0.33     0.12     0.15     0.07     0.08
SXRT                   1.40     0.20     0.52     0.70     0.73
DCOST                  0.28     0.09     0.013    0.008    0.005
HSEVAL                -0.38    -0.09    -0.11    -0.08    -0.04
RENT                    ...      ...      ...      ...      ...

SMSA Data
RELWGCN               -0.005    0.24     0.22     0.09     0.05
RELWGSS                 ...      ...      ...      ...      ...
SXRT                   0.63     0.15     0.33     0.38     0.59
DCOST                  0.41     0.12     0.05     0.012   -0.007
HSEVAL                  ...      ...      ...      ...      ...
RENT                  -0.64     0.01    -0.03    -0.06    -0.05

* The source shows a single row of values for the two state relative-wage
labels; which of the two measures it belongs to cannot be read from the source.

TABLE 6

Regression of FEMMAR, 1960 State Data

Independent                              Age
Variable          15-19     20-24     25-29     30-34     35-39

RELWGSS            6.90     19.0       9.59      4.34      3.82
                  (2.36)    (4.67)    (3.94)    (2.43)    (2.73)
SXRT              46.2      32.3      49.7      56.3      52.3
                  (4.85)    (4.79)    (5.34)    (7.14)    (7.91)
CATH              -0.191    -0.303    -0.095    -0.065    -0.048
                  (4.47)    (5.02)    (2.41)    (2.35)    (2.24)
HSEVAL            -0.110     0.096    -0.629    -0.497    -0.307
                  (0.62)    (0.38)    (3.76)    (4.15)    (3.43)
CONSTANT         -38.8       5.31     28.5      32.8      36.1
                  (2.89)    (0.42)    (2.93)    (4.59)    (5.97)
R²                  .70       .64       .64       .70       .73
F(4, 43)          25.3      18.7      17.8      26.3      29.9

Note: t-values in parentheses.


rate exceeds that of the marriage rate, then the cross-sectional and
time-series results are consistent.
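
The F-statistics reported with the table 6 regressions can be cross-checked against the reported R² values. The sketch below assumes the standard relation F = (R²/k) / ((1 - R²)/df) for a regression with k slope coefficients and df residual degrees of freedom; the R² and F values are those printed in table 6 as read from the source, and the function name is an illustrative assumption.

```python
def f_from_r2(r2, k, resid_df):
    """Overall F-statistic implied by R-squared, k slopes, and residual df."""
    return (r2 / k) / ((1.0 - r2) / resid_df)


# Table 6 as read from the source: five age groups, F(4, 43).
r2_reported = [0.70, 0.64, 0.64, 0.70, 0.73]
f_reported = [25.3, 18.7, 17.8, 26.3, 29.9]

f_implied = [f_from_r2(r2, k=4, resid_df=43) for r2 in r2_reported]
for r2, f_rep, f_imp in zip(r2_reported, f_reported, f_implied):
    print(f"R2={r2:.2f}  reported F={f_rep:5.1f}  implied F={f_imp:5.1f}")
```

Because R² is printed to only two decimals, the implied and reported values should agree only approximately.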

A strong theoretical result is that the potential gain to marriage and,
therefore, the proportion of women married, is positively related to the
sex-specific relative wage. This is confirmed for persons age 25 and older
but not for the young. Suppose that the benefits to marriage as perceived
by teenagers differ from those of adults. Then the gains to specialization
may not be relevant although other market factors (sex ratio and cost of






divorce) are. If so, then as these people mature, many of the “irrational”
marriages will dissolve, leaving in the stock of married persons only those
enjoying gains to specialization. While ad hoc, this argument is plausible
given the substantially higher divorce rate among couples married as
teenagers.¹⁴ Also, suppose the costs of marriage for teenagers differ from
those for adults. This is obviously the case, since the age of consent is 21
for males and 18 for females in most states. Then a measure of the cost of
marriage as well as of the cost of divorce is necessary in the regressions
for the group of persons ages 15-19. Unfortunately, no such measure is
readily available, but the states with a substantial Catholic population
seem to have the strictest marriage laws.¹⁵ If the results for persons ages
15-19 in table 6 are interpreted with CATH viewed as a measure both of
the cost of divorce and the cost of marriage, the relative wage effect is
substantiated.

The evidence presented so far would have been more convincing had
the estimates been more accurate. Imprecise parameters may be improved
through the application of additional sample information. Here two
techniques are employed to make better use of the data available. First,
it is obvious that the division of the population into 5-year age cohorts is
arbitrary. Therefore, one would expect the behavior of adjacent age
groups to be similar. For the regression model used here, the correlation
of the residuals of adjacent age groups is very high. There is information
in the set of residual correlations which can be used to obtain more
efficient estimates of the coefficients of all five equations through the
seemingly unrelated regressions technique (Zellner 1962). The results for
the state data (table 7) are as expected. The standard errors of the
coefficient estimates are improved, especially those for DCOST. Since
there are no qualitative changes in the coefficients, the discussion of the
ordinary least-squares results need not be amended.
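
The seemingly unrelated regressions technique can be sketched as a two-step feasible generalized least-squares calculation: residuals from equation-by-equation least squares give an estimate of the cross-equation covariance, which then weights a joint estimate of the stacked system. The code below is a minimal illustration under stated assumptions, not the computation behind table 7: the function name, the synthetic two-equation data, and the choice to divide the residual cross-products by n are all choices made for the example.

```python
import numpy as np


def sur_fgls(X_list, y_list):
    """Two-step feasible GLS for a system of equations sharing one sample.

    X_list -- one (n, k_i) regressor matrix per equation
    y_list -- one (n,) dependent-variable vector per equation
    Returns the stacked coefficients, their estimated covariance matrix,
    and the estimated cross-equation residual covariance.
    """
    n = y_list[0].shape[0]
    m = len(y_list)

    # Step 1: equation-by-equation OLS residuals.
    resid = np.column_stack([
        y - X @ np.linalg.lstsq(X, y, rcond=None)[0]
        for X, y in zip(X_list, y_list)
    ])
    sigma = resid.T @ resid / n  # (m, m) covariance across equations

    # Step 2: stack the system equation by equation (block-diagonal X).
    widths = [X.shape[1] for X in X_list]
    X_big = np.zeros((m * n, sum(widths)))
    col = 0
    for i, X in enumerate(X_list):
        X_big[i * n:(i + 1) * n, col:col + widths[i]] = X
        col += widths[i]
    y_big = np.concatenate(y_list)

    # With this stacking the disturbance covariance is kron(sigma, I_n).
    omega_inv = np.kron(np.linalg.inv(sigma), np.eye(n))
    xt_oinv = X_big.T @ omega_inv
    cov = np.linalg.inv(xt_oinv @ X_big)
    beta = cov @ (xt_oinv @ y_big)
    return beta, cov, sigma


# Synthetic illustration: two equations whose disturbances are highly correlated.
rng = np.random.default_rng(0)
n_obs = 48
x_a = rng.normal(size=n_obs)
x_b = rng.normal(size=n_obs)
errors = rng.multivariate_normal([0.0, 0.0], [[1.0, 0.9], [0.9, 1.0]], size=n_obs)
X_a = np.column_stack([np.ones(n_obs), x_a])
X_b = np.column_stack([np.ones(n_obs), x_b])
y_a = 1.0 + 2.0 * x_a + errors[:, 0]
y_b = -1.0 + 0.5 * x_b + errors[:, 1]

beta_sur, cov_sur, sigma_hat = sur_fgls([X_a, X_b], [y_a, y_b])
print("coefficients:", beta_sur)
print("standard errors:", np.sqrt(np.diag(cov_sur)))
```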

A second way to improve the information content of the sample is to
expand the number of observations. Roughly a tenfold increase in sample
size occurs through use of a random sample of 530 United States
counties.¹⁶ Unfortunately, no age-specific marriage data are compiled
by the United States Bureau of the Census at this level, so the analysis
must be restricted to a single equation aggregated over age. To standardize
the age structure of the county populations, an index (AGE) was constructed
which has the value of FEMMAR if the proportion of women

¹⁴ Education must be the next factor to be examined in this area. It is likely that
something related to schooling, such as differential human capital investments or the
more efficient college marriage market, explains this poor result.

¹⁵ This is based on the casual observation of age of consent, medical requirements,
and waiting periods by state. Note that the two non-Catholic states with strict divorce
laws, North and South Carolina, seem to have very liberal marriage laws.

¹⁶ I wish to acknowledge the contribution of Donald Bogue in the preparation of these
data.







TABLE 8

Regression of FEMMAR (Women, 15 Years and Older),
1960 State and County Data

Independent        State        County       Elasticities from
Variable           Data         Data         County Data

RELWGAE             ...          2.93            0.091
                                (7.37)
RELWGSS            4.08          ...             ...
                  (2.57)
SXRT              55.89         11.22            0.165
                  (5.67)        (4.66)
DCOST              0.022         0.041           0.026
                  (2.35)        (6.56)
HSEVAL            -0.220        -0.315          -0.061
                  (1.86)        (6.65)
AGE                0.548         1.[?]6          ...
                  (1.99)       (12.38)
AGRI               0.005         0.092           0.010
                  (0.09)        (5.25)
NONWT              0.028        -0.117          -0.020
                  (0.93)        (9.69)
CONSTANT         -28.23        -34.19            ...
                  (1.26)        (5.28)
R²                  .72           .52
F                 12.66         79.56
(df)              (8,39)        (7,520)

Note: t-values in parentheses. [?] marks a digit that is illegible in the source.


married in the county in each group was identical to the national
average (see table 1). Counties differ in many other ways which may
influence marital behavior, but only two factors, the importance of
agriculture (AGRI) and the relative size of the nonwhite population
(NONWT), will be included. The regression results for the county data
are in table 8. For comparison, a similar regression for age-aggregated
state data is included. The results are consistent with the predictions of
theory and those reported above. As expected, the parameter estimates
are much more accurate. An interesting finding is that, ceteris paribus,
nonwhites have a lower propensity to marry. Of course, everything is
not being held constant, since what is being measured here is the effect of
the lower male/female relative wage that is known to exist among blacks.¹⁷
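
The AGE index described above is a population-weighted average of national age-specific rates, AGE = Σ w(k) NF(k) / Σ NF(k), so that it equals the value a county would show if each of its age groups behaved like the nation as a whole. A minimal sketch follows; the three weights are the table 1 values for ages 15-19, 20-24, and 25-29 as read from the source, while the county female counts and the function name are hypothetical. The index in the text sums over all 20 age groups, not three.

```python
def age_index(weights, female_counts):
    """Weighted average of national age-specific rates w(k), weighted by NF(k)."""
    if len(weights) != len(female_counts):
        raise ValueError("weights and female_counts must have the same length")
    total_females = sum(female_counts)
    weighted_sum = sum(w * nf for w, nf in zip(weights, female_counts))
    return weighted_sum / total_females


# National percent married for ages 15-19, 20-24, 25-29 (table 1, as read).
national_weights = [15.7, 69.5, 86.2]
# Hypothetical numbers of females in those age groups in one county.
county_females = [1000, 800, 700]

county_age_index = age_index(national_weights, county_females)
print(round(county_age_index, 3))
```

A county with a younger population receives a lower index, which is how the index standardizes for age structure in the single aggregated equation.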


¹⁷ An analysis of SMSA data disaggregated by race is in progress. The county data were
also used for a preliminary analysis of education. In the South, the education of women
has by tradition been oriented toward household production. Then higher female
education with the schooling of males held constant should increase the gains to marriage
in the southern states. The following results were obtained by including median male
and female schooling completed (MALSCH and FEMSCH) in the equation for
FEMMAR. The absolute value of the t-statistic is in parentheses:

\[
\begin{aligned}
\mathrm{FEMMAR} ={}& \underset{(1.11)}{-6.8} + \underset{(4.98)}{1.8}\,\mathrm{RELWGAE} + \underset{(4.91)}{9.4}\,\mathrm{SXRT} + \underset{(4.81)}{0.04}\,\mathrm{DCOST} \\
&- \underset{(5.69)}{0.62}\,\mathrm{HSEVAL} + \underset{(5.37)}{0.09}\,\mathrm{AGRI} - \underset{(8.67)}{0.11}\,\mathrm{NONWT} + \underset{(7.73)}{0.74}\,\mathrm{AGE} \\
&- \underset{(0.26)}{0.13}\,\mathrm{MALSCH} + \underset{(4.52)}{1.3}\,\mathrm{FEMSCH}, \qquad R^2 = .72,\quad F(9, 238) = 66.7 .
\end{aligned}
\]







TABLE 9

Regressions of FEMMAR and BIRRATE,
1960 County Data, 2SLS (Women, 15 Years and Older)

Explanatory              Dependent Variable
Variables              FEMMAR          BIRRATE

FEMMAR                   ...             0.33
                                        (0.15)
                                        [0.335]
BIRRATE                -0.009            ...
                       (0.02)
                       [0.009]
RELWGAE                 3.03             ...
                       (0.45)
                       [0.094]
SXRT                   11.5              ...
                       (2.5)
                       [0.167]
DCOST                   0.04             ...
                       (0.006)
                       [0.026]
HSEVAL                  [?]              1.1
                       (0.04)           (0.2)
                      [-0.062]          [0.221]
AGE                     1.33             1.03
                       (0.12)           (0.05)
AGRI                    0.095            0.063
                       (0.02)           (0.041)
                       [0.011]          [0.007]
NONWT                  -0.11             0.09
                       (0.01)           (0.04)
                      [-0.020]          [0.016]
FEMEARN                  ...            -70
                                        (10)
                                       [-0.263]
FEMSCH                   ...            -1.15
                                        (0.39)
                                       [-0.188]
CONSTANT              -35.89            -82
                       (7.46)           (9.5)
F                      16.2             11.7
(df)                   (8,2)            (7,3)
[?]                     006              006

Note: Standard error in parentheses. Elasticity at mean in brackets.
[?] marks an entry that is illegible in the source; the (df) entries and the last row are reproduced as they appear there.


The county data offer an opportunity to investigate another proposition
mentioned by Becker. Suppose that a major component of family utility
is the consumption of own child services.¹⁸ Then a direction of causation
running from marriage to fertility may not be appropriate. It is the
demand for children which is most relevant, and observed marital
behavior is partially a derived demand. Table 9 lists two-stage
least-squares estimates from a two-equation model treating births and marriage
simultaneously.¹⁹ Here FEMMAR is a function of the birth rate

¹⁸ This terminology arose in recent economic research on fertility. A compendium of
this work can be found in T. W. Schultz (1973).

¹⁹ The first attempt to deal with simultaneous family decisions empirically was by
Nerlove and Schultz (1970).




(BIRRATE) and vice versa.²⁰ The other explanatory variables in the
birth rate equation are those suggested by the new economic approach to
fertility. Clearly, the proportion of females married and, therefore, factors
influencing the marriage market, contributes significantly to the explanation
of observed fertility. On the other hand, there is a small elasticity of
the proportion of females married with respect to the birth rate. This can
be explained in two ways. First, child services may not, in fact, constitute
a dominant portion of couples’ consumption bundles. Alternatively, this
small elasticity may be a statistical anomaly caused by comparison of a
change in a stock to a change of a flow. Regressing the rate of new marriages
on the birth rate may prove more satisfactory. These results do, however,
suggest the fruitfulness of the simultaneous analysis of family decisions.
The decision to marry is only one of a set of interrelated family choices
that influence both the observed proportion of women married and the
variables chosen to explain this proportion. In a sense, the marriage
equation is partially a reduced form from a simultaneous system in which
all of the right-hand variables are the result of choices made in the past.
This possibility has already been introduced with regard to the relative
wage effect, and it applies to the other variables as well. The sex ratio,
for instance, is determined by migration decisions. If unmarried females
decide to leave a region (as is true for a rural population), then a greater
proportion of the females remaining are married, not simply because the
sex ratio is higher, but also because unmarried women have left. Much
more complete data are required, however, before the nature of these
reverse causations can be investigated.
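
The two-stage least-squares idea used for the two-equation model can be sketched as follows. In the first stage the endogenous right-hand variable is regressed on all exogenous variables in the system; in the second stage its fitted values replace it in the equation of interest. This is a minimal illustration on synthetic data under stated assumptions: the function name, the structural coefficients, and the variables x1 and x2 are hypothetical and do not correspond to the table 9 specification.

```python
import numpy as np


def two_stage_least_squares(y, endog, exog, instruments):
    """2SLS coefficients, ordered as [endogenous regressor(s), exogenous regressors].

    y           -- (n,) dependent variable of the equation being estimated
    endog       -- (n,) or (n, g) endogenous right-hand variable(s)
    exog        -- (n, k) included exogenous regressors (with a constant column)
    instruments -- (n,) or (n, q) exogenous variables excluded from this equation
    """
    Z = np.column_stack([exog, instruments])
    # First stage: project the endogenous regressor(s) on all exogenous variables.
    first_stage = np.linalg.lstsq(Z, endog, rcond=None)[0]
    endog_hat = Z @ first_stage
    # Second stage: OLS with fitted values in place of the endogenous regressor(s).
    X_hat = np.column_stack([endog_hat, exog])
    return np.linalg.lstsq(X_hat, y, rcond=None)[0]


# Synthetic simultaneous system (hypothetical coefficients):
#   y1 = 0.5 * y2 + 1.0 * x1 + u1
#   y2 = -0.3 * y1 + 2.0 * x2 + u2
rng = np.random.default_rng(0)
n_obs = 5000
x1 = rng.normal(size=n_obs)
x2 = rng.normal(size=n_obs)
u1 = rng.normal(size=n_obs)
u2 = rng.normal(size=n_obs)
y1 = (x1 + x2 + u1 + 0.5 * u2) / 1.15  # reduced form for y1
y2 = -0.3 * y1 + 2.0 * x2 + u2

exog_1 = np.column_stack([np.ones(n_obs), x1])
beta_2sls = two_stage_least_squares(y1, y2, exog_1, x2)
print("2SLS estimates [y2, constant, x1]:", beta_2sls)
```

In this sketch x2 identifies the first equation because it enters the second equation but is excluded from the first, which parallels the role of the fertility-specific variables in the birth rate equation.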


V. Conclusions 

Data from the 1960 census yield substantial empirical support for the
economic interpretation of the marriage market. Two factors, the ratio
of the sexes and the cost of divorce, influence observed marital behavior as
expected. This is also true of the gains to specialization, but the results are
somewhat tenuous for teenagers. The evidence also suggests the tentative
conclusion that long-term economic growth may result in more people
desiring to remain single. Finally, it is shown that the analysis of
simultaneous family decisions is empirically feasible.

²⁰ Means and standard deviations of these and all other variables used are given in
table 10.



TABLE 10

[Means and standard deviations of the variables. The table body is not recoverable from the source.]

Note: Standard deviation in parentheses.



Comment 


T. Dudley Wallace 

North Carolina State University


Freiden has been reasonably faithful to the Becker (1973) marriage model
in his attempt to estimate the reduced-form equation explaining variation
in the proportion of females married. His observational units are regional
subaggregates in cross-section data. Thus, the Freiden paper represents a
commendable attempt to evaluate the usefulness of the model.

The major problem arising in going to the data is in translating and
measuring full income. Full income is a precise concept in the theory but
is made up of two main components, one of which is ambiguous in its
effect upon marriage. The two components of full income are (1) nonlabor
income, and (2) full labor earnings if both male and female were completely
occupied in the market. Variation in the first component, nonlabor
income, is unambiguous. Marriage should vary directly with it. Variation
in “full wages” affects both implicit costs of time and full income and
therefore is ambiguous. Thus, full income, in sum, is ambiguous, and
Freiden chose a proxy for full income that enters his regressions as a
single variable.

In all his regressions, Freiden obtained a negative sign on his proxy for
full income, and in many cases the t-statistics indicate that the results are
not solely due to a lack of richness in the data.

Returning to the Becker theory, male and female wages and nonlabor
income all enter the reduced-form relationship explaining marriage.
Variational effects of wage ratios and nonlabor earnings are clear;
variation in the wage sum (full earnings) is not. Nevertheless, all three
enter and have separate effects. Therefore, it seems that Freiden should
have introduced an additional explanatory variable, namely, the sum of
male-female wages. According to the theory, the relative-wage results
should hold up in the presence of full earnings. If Freiden’s proxy for
full income is appropriate, its net effect, holding full earnings constant,
should be positive. Given the generality of the theory that guided the


empirical work, one cannot anticipate the sign on “full earnings” as
measured by the sum of the wages. Moreover, the data may not be rich
enough to yield significant results on relative wages, full earnings, and
full income. Thus, such results would be revealed by the data. It may also
be the case that less generality in the theory would further clarify the role
of full earnings in its relationship to marriage. For example, I investigated
a Cobb-Douglas production function with unit scale for the joint
household.¹

The main additional result gained by this specificity was the delineation
of the effect of a change in the sum of male and female wages
(full earnings), holding the wage ratio constant. For a Cobb-Douglas
household production function, an increase in full earnings due to
equiproportionate changes in the male and female wages reduces the
gain from the marriage whenever the ratio of full earnings to nonlabor
income is less than the ratio of labor cost to market-goods cost. Conversely,
when the source of full income from labor relative to nonlabor
income is larger than the relative share of labor-to-goods cost in production,
an equal increase in both wage sources increases the attractiveness
of marriage, given the Becker framework. Whether this type of inference,
or something similar, holds for more general production functions and
whether data can be found to test more specific hypotheses about the role
of full earnings are open to question.

Not considering product allocation within the household contributes to
a failure to sort out variables into cost-returns (supply-demand) boxes
that have been found so useful in most work in economics. The impression
left in both the Freiden paper and the Becker paper (1973) is that one
can say something about the incidence of marriage but, because of
symmetry, little can be said about such matters as the demand for wives
by males or the supply of wives. If one took the simple and obvious
expedient of assuming equal sharing of household production, the
demand for wives could be represented by the optimal joint-product
function less twice the single-male production, and the demand for
husbands by women could be similarly represented by joint production
minus twice the single-female production. These are the returns. On the
cost side there are costs of search, measured in some sense by the relevant
sex ratios, and costs of correcting mistakes, as reflected by ease of divorce.
Whether one could construct meaningful models that are identifiable in
this way, given the sorts of data which are available, is moot. If such

¹ For good reason, Becker wanted to avoid increasing returns to scale. Scale less than
or equal to unity leaves the burden of the theory to complementarity in production. As
Becker has stated (1973), Cobb-Douglas (C-D) implies zero productivity in the singles
cases. Our sense of description may thus be offended by using the C-D function, but if we
accept the notion that marriage is more attractive, the larger the difference in joint and
individual household production, the general theory is not crucially violated by zero
product in the households of single individuals.




constructions are possible, the link between model and data would be
clarified. As an example, in both the Freiden paper and in Becker (1973),
the sex ratio and the divorce-cost variables are brought in by ad hoc
argument rather than as an integral part of the theory. This makes for a
ceteris paribus problem in interpreting results. The equations that Freiden
fits with data are reduced forms of two sorts of considerations.

What follows are a few additional comments that may have marginal
value. (1) Certainly the existence of marriage predates progressive income
tax and joint returns. Nevertheless, levels of marriage incidence in modern
society should depend on tax structure. (2) Divorce costs probably vary
directly with the number of immature children; thus, the index used by
Freiden is not so appropriate for his higher-age groups. Divorce costs in
some states vary greatly, depending upon real assets. Also, lowering the
probability of dissolution of long-standing marriages is the effect of the
household equivalent of on-the-job training specific to the individual
household. One might expect considerable differences in the analysis of
survey data where one could obtain, for example, specific information
about household wealth, number of dependent children, or length of time
of current marriage. Survey data should also be amenable to resolution
of the importance of household productivity of the less intensive wage
earner. (3) The notion that the “likelihood” of marriage increases the
greater the difference in joint and individual household production is
appealing but somewhat lacking in rigor. The basic Becker theory is
completely deterministic. Therefore, in speaking of “likelihood,” one
leaps from the basic theory into an area still to be explored. Whether
beginning with a probabilistic framework is of practical value is not clear,
but it should be recognized that there is here a methodological problem
which is not usually present in microeconomic modeling.

Lest the critical tone of these comments be misleading, it is worth
saying again that carrying economic analysis into unexplored terrain is
exciting and of potentially great value. The Becker theory and the Freiden
empirics are steps into a new area for economists. The steps are worthwhile
and will not end here.



Benefits of Women's Education 
within Marriage 


Lee Benham 

University of Chicago


Studies of the returns to education have generally investigated the
relationship between individuals’ investments in formal education and
on-the-job training and their labor-market productivity. It is well
recognized, however, that other factors besides formal education and
training contribute to a person’s effective stock of human capital (and
hence to productivity); these factors include early childhood environment,
parents’ behavior (see Dugan 1969), and associations with other individuals.
There is general agreement, for example, that a child’s development
is affected by the ability and performance of peers in school. At a
later stage, a significant part of an individual’s college and graduate
education appears to result from association with fellow students, the more
able students contributing to the education of all. After formal schooling
is completed, close associates are likely to continue to affect an individual’s
further educational development and to influence the rate of depreciation
of the individual’s stock of knowledge.

One of the more persuasive explanations of the observed strong positive
relationship between formal education and labor-market productivity is
that, in addition to providing specific skills, formal education improves
the individual’s ability to acquire and assimilate information, to perceive
and understand changing conditions, and to respond effectively.¹ From

I thank Alexandra Benham for comments and substantial assistance throughout this
study; Gary Becker, Solomon Polachek, Robert Michael, and T. W. Schultz for helpful
comments; and Elayne Howard for analyzing the data. The investigation was partly
supported by PHS grant HS00080 from the National Center for Health Services Research
and Development.

¹ In his study of migration, Aba Schwartz (1968) found results consistent with the
hypothesis that one effect of education is to reduce the cost of obtaining information. The
responsiveness of individuals to lifetime-earnings differentials was found to increase
monotonically with education. Finis Welch (1970) distinguishes between the worker
effect and the allocative effect of education. He argues: “Much of the ‘leverage’ associated
with added schooling is drawn from the dynamic implications of changing technology.”
That is, education enables an individual to adapt more rapidly to changing conditions.


education is at least as strongly associated with husbands' earnings as is
their premarriage education.¹⁷

Another variant of the selective-mating hypothesis states that a wife’s
education is only a proxy for other background characteristics of both
spouses, such as intelligence and social class. Should these latter variables
be associated both with husband’s earnings and with wife’s own educational
attainment, then the association noted between the wife’s education
and the husband’s earnings may be spurious. This argument has its
counterpart in the recurring question of the extent to which an individual’s
education itself results in higher earnings versus the extent to
which education is merely a proxy for intelligence, social background,
and so forth.

Data on background variables were not available in the samples used
for tables 1-5. However, this issue has been examined in another context
by Duncan, Featherman, and Duncan (1972, p. 178) in a study of the
relationship between the occupational status of husbands and the background
characteristics (including education) of both spouses.¹⁸ A multiple
regression was computed of husband’s occupational status in 1956 on his
background characteristics, including his education, his occupational
status at marriage and his father’s occupational status, and his wife’s
background characteristics, including her education, her father’s occupational
status, her intelligence, and a psychological index of her drive to get
ahead. All the coefficients for wife’s characteristics except education were
less than two times their standard error. This is consistent with the
hypothesis that the wife’s education has a significant independent effect on the
husband’s career development.

These results, however, are not directly comparable to those presented
in tables 1-5, since occupational status rather than earnings is the
dependent variable. By using another data set developed by Blau and
Duncan (1967), it is possible to obtain more directly comparable results
by estimating the relationship between husband’s income in 1961 and
both spouses’ education, given several of the husband’s background

An alternative explanation concerning the strong relationship between wife’s
postmarriage education and husband’s earnings is that husbands with higher earnings
purchase more postmarriage education for their wives. To examine this question, a
sample was drawn of husbands of ages 20-30 having 12 or more years of school who
were employed full time during the week of the 1960 U.S. Census survey. For this group,
median income was calculated for husbands whose wives were in school and for those
whose wives were not in school. Only 2.4 percent of the men in this sample had a wife in
school, so results should be viewed with appropriate caution, but in all men’s education
categories except 12 years, the median income was lower for those men with wives in
school.

The data used for these estimates were for couples who had recently had a second
child at the time of the survey in 1957 (see Duncan et al. [1972, p. 34] for a more complete
description).




characteristics, including occupational status of his first job and his
father’s occupation and education.

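The following three specifications were used to estimate the impact of
including husband’s background characteristics using these data:

$$\mathrm{INCOME} = \alpha_1 + \beta_1\,\mathrm{EDH} + \beta_2\,\mathrm{EDW} + \mu,$$

$$\mathrm{INCOME} = \alpha_1 + \beta_1\,\mathrm{EDH} + \beta_2\,\mathrm{EDW} + \beta_3\,\mathrm{EDHF} + \beta_4\,\mathrm{OCHF} + \mu, \text{ and}$$

$$\mathrm{INCOME} = \alpha_1 + \beta_1\,\mathrm{EDH} + \beta_2\,\mathrm{EDW} + \beta_3\,\mathrm{EDHF} + \beta_4\,\mathrm{OCHF} + \beta_5\,\mathrm{OCHJ} + \mu,$$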

where INCOME = husband’s income in 1961, EDH = husband’s
education, EDW = wife’s education, EDHF = husband’s father’s
education, OCHF = husband’s father’s occupation, and OCHJ =
husband’s occupational status on his first job. Due to the nature of the
available data, these specifications differ from the earlier ones in this
paper in that income rather than ln earnings is the dependent variable
and husband’s work experience is not included. Nevertheless, they
provide qualitative estimates of the relative importance of husband’s
background characteristics.

Table 6 shows the relative impact of wife’s education as compared with
husband’s education on husband’s income ($\beta_2/\beta_1$) by age of wife. These
estimates are consistent with those presented earlier. There is a substantial
relationship between wife’s education and husband’s income in
most years. For wives of ages 22-26, husband’s income increases 37
percent as much with an increase in wife’s education as with a similar
increase in husband’s own education. This ratio increases to 80 percent
for wives ages 32-36 and then declines at older ages, a pattern similar to
that shown in table 3.
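The estimates in this section were calculated from correlation matrices rather than from individual records. As a minimal sketch of how that can be done, the following Python fragment recovers unstandardized slopes from a correlation matrix and standard deviations and then forms the ratio of the wife's to the husband's education coefficient. All numbers and the function name are hypothetical and chosen only for illustration; they are not the Blau and Duncan values.

```python
import numpy as np

def coefficients_from_correlations(r_xx, r_xy, sd_x, sd_y):
    """Unstandardized OLS slopes from correlations and standard deviations.

    Standardized slopes solve R_xx * beta_std = r_xy; each is then rescaled
    by sd_y / sd_x to return to the original units.
    """
    beta_std = np.linalg.solve(np.asarray(r_xx, float), np.asarray(r_xy, float))
    return beta_std * sd_y / np.asarray(sd_x, float)

# Hypothetical inputs; regressor order is (EDH, EDW).
r_xx = [[1.0, 0.6],
        [0.6, 1.0]]          # correlation between husband's and wife's education
r_xy = [0.40, 0.33]          # correlations of each regressor with INCOME
sd_x = [3.0, 2.5]            # standard deviations of EDH and EDW (years)
sd_y = 4000.0                # standard deviation of INCOME (dollars)

b = coefficients_from_correlations(r_xx, r_xy, sd_x, sd_y)
ratio = b[1] / b[0]          # analogue of beta_2 / beta_1 in the text
print(b, ratio)
```

Adding further regressors such as EDHF or OCHF only enlarges the correlation matrix; the same solve-and-rescale step applies.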

When husband’s father’s occupation and education are included, the
pattern is very similar. Except for the oldest group, these results are not
consistent with the hypothesis that the association between wife’s education

Data were collected from 20,700 respondents representing men in the U.S. civilian
noninstitutional population between the ages of 20 and 64 in March 1962 (see Duncan
et al. 1972, p. 32).

Professor Otis Dudley Duncan generously provided the correlation matrices which
he had developed for table 8.16 in Duncan et al. (1972, p. 240). The estimates shown here
were calculated from those matrices.

The consequences of excluding the husband’s work-experience variables are less
severe here since separate estimates are made for each of eight age-of-wife categories.

Inclusion of father’s education and occupation reduces the education coefficient for
both spouses. The ratio of the husband’s education coefficient when his father’s education
and occupation are included to the husband’s education coefficient when they are
excluded, by wife’s age, is as follows: wife’s age 22-26 (.91), 27-31 (.81), 32-36 (.80),
37-41 (.82), 42-46 (.87), 47-51 (.85), 52-56 (.85), and 57-61 (.80).



Comment 


Finis Welch 

City University of New York and National Bureau of Economic Research


I have been tempted to write a note on the social cost of the “marriage
go-round.” In a nutshell, the idea is that more productive marriage
partners are preferred and that, on balance, net productivity in marriage
is positively related to education. Increased education serves to increase
an individual’s chance of a good marriage, of marriage to a “more
educated” spouse. If an individual’s social product is independent of his
spouse’s education, then the marriage-go-round results in overinvestment
in education. Benham’s view that a person’s own productivity depends on
attributes of the spouse is a clear alternative.

My skepticism concerning Benham’s hypothesis that a husband’s
earning capacity is positively related to his wife’s schooling is not
qualitative but quantitative. How important are these (very likely
positive) cross-productivity effects? I was not surprised to hear of the
positive association between wife’s schooling and husband’s earnings. I
was surprised that most emphasis is on productivity instead of selective
mating.

Benham points out that, since his measures are of annual earnings, there
may be labor-supply effects submerged in the estimated relationship
between husband’s earnings and wife’s schooling. He notes (correctly)
that the direction of this effect is unclear on a priori grounds, but finds
empirically that an extra year of wife’s schooling has only a slightly
smaller percentage effect on husband’s weekly wage than on annual
earnings. In tabulations using a different body of data, I have verified
this result. I will briefly describe these computations.

I used the NBER-Thorndike sample of 4,519 white married males (spouse
present) who entered the U.S. Air Force in 1943 and who were examined
as navigators and pilots at that time. Earnings are for 1969. Experience is
the number of years since the individual was last in school. The geometric
mean of earnings is over $13,000, and logarithmic variance is 0.24.

1 Earnings for 1969 are deflated to the 1958 value so as to allow life-cycle comparisons.
In the regressions below, I did not rescale this variable.







The advantage of this sample is that data are available to permit
explicit consideration of the possible effects of selective mating. For
example, information on husbands includes (1) schooling, (2) a measure
of IQ, (3) father’s schooling and occupation, and (4) religion. It is clear
that the sample is rarefied, and it is consequently hard to judge how much
confidence should be placed in results contained within. For example,
essentially all respondents are white, completed high school, and on armed
forces intelligence tests scored in the upper one-half of the distribution.
On the other hand, in one important dimension, this sample is superior
to the national probability samples for isolating “pure” productivity
effects of wife’s schooling. It is likely that the World War II marriage
market functioned somewhat differently from that in normal times;
possibly a careful matching of family background was less important
during the war. If so, then in this sample, containing a disproportionately
large number of men who were married during the war, there should be
less “pollution” with covariance between wife’s schooling and husband’s
family background. For example, Benham, in an unpublished table,
shows that, in the U.S. Census sample, the zero-order correlation between
husband’s and wife’s schooling is 0.65. In the NBER-Thorndike sample
the correlation is 0.38.

It is true that this lower correlation in the NBER-Thorndike sample
could be the result of truncation in the husband’s schooling variable, but
it is important that if truncation reduces correlations it does not change
biases due to omitted variables. Yet, if the reduced correlation follows
from reduced covariance between wife’s schooling and omitted variables
such as husband’s IQ or family background, which may themselves affect
earnings, then the reduced correlation reduces coefficient bias. Table 1
reports regression results for this sample.

Regression 1 is equivalent to regressions Benham presents in table 1A,
and the results agree that there is a “significant” association between
wife’s schooling and husband’s earnings. There is a quantitative difference,
however: while a year of schooling for the husband has the same estimated
effect (6-7 percent) on earnings, the coefficient for wife’s schooling is less
than one-half as high in the NBER-Thorndike sample as in the samples
used by Benham. Regression 2 includes the length-of-marriage variables
à la Benham’s table 3. Here no evidence of a statistically significant
relationship exists. The computed F-statistic (2, 4,512 degrees of freedom)
is only 1.16. In any case, it is hard to interpret Benham’s results vis-à-vis
length of marriage, for the marriage can itself be considered as a short
“period of investment.”

Regression 3 introduces a decomposition of the wife’s schooling variable
into two parts. The first part includes those components of a wife’s

Roughly 17.5 percent of the NBER-Thorndike sample of men who had wives in 1969
had married those particular spouses between 1941 and 1944. Of this group of men,
8 percent were married in 1944.





schooling that are correlated with (and presumably would be predicted
by) her husband’s IQ, religion, and his father’s schooling and occupation.
The intention is simply that these variables would be among the most
obvious predictors of wife’s schooling, based on a sociological “network”
or an assortive-mating frame of reference. The second part of the wife’s
education is that which is orthogonal to the above list and is simply the
residuals between observed schooling and schooling predicted by
regression 4. It is this part of a wife’s schooling that is independent of her
husband’s background, religion, and measured IQ which is the obvious
candidate for the “pure” productivity effect posited by Benham. But in
these data there is little independent role for wife’s education. Most of the
effect is channeled through the independent variables of regression 4.

To summarize these results, note first that the effect of a year of a wife’s
schooling is 22 percent of the estimated effect of her husband’s schooling.
Further, after “partialing out” effects of a husband’s IQ and background
on his wife’s schooling, the net effect of the wife’s schooling is reduced to
slightly less than half its original level and the coefficient estimate is of
marginal statistical significance. The estimated net effect of a wife’s
schooling is roughly one-tenth of the estimated effect of her husband’s
schooling and is only one-fifth of Benham’s estimate.
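The two-part decomposition of wife's schooling can be sketched in a few lines of Python. This is only an illustration on simulated data under assumed coefficients: the variable names, the data-generating process, and the exact set of regressors are hypothetical and are not those of the NBER-Thorndike regressions. Wife's schooling is first regressed on the husband's background variables; the fitted values and the residuals then enter the earnings regression as separate regressors.

```python
import numpy as np

def ols(y, X):
    """OLS with an intercept; returns (coefficients, fitted values)."""
    Xc = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(Xc, y, rcond=None)
    return coef, Xc @ coef

# Simulated data (hypothetical, for illustration only).
rng = np.random.default_rng(0)
n = 2000
iq = rng.normal(size=n)
father_school = rng.normal(size=n)
husband_school = 12 + 1.0 * iq + 0.5 * father_school + rng.normal(size=n)
wife_school = 12 + 0.6 * iq + 0.4 * father_school + rng.normal(size=n)
log_earn = (9 + 0.06 * husband_school + 0.05 * iq
            + 0.01 * wife_school + rng.normal(scale=0.4, size=n))

# Step 1: the part of wife's schooling predicted by husband's background.
background = np.column_stack([iq, father_school])
_, wife_pred = ols(wife_school, background)

# Step 2: the orthogonal remainder (the candidate "pure" component).
wife_resid = wife_school - wife_pred

# Step 3: earnings regression with the two parts entered separately.
coef, _ = ols(log_earn, np.column_stack([husband_school, wife_pred, wife_resid]))
print(dict(zip(["const", "husband_school", "wife_pred", "wife_resid"], coef)))
```

By construction the residual component is uncorrelated with the background variables used in the first step, so its coefficient cannot be picking up their influence.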

Does this then prove that the pure productivity effect is unimportant?
Clearly not. Aside from the peculiarities of this particular set of data, the
identification problems surrounding the Benham hypothesis are probably
insurmountable. For example, consider models of selective mating. If
forces for selectivity exist in the absence of this cross-productivity
(cross-fertilization?) effect, they are clearly magnified by its existence.



Family Investments in Human Capital: 
Earnings of Women 


Jacob Mincer 

Columbia University and National Bureau of Economic Research

Solomon Polachek

University of North Carolina


I. Introduction 

It has long been recognized that consumption behavior represents
mainly joint household or family decisions rather than separate decisions
of family members. Accordingly, the observational units in consumption
surveys are “consumer units,” that is, households in which income is
largely pooled and consumption largely shared.

More recent is the recognition that an individual’s use of time, and
particularly the allocation of time between market and nonmarket
activities, is also best understood within the context of the family as a
matter of interdependence with needs, activities, and characteristics of
other family members. More generally, the family is viewed as an
economic unit which shares consumption and allocates production at
home and in the market as well as the investments in physical and human
capital of its members. In this view, the behavior of the family unit implies
a division of labor within it. Broadly speaking, this division of labor or
“differentiation of roles” emerges because the attempts to promote family
life are necessarily constrained by complementarity and substitution
relations in the household production process and by comparative
Research here reported is part of a continuing study of the distribution of income
conducted by the National Bureau of Economic Research and funded by the National
Science Foundation and the Office of Economic Opportunity. This report has not
undergone the usual NBER review. We are grateful to Otis Dudley Duncan, James [...],
Melvin Reder, T. W. Schultz, and Robert Willis for useful comments, and to [...]
Borjas for skillful research assistance.


advantages due to differential skills and earning powers with which family
members are endowed.

Though the levels and distribution of these endowments can be taken
as given in the short run, this is not true in a more complete perspective.
Even if each individual’s endowment were genetically determined,
purposive marital selection would make its distribution in the family
endogenous, along the lines suggested by Becker in this volume. Of
course, individual endowments are not merely genetic; they can be
augmented by processes of investment in human capital and reduced by
depreciation. Indeed, a major function of the family as a social institution
is the building of human capital of children, a lengthy “gestation”
process made even longer by growing demands of technology.

Optimal investment in human capital of any family member requires
attention not only to the human and financial capacities in the family,
but also to the prospective utilization of the capital which is being
accumulated. Expectations of future family and market activities of
individuals are, therefore, important determinants of levels and forms of
investment in human capital. Thus, family investments and time
allocation are linked: while the current distribution of human capital
influences the current allocation of time within the family, the prospective
allocation of time influences current investments in human capital.

That the differential allocation of time and of investments in human
capital is generally sex linked and subject to technological and cultural
changes is a matter of fact which is outside the scope of our analysis.
Given the sex linkage, we focus on the relation within the family between
time allocation and investments in human capital which give rise to the
observed market earnings of women. Whether these earnings, or the
investments underlying them, are also influenced or reinforced by
discriminatory attitudes of employers and fellow workers toward women in the
labor market is a question we do not explore directly, though we briefly
analyze the male-female wage differential. Our major purposes are to
ascertain and to estimate the effects of human-capital accumulation on
market earnings and wage rates of women, to infer the magnitudes and
course of such investments over the life histories of women, and to interpret
these histories in the context of past expectations and of current and
prospective family life.

The data we study, the 1967 National Longitudinal Survey of Work
Experience (NLS), afford a heretofore unavailable opportunity to relate
family and work histories of women to their current market earning
power. Accumulation of human capital is a lifetime process. In the
postschool stage of the life cycle much of the continued accumulation of
earning power takes place on the job. Where past work experience of
men can be measured without much error in numbers of years elapsed
since leaving school, such a measure of “potential work experience” is



clearly inadequate for members of the labor force among whom the length
and continuity of work experience varies a great deal. Direct information
on work histories of women is, therefore, a basic requirement for the
analysis of their earnings. To our knowledge, the NLS is the only data
set which provides this information, albeit on a retrospective basis.
Eventually, the NLS panel surveys will provide the information on a
current basis, showing developments as they unfold.


II. The Human-Capital Earnings Function


To the extent that earnings in the labor market are a function of the
human-capital stock accumulated by individuals, a sequence of positive
net investments gives rise to growing earning power over the life cycle.
When net investment is negative, that is, when market skills are eroded by
depreciation, earning power declines. This relation between the sequence
of capital accumulation and the resulting growth in earnings has been
formalized in the “human-capital earnings function.” A simple
specification of this function fits the life cycle “earnings profile” of men rather
well. The approach to distribution of earnings among male workers
(in the United States and elsewhere) as a distribution of individual
earnings profiles appears to be promising.

For the purpose of this paper, a brief development of the earnings
function may suffice.

Let $C_{t-1}$ be the dollar amount of net investment in period $t - 1$, while
(gross) earnings in that period, before the investment expenditures are
subtracted, are $E_{t-1}$. Let $r$ be the average rate of return to the individual’s
human-capital investment, and assume that $r$ is the same in each period.
Then

$$E_t = E_{t-1} + rC_{t-1}. \tag{1}$$

Let $k_t = C_t/E_t$, the ratio of investment expenditures to gross earnings,
which may be viewed as investment in time-equivalent units. Then

$$E_t = E_{t-1}(1 + rk_{t-1}). \tag{2}$$


For a description of the NLS survey of women’s work histories, see Pan[...],
Spitz, and Zeller (1970). For an analysis of earnings of men, using “potential work
experience” measures, see Mincer (1974). Though less appropriate, the same [...]
was used in several recent studies of female earnings. Direct information from the [...]
Survey was first used by Suter and Miller (1971). The human-capital approach was
applied to these data by Polachek in his Columbia Ph.D. thesis, “Work Experience and
the Difference between Male and Female Wages” (1973). This paper reports a further
development of the analysis in that thesis.

See, for instance, Rahm (1971), Chiswick and Mincer (1972), Chiswick [...],
Mincer (1974), and a series of unpublished research papers by George E. Johnson and
Frank P. Stafford on earnings of Ph.D.’s in various fields.




By recursion, $E_t = E_0(1 + rk_0)(1 + rk_1) \cdots (1 + rk_{t-1})$. The term $rk$
is a small fraction. Hence a logarithmic approximation of $\ln(1 + rk) \approx rk$
yields

$$\ln E_t = \ln E_0 + r \sum_{i=0}^{t-1} k_i. \tag{3}$$

Since earnings net of investment expenditures are $Y_t = E_t(1 - k_t)$, we have
also

$$\ln Y_t = \ln E_0 + r \sum_{i=0}^{t-1} k_i + \ln(1 - k_t). \tag{4}$$
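The quality of the logarithmic approximation behind equation (3) can be checked numerically. The following Python sketch builds an earnings path from the recursion in equation (2) and compares the exact log of earnings with the approximation; the starting earnings, the rate of return, and the declining investment ratios are hypothetical values chosen only for illustration.

```python
import math

def gross_earnings_path(e0, r, k):
    """Apply E_t = E_{t-1} * (1 + r * k_{t-1}); returns [E_0, E_1, ..., E_T]."""
    path = [e0]
    for k_prev in k:
        path.append(path[-1] * (1.0 + r * k_prev))
    return path

# Hypothetical values: initial earnings, rate of return, and ten
# investment ratios declining linearly from 0.50 to 0.05.
e0, r = 5000.0, 0.10
k = [0.5 - 0.05 * i for i in range(10)]

path = gross_earnings_path(e0, r, k)

exact_log = math.log(path[-1])              # ln E_t from the exact recursion
approx_log = math.log(e0) + r * sum(k)      # equation (3)

# Net earnings in period 5, as in equation (4): Y_5 = E_5 * (1 - k_5).
net_5 = path[5] * (1.0 - k[5])

print(exact_log, approx_log, approx_log - exact_log, net_5)
```

Because ln(1 + x) is smaller than x for positive x, the approximation slightly overstates the exact value, and the gap stays small as long as each rk term is a small fraction.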


Some investments are in the form of schooling; others take the form of
formal and informal job training. If only these two categories of investment
are analyzed, that is, schooling and postschool experience, the
$k$ terms can be separated, and

$$\ln E_t = \ln E_0 + r \sum_{i=0}^{s-1} k_i + r \sum_{j=s}^{t-1} k_j, \tag{5}$$

where the $k_i$ are investment ratios during the schooling period and the $k_j$
thereafter. With tuition added to opportunity costs and student earnings
and scholarships subtracted from them, the rough assumption $k_i = 1$
may be used. Hence,

$$\ln E_t = \ln E_0 + rs + r \sum_{j=s}^{t-1} k_j. \tag{6}$$


The postschool investment ratios $k_j$ are expected to decline continuously
if work experience is expected to be continuous and the purpose of
investment is acquisition and maintenance of market earning power. This
conclusion emerges from models of optimal distribution of investment
expenditures $C_t$ over the life cycle (see Becker 1967 and Ben-Porath 1967).
A sufficient rationale for our purposes is that as $t$ increases, the remaining
working life $(T - t)$ shortens. Since $(T - t)$ is the length of the payoff
period on investments in $t$, the incentives to invest and the magnitudes of
investment decline over the (continuous) working life. This is true for
$C_t$ and a fortiori for $k_t$, since with positive $C_t$, $E_t$ rises, and $k_t$ is the ratio
of $C_t$ to $E_t$.

In analyses of male earnings, a linearly (or geometrically) declining
approximation of the working-life profile of investment ratios $k_t$ appears
to be a satisfactory statistical hypothesis.

Inclusion of other categories in the earnings function is an important research
need, since human capital is acquired in many other ways: in the home environment, in
investments in health, by mobility, information, and so forth.

According to T. W. Schultz, this assumption overstates $k$, especially at higher
education levels, leading to an understatement of $r$.





It will be useful for our purpose of studying earnings of women to
decompose net investments explicitly into gross investments and
depreciation. Let $C^*_{t-1}$ be the dollar amount of gross investment in period
$t - 1$, $\delta_{t-1}$ the depreciation rate of the stock of human capital, hence of
earnings $E_{t-1}$ during that period, and $k^*_t = C^*_t/E_t$ the gross investment
ratio. Hence

$$E_t = E_{t-1} + rC^*_{t-1} - \delta_{t-1}E_{t-1}$$

and

$$\frac{E_t}{E_{t-1}} = 1 + rk^*_{t-1} - \delta_{t-1} = 1 + rk_{t-1}, \quad \text{by equation (2);}$$

thus

$$rk_t = rk^*_t - \delta_t.$$

The earnings function (3) can, therefore, be written as

$$\ln E_t = \ln E_0 + \sum_{i=0}^{t-1} (rk^*_i - \delta_i).$$
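A short numerical sketch may help show how gross investment and depreciation combine in this form of the earnings function. The Python fragment below sums the per-period terms rk* minus delta over a stylized fifteen-year history with an interruption in the middle; the rate of return, the gross investment ratios, and the depreciation rate are hypothetical values, not estimates.

```python
def log_earnings_gain(r, gross_ratios, depreciation_rates):
    """Sum of (r * k*_i - delta_i) over periods, i.e. ln E_t - ln E_0."""
    return sum(r * k_star - delta
               for k_star, delta in zip(gross_ratios, depreciation_rates))

r = 0.10
# Hypothetical history: 5 years at work, 5 years of nonparticipation with
# no gross investment, then 5 years back at work with lower investment.
gross = [0.4] * 5 + [0.0] * 5 + [0.3] * 5
delta = [0.01] * 15          # constant depreciation rate (assumed)

gain = log_earnings_gain(r, gross, delta)
home_gain = log_earnings_gain(r, gross[5:10], delta[5:10])

print(gain, home_gain)
```

With zero gross investment during the interruption, net investment in those years is negative, so earning power falls over that interval even though it rises over the whole history.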

In transferring the analysis to women, we face two basic facts: (1) After
marriage, women spend less than half of their lifetime in the labor
market, on average. Of course, this “lifetime participation rate” varies
by marital status, number of children, and other circumstances, and it
has been growing secularly. (2) The lesser market work of married women
is not only a matter of fewer years during a lifetime, and fewer weeks per
year, or a shorter work week. An important aspect is discontinuity of
work experience, for most of the married women surveyed in 1967
reported several entries into and exits from the labor force after leaving
school.

The implications of these facts for the volume and the life-cycle
distribution of human-capital investments can be stated briefly:

1. Since job-related investment in human capital commands a return
which is received at work, the shorter the expected and actual duration
of work experience, the weaker the incentives to augment job skills over
the life cycle. With labor-force attachment of married women lasting, on the
average, about one-half that of men, labor-market activities of women
are less likely to contain skill training and learning components as a
result both of women’s own decisions and decisions of employers, who
may be expected to invest in worker skills to some extent.

2. Given discontinuity of work experience, the conclusion of optimization
analysis to the effect that human-capital investments decline

For a mathematical statement of the optimization analysis applied to discontinuous
work experience, see Polachek (1973, chap. 3).

For the sake of brevity, the term “work” refers to work in the job market. It does not
imply that women occupied in the household do not work.





TABLE 1

Labor-Force Participation of Mothers: Proportion Working,
White Married Women with Children, Spouse Present

                         Proportion Working (%)
Age and          -------------------------------------
Schooling        In 1966   After First Child    Ever     Sample Size

30-34               43            64             82           925
  S < 12            46            71             75           294
  S = 12            43            63             84           446
  S > 12            40            59             88           185
35-39               47            67             87           945
  S < 12            45            66             82           336
  S = 12            49            68             88           422
  S > 12            47            67             92           187
40-44               53            70             88         1,078
  S < 12            52            72             78           465
  S = 12            54            70             91           446
  S > 12            51            68             93           167

Source: NLS, 1967 survey.
Note: S = years of schooling.


continuously over the successive years of life after leaving school is no
longer valid. Even a continuous decline over the years spent in the job
market cannot be hypothesized if several intervals of work experience
rather than one stretch represent the norm.

3. The more continuous the participation, the larger the investments
on initial job experience relative to those in later jobs.

Women without children and without husbands may be expected to
engage in continuous job experience. But labor-force participation of
married women, especially of mothers, varies over the life cycle, depending
on the demands on their time in the household as well as on their skills
and preferences relative to those of other family members. The average
pattern of labor-force experience is apparent in tables 1-3, which are
based on the NLS data reported by women who were 30-44 years of age
at the time of the survey. According to the data:

1. Though less than 50 percent of the mothers worked in 1966, close to
90 percent worked sometime after they left school, and two-thirds
returned to the labor market after the birth of the first child (table 1).
Lifetime labor-force participation of women without children or without
husbands is, of course, greater.

2. Never-married women spent 90 percent of their years after they left
school in the labor market, while married women with children spent
less than 50 percent of their time in it. In each age group, childless women,
those with children but without husbands (widowed, divorced, or
separated), and those who married more than once spent less time in the
market than never-married women, but more than mothers married
once, spouse present (table 2).






3. Table 3 shows the characteristic work histories of mothers, spouse
present (MSP), who represented over two-thirds of the women in the
sample. We show chronologically the length of nonparticipation ($h_1$)
during the interval between leaving school and marriage, the years of
market work between school and the birth of the first child ($e_1$), an
uninterrupted period of nonparticipation, $h_2$, starting just before the
first child was born, followed by $e_2$ and $h_3$, which sum intermittent
participation and nonparticipation, respectively, and finally $e_3$, the
present job tenure of women working at the time of the survey.

It is clear from the tabulations that, after their schooling, the life cycle
of married women features several stages which differ in the nature and
degree of labor-market and home involvement. There is usually continuous
market work prior to the birth of the first child. The second stage
is a period of nonparticipation related to childbearing and child care,
lasting between 5 and 10 years, followed by intermittent participation
before the youngest child reaches school age. The third stage is a more
permanent return to the labor force for some, though it may remain
intermittent for others. In our data, which were obtained from women
who were less than 45 years old, only the beginning of the third stage is
visible.

The following conjectures about investment behavior in each of these
stages are plausible in view of the described patterns which are to some
extent anticipated by the women:

1. Prospective discontinuity may well influence many young women
during their prematernal employment ($e_1$) to acquire less job training
than men with comparable education, unless they do not expect to marry
or have an overriding commitment to a work career.

2. During the period of childbearing and child care, prolonged
nonparticipation may cause the skills acquired at school and at work to
depreciate. Some revisions of expectations and of commitments may also
take place. Little investment, if any, can be expected during the episodic
employment period $e_2$.

3. There is likely to be a stronger expectation of prospective continuity
of employment after the children reach school age. To the extent that the
current job ($e_3$) is more likely to represent this more-permanent return to
the labor force than $e_2$ does, strong incentives to resume investments in
job-related skills should reappear.

The six intervals shown in table 3 are aggregated from eight available ones. Both sets
are described in the Appendix.

We are reminded by T. W. Schultz that erosion of market skills during periods of
nonparticipation is likely to be associated with growth in nonmarket productivity. If so,
the longer the time spent out of the labor force the greater the excess of the reservation
or “shadow” price over the market wage, hence the smaller the probability of subsequent
labor-force participation.








TABLE 3 (Continued) 







These conjectures imply that the investment profile of married women
is not monotonic. There is a gap which is likely to show negative values
(net depreciation) during the childbearing period and two peaks before
and after. The levels of these peaks are likely to be correlated for the same
woman, and their comparative size is likely to depend on the degree of
continuity of work experience. The whole profile can be visualized in
comparison with the investment profiles of men and of single women.
For never-married women, stage 1 ($e_1$) extends over their whole working
life, and the investment profile declines as it does for men. To the extent,
however, that expectation of marriage and of childbearing are stronger
at younger ages and diminish with age, investment of never-married
women is likely to be initially lower than that of men. At the same time,
given lesser expectations of marriage on the part of the never-married, their
initial on-the-job investments exceed those of the women who eventually
marry, while the profile of the latter shows two peaks.

The implications for comparative-earnings profiles are clear: greater
investment ratios imply a steeper growth of earnings, while declining
investment profiles imply concavity of earnings profiles. Hence, earnings
profiles of men are steepest and concave, those of childless women less so,
and those of mothers are double peaked with least overall growth.

III. Women's Wage Equation

To adapt the earnings function to persons with intermittent work
experience we break up the postschool investment term in the earnings
equation into successive segments of participation and nonparticipation
as they occur chronologically. In the general case with $n$ segments we may
express the investment ratio $k_i = a_i + b_i t$, $i = 1, 2, \ldots, n$, and

$$\ln E_t = \ln E_0 + rs + r \sum_{i=1}^{n} \int_{t_i}^{t_{i+1}} (a_i + b_i t)\,dt. \qquad (7)$$

Here $a_i$ is the initial investment ratio, $b_i$ is the rate of change of the
investment ratio during the $i$th segment, and $(t_{i+1} - t_i) = e_i$ is the
duration of the $i$th segment. Note that in (7) the initial investment ratio
refers to its projected value at $t = 0$, the start of working life. In a work
interval $m$ which occurs in later life there is likely to be less investment
than in an earlier interval $j$, though more than would be observed if $j$
continued at its gradient through the years covered by $m$. In this case,
$a_m$ in (7) will exceed $a_j$.

Alternatively, $a_j$ and $a_m$ can be compared directly in the specification

$$\ln E_t = \ln E_0 + rs + r \sum_{i=1}^{n} \int_{0}^{e_i} (a_i + b_i t)\,dt, \qquad (8)$$

since $a_i$ is the investment ratio at the beginning of the particular
segment $i$.


While the rate of change in investment $b_i$ is likely to be negative in
longer intervals, it may not be significant in shorter ones. Since the
segments we observe in the histories of women before age 45 are relatively
short, a simplified scheme is to assume a constant rate of net investment
throughout a given segment, though differing among segments. The
earnings function simplifies to

$$\ln E_t = \ln E_0 + rs + r \sum_{i} a_i e_i. \qquad (9)$$

Whereas $(ra_i) > 0$ denotes positive net investment (ratios), $(ra_i) < 0$
represents net depreciation rates, likely in periods of nonparticipation.
The question whether the annual investment or depreciation rates vary
with the length of the interval is ultimately an empirical one. Even if
each woman were to invest diminishing amounts over a segment of work
experience, those women who stay longer in the labor market are likely to
invest more per unit of time, so that $a_i$ is likely to be a positive function
of the length of the interval in the cross section.

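Thus, even if $k_{ij} = a_{ij} - b_{ij} t$ for a given woman $j$, if
$a_{ij} = \alpha_i + \beta_i t$ across women, on substitution, the coefficient
$b$ of $t$ may become negligible or even positive in the cross section. On
integrating, and using three segments of working life as an example,
earnings functions (7), (8), and (9) become

$$\ln E_t = a_0 + rs + r\left[a_1 t_1 + \tfrac{1}{2} b_1 t_1^2 + a_2 (t_2 - t_1) + \tfrac{1}{2} b_2 (t_2^2 - t_1^2) + a_3 (t - t_2) + \tfrac{1}{2} b_3 (t^2 - t_2^2)\right], \qquad (7a)$$

$$\ln E_t = a_0 + rs + r\left(a_1 e_1 + \tfrac{1}{2} b_1 e_1^2 + a_2 e_2 + \tfrac{1}{2} b_2 e_2^2 + a_3 e_3 + \tfrac{1}{2} b_3 e_3^2\right), \qquad (8a)$$

$$\ln E_t = a_0 + rs + r\left(a_1 e_1 + a_2 e_2 + a_3 e_3\right). \qquad (9a)$$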

In this example, $t$ is within the last (third) segment, and the middle
segment, $e_2 = h$, is a period of nonparticipation or "home time." The
signs of $b_i$ are ambiguous in the cross section, as already indicated;
the coefficients of $e_1$ and of $e_3$ are expected to be positive, but those of
$e_2$ (or $h$) negative, most clearly in (9a).
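The three integrated forms can be compared numerically. The following is a minimal sketch, not the authors' procedure: the `Segment` class, the function names, and every parameter value (a 10 percent rate of return, 12 years of schooling, a 4-8-5 year work/home/work history) are hypothetical and chosen only to make the arithmetic easy to follow.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    a: float  # initial investment ratio of the segment (negative = net depreciation)
    b: float  # yearly change of the investment ratio within the segment
    e: float  # duration of the segment in years


def log_earnings_7a(ln_e0, r, s, segments):
    """Form (7a): time t runs continuously from the start of working life."""
    total, start = 0.0, 0.0
    for seg in segments:
        end = start + seg.e
        total += seg.a * (end - start) + 0.5 * seg.b * (end ** 2 - start ** 2)
        start = end
    return ln_e0 + r * s + r * total


def log_earnings_8a(ln_e0, r, s, segments):
    """Form (8a): the clock restarts at zero in every segment."""
    total = sum(seg.a * seg.e + 0.5 * seg.b * seg.e ** 2 for seg in segments)
    return ln_e0 + r * s + r * total


def log_earnings_9a(ln_e0, r, s, segments):
    """Form (9a): constant net investment ratio within each segment."""
    return ln_e0 + r * s + r * sum(seg.a * seg.e for seg in segments)


# Hypothetical history: 4 years of work, 8 years of home time, 5 years of work.
history = [
    Segment(a=0.30, b=-0.01, e=4.0),
    Segment(a=-0.10, b=0.0, e=8.0),
    Segment(a=0.20, b=-0.01, e=5.0),
]
# Same history with the within-segment slopes set to zero.
flat = [Segment(seg.a, 0.0, seg.e) for seg in history]

ln_e_8a = log_earnings_8a(0.0, 0.10, 12.0, history)
ln_e_9a = log_earnings_9a(0.0, 0.10, 12.0, history)
ln_e_7a_flat = log_earnings_7a(0.0, 0.10, 12.0, flat)
```

With all slopes set to zero the three forms coincide, which is why the linear specification (9a) can be read as the special case of (7a) and (8a) with constant investment ratios within segments.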

The equations for observed earnings ($\ln Y_t$) differ from the equations
shown above by a term $\ln(1 - k_t)$, as was shown in the comparison of
equations (3) and (4). With $k_t$ relatively small, only the intercept $a_0$ is
affected, so the same form holds for $\ln Y_t$ as for $\ln E_t$.

It will help our understanding of the estimates of depreciation rates to
express earnings function (9a) in terms of gross-investment rates and
depreciation rates:

$$\ln E_t = \ln E_0 + \sum_i (rk_i^* - \delta_i) e_i = \ln E_0 + (r - \delta_s) s + (rk_1^* - \delta_1) e_1 + (rk_h^* - \delta_h) h + (rk_3^* - \delta_3) e_3. \qquad (9b)$$





This formulation suggests that depreciation of earning power may occur
not only in periods of nonparticipation ($h$), but at other times as well. On
the other hand, market-oriented investment, such as informal study and
job search, may take place during home time, so that $k_h^* > 0$. Positive
coefficients of $e_1$ and $e_3$ would reflect positive net investment, while a
negative coefficient of $h$ is an estimate of net depreciation. If $k_h^* > 0$, the
absolute value of the depreciation rate is underestimated.
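The underestimation can be made concrete with a small numerical sketch. The rate of return, the home-time investment ratio, and the depreciation rate below are hypothetical values, and `net_coefficient` is an illustrative helper, not a function from the source.

```python
def net_coefficient(r, k_star, delta):
    """Coefficient of a segment in the earnings function: r * k* - delta."""
    return r * k_star - delta


r = 0.10         # assumed rate of return
delta_h = 0.020  # assumed true depreciation rate during home time

# Case 1: no market-oriented investment during home time.
coef_no_investment = net_coefficient(r, 0.0, delta_h)

# Case 2: some investment (e.g., informal study) during home time.
coef_with_investment = net_coefficient(r, 0.05, delta_h)

# The regression only reveals the net coefficient, so the implied
# depreciation rate is its negative.
implied_rate_no_investment = -coef_no_investment
implied_rate_with_investment = -coef_with_investment
```

In the second case the home-time coefficient implies a depreciation rate of 1.5 percent per year even though the assumed true rate is 2 percent, so the estimate understates depreciation by exactly the gross-investment term.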


IV. Empirical Findings

Tables 4-8 show results of regression analyses which apply our earnings
function to analyze wage rates of women who worked in 1966, the year
preceding the survey. The general specification is $\ln w = f(S, e, h, x)$
plus a statistical residual, where $w$ is the hourly wage rate; $S$ is the years
of schooling; $e$ is a vector of work-experience segments; $h$ is a vector of
home-time segments; and $x$ is a vector of other variables, such as indexes
of job training, mobility, health, number of children, and current weeks
and hours of work.

The findings described here are based on ordinary least-squares (OLS)
regressions. The tables show shorter and longer lists of variables without
covering all the intermediate lists. In view of a plausible simultaneity
problem we attempted also a two-stage least-squares (2SLS) estimation
procedure, which we describe in the next section. Since the 2SLS estimates
do not appear to contradict the findings based on OLS, we describe the
OLS findings first below.

1. Work History Detail and Equation Form 

When life histories are segmented into five intervals (eight is the maximum
possible in the data), three of which are periods of work experience and
two of nonmarket activity,* both nonlinear formulations (equations
[7] and [8]) are less informative than the linear specification [...]. Rates
of change in investment (coefficient $b$) are probably not substantial in
a short interval, and the intercorrelation of the linear and square
terms hinders the estimation. Dropping the square terms reduces the
explanatory power of the regression slightly but increases [...]
of the life-cycle investment profile. Conversely, when the segments are
aggregated, the quadratic term becomes negative but does not
acquire statistical significance by conventional standards. The [...]
term for current work experience is negative and significant [...]


* Tables 2 and 3 show six intervals, including a very short nonparticipation
interval [...] between school and marriage. This interval is aggregated in other
[...] regressions.




of never-married women, one segment of work experience usually covers
most of the potential working life. Here the nonlinear formulation over
the interval is as natural and informative as it is for men.


2. Investment Rates

Table 4 compares earnings functions of women by marital status and
presence of children, tables 5 and 6 by level of schooling, and table 7 by
lifetime work experience. In each table we can compare groups of women
with differential labor-force attachment. According to human-capital
theory, higher investment levels should be observed in groups with
stronger labor-force attachment.

We can infer these differences in investment by looking at the
coefficients of experience segments, $e_1$ (prematernal), $e_2$ (intermittent,
after the first child), and $e_3$ (current). These increase systematically from
married women with children to married women without children to
single women in table 4, and from women who worked less than half to
those who worked more than half of their lifetime in table 7. An exception
is the coefficient of $e_3$, which appears to be somewhat higher for the group
who worked less (see table 7). Note, however, that these coefficients are
investment ratios (to gross wage rates), not dollar volumes. Since wage
rates are higher in the groups with more work experience, the conclusions
about increasing investment hold for dollar magnitudes, a fortiori, and
the anomaly in table 7 disappears.*
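The step from investment ratios to dollar volumes is simple arithmetic. The sketch below uses the two figures reported in the accompanying footnote (a coefficient of current experience about 15 percent higher, and a wage rate about 25 percent lower, in the group that worked less). Treating dollar investment as proportional to the ratio times the wage rate, with a common rate of return, is an assumption of the sketch; the variable names are illustrative.

```python
# Group that worked less than half of the years, relative to the group
# that worked more than half (the latter normalized to 1.0).
relative_investment_ratio = 1.0 + 0.15   # coefficient about 15 percent higher
relative_wage_rate = 1.0 - 0.25          # wage rate about 25 percent lower

# Assumed: dollar investment is proportional to (investment ratio) x (wage rate).
relative_dollar_investment = relative_investment_ratio * relative_wage_rate
```

Under that assumption the group with less lifetime work experience invests roughly 14 percent less in dollar terms, despite its higher investment ratio.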

Classifications by schooling show mixed results. In table 5, where
schooling is stratified by <12, 12-15, and 16+, investment ratios
(coefficients of $e_i$) are lower at higher levels of schooling (with the
exception of the coefficient of $e_1$). Translated into dollar terms, no clear
pattern emerges. At the same time in table 6, where the schooling strata
are ≤8, 9-12, and 13+, a positive relation between investment volumes and
levels of schooling is somewhat better indicated. Note that the sample
size for the highest-schooling groups (16+) is quite small in table 5,
as is that for the lowest-schooling groups (≤8) in table 6.

3. Investment Profiles 

Another implication of the human-capital theory refers to the shape of
the investment profile: it is monotonically declining in groups with
continuous participation, hence earnings are parabolic in aggregated

The coefficient of $e_3$, calculated as $\partial \ln W / \partial e$, is 15 percent higher in the
right-hand group. However, the wage rate of this group is about 25 percent lower.

Wage rates are roughly 30 percent higher in successive schooling groups.









[Table residue, apparently table 5 (earnings functions by schooling group; the 16+ column shows N = 83 and the <12 column N = 435): table body illegible in the scanned source.]


TABLE 7

Earnings Functions of WMSP by Lifetime Work Experience

           Worked More than              Worked Less than
           Half of Years                 Half of Years
           b         t       M           b         t       M

          -.28                          -.10
           .073     9.4    11.8          .059     7.9    11.0
           .009     2.1     4.9          .003     0.4     2.2
           .006     1.4     5.6         -.005    -0.6     1.5
           .017     2.0     4.9          .022     3.8     1.6
          -.0002   -0.7                 -.001    -1.5
          -.014    -2.3     2.2         -.010    -2.6    10.7
           .011     1.7     2.1         -.004    -0.9     4.7
          -.0008   -2.1    10.8         -.0001   -0.3    13.7
           .002     1.1    12.1          .002     1.0    11.8
           .064     2.8     0.97         .024     1.0     0.90
          -.08     -2.0     3.52        -.13     -4.4     3.40
           .07      [...]   3.71         .023     1.0     3.29
          -.015    -1.4     2.21        -.001    -0.2     3.18
R²         .22                           .21
N          536                           604

Note: WMSP = white married women, spouse present. See table 4 for key to symbols.


experience for men and never-married women.** In the groups with
discontinuous participation, the profiles are not expected to be monotonic.
We can summarize the implicit profiles schematically, in terms of the
coefficients of $e_1$, length of work experience before the first child; $h_1$,
uninterrupted nonparticipation after the first child; and $e_3$, the current
work interval. We find (table 4, col. 3) that white married women with
children (with spouse present) have a current investment (ratio) which
exceeds the investment (ratio) incurred in experience before the first
child.** Presumably, current participation in the labor force, which
takes place when most of the children have reached school age, is expected
to last longer than the previous periods of work experience. This is
certainly true of women over age 35, and it holds in regressions with or
without standardization for age.

Looking at regressions within three education levels (tables 5-6), we
find that the coefficient of prematernal experience ($e_1$) exceeds the coefficient
of current work experience ($e_3$) at the highest level of schooling (in the
short equations, though not in the long ones), and the opposite is true at
lower levels. For women without children the coefficient of prematernal
experience equals that of current work experience. The investment
profile of never-married women has a downward slope. Comparable

In the earnings regressions, the quadratic term of aggregated experience is often
negative, but not significant statistically.

The statements about differences in coefficients refer to point estimates. The
differences are mentioned because they are suggestive, though they would not pass strict
tests of statistical significance within a given equation.




early segments of their postschool job experience contain higher investment
ratios; indeed, the fit implies a linear decline of such ratios over the life
cycle. Evidently, women who intend to spend more time in the labor force
invest more initially. This is true, presumably, even if their plans are
changed following marriage and childbearing.

4. Depreciation Rates

The coefficient of home time is negative, indicating a net depreciation of
earning power. During the home-time interval ($h_1$), associated with
marriage or the birth of the first child, this net depreciation amounts to,
on average, 1.5 percent per year. In table 5 the depreciation rate is small
(-0.2 percent) and insignificant for women with less than high school
education, larger (-1.3 percent) for those with 12-15 years of schooling,
and largest (-2.3 percent) for those with 16+ years of schooling. In
table 6, the net depreciation rate is -1.1 percent for women with
elementary schooling or less, -1.4 percent for women with some high school,
and -4.3 percent for women with at least some college. [...]
differences probably account for the different estimates in the two tables.
The depreciation rate also appears higher in the group who worked more
than half the years (table 7).

It would seem that the depreciation rate is higher when the accumulated
stock of human capital is larger. An exception appears in the comparison
of women without children (married and single) with women with
children. The former have a lower depreciation rate. Of course, they
spend much less time out of market work, and some of that time [...]
job-oriented (e.g., job search).

It is useful to return to the formulation (9b) of the earnings function
for a closer analysis of the depreciation rates:
$\ln E_t = \ln E_0 + \cdots + (rk_1^* - \delta_1) e_1 + (rk_h^* - \delta_h) h + (rk_3^* - \delta_3) e_3$.
Our coefficient of home time measures the depreciation rate only if
market-oriented investment $k_h^*$ is negligible. This is likely to be true for
the period of [...] period defined as $h_1$ in the regression [...].

An interesting question is whether the depreciation due to
nonparticipation is different from the depreciation that [...] as
well. The question is whether depreciation due to nonuse of the human
capital stock (atrophy?) exceeds the depreciation due [...]
to aging (?). We are inclined to believe that depreciation [...]
("getting rusty") is by far more important, particularly [...]
relatively young (below age 45). Moreover, the [...]
that depreciation due to nonparticipation is stronger for the job-
oriented components of human capital acquired on the job than
for the inborn, initial, or general components of the [...].
If so, a fixed rate of "home-time depreciation" applied to [...]





accumulation of human capital would appear as a varying rate in the
earnings function: given the volume of other human capital, the larger the
on-the-job accumulated component of human capital, the higher the
observed (applied to the total earning power) depreciation rate.
This may be an explanation of the observed higher depreciation rates
at higher schooling and experience levels of mothers. In particular, there
is a positive relation between the coefficients of $h_1$ (in absolute value) and
of $e_1$ across schooling groups (table 6), experience groups (table 7), and
race groups (compare tables 4 and 8).
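The mechanism can be illustrated with the relation given in the accompanying footnote, where the observed rate equals the job-specific rate weighted by the share of job-accumulated capital in total human capital. The sketch below is only an illustration: the function name and the capital stocks and rate used are hypothetical.

```python
def observed_depreciation(delta_j, job_capital, other_capital):
    """Observed rate when only job-accumulated capital depreciates at delta_j."""
    return delta_j * job_capital / (job_capital + other_capital)


delta_j = 0.05        # assumed fixed rate on job-accumulated capital
other_capital = 10.0  # assumed volume of other (inborn, general) capital

# Same fixed rate, different amounts of job-accumulated capital.
low_experience = observed_depreciation(delta_j, 2.0, other_capital)
high_experience = observed_depreciation(delta_j, 10.0, other_capital)
```

With the same fixed rate, the woman with five times as much job-accumulated capital shows an observed rate of 2.5 percent instead of about 0.8 percent, which is the pattern the text describes across schooling and experience groups.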

5. Effect of Family Size

Do family size and number of children currently present affect the
accumulation of earning power beyond the effect on work experience?
The answer is largely negative: when numbers of children and some
measures of their age are added to work histories in the equations, the
children variables are negative but usually not significant statistically.
Their inclusion reduces the absolute values of the coefficients of experience
and of home time and does not add perceptibly to the explanatory power
of the regression. Note, however, that the children variable does approach
significance in the relatively small groups of highly educated women
(tables 5-6), and more generally among women with stronger labor-force
attachment (table 7). Possibly, shorter hours or lesser intensity of work
are, to some extent, the preferred alternatives to job discontinuity.


6. Formal Postschool Training

The coefficients of experience, $a_i$, represent estimates of $rk_i$, where $k_i$ is
the average investment ratio across women over the segment and $r$ is the
average rate of return. Individual variation in $k_i$ is not available to us.
We have some individual information, however, on months of formal job
training received after completion of schooling as well as on possession of
professional certificates by, among others, registered nurses, teachers, and
[...]. If the length of training and possession of a certificate are
[...] indexes of $k$, we may represent $a_i = \alpha_0 + \beta \cdot tr$, where $tr$ is the
length of training. The term $a \cdot e$ in that equation becomes

$$(\alpha_0 + \beta\, tr) \cdot e = \alpha_0 \cdot e + \beta (tr \cdot e).$$

An interaction term ($tr \cdot e$) can be added to the equation, and if the
hypothesis is correct, the coefficient $\beta$ should be positive. This is indeed


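If $\delta$ is the observed depreciation rate, $\delta_j$ the rate applicable to job-accumulated
capital $H_j$, and $H_0$ the volume of other human capital, $\delta = \delta_j H_j / (H_j + H_0) =
\delta_j / [1 + (H_0 / H_j)]$. With a fixed rate $\delta_j$ for all individuals, the larger $H_j$ the larger $\delta$.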



the case in most of our equations, confirming the training interpretation
of the experience coefficients in the earnings function. Both interactions,
with months of job training and with possession of a certificate, are
significant for married women. The training interaction variable is
positive in the earnings function of single women, but the certificate
variable is negative. Whereas the negative coefficient of the certificate-
experience variable implies less than average investment behavior for
persons who work continuously, the corresponding positive coefficient for
intermittent workers implies more than average investment behavior.
7. Effects of Mobility

Research in mobility has shown that, so long as mobility is not
involuntary (resulting from layoffs), it is associated with a gain [...].
However, geographic labor mobility of married women is often [...]
due to job changes of the husband. In that case, it may [...]
continuity of experience and slow the accumulation of earning power.
We used the information on the length of current residence [...]
or a Standard Metropolitan Statistical Area (SMSA) [...]
measure of mobility. This variable has a small positive [...]
rates of white MSP women and a significant negative [...]
women. To the extent that mobility is job oriented for [...]
and exogenous for married women, the differential signs [...]
consistent interpretation.

8. Hours and Weeks in Current Job

When (logs of) weeks and hours worked in the survey [...]
in the regression, a negative sign appears for the weekly [...]
and a positive but less significant one for the weeks [...].

The hours' coefficients are smaller for married women [...]
women and smaller for white than for black women. [...]
of weekly hours may be partly or wholly spurious since [...]
indicated by respondents were weeks or months and [...]
was obtained by division through hours. Of course, [...]
causality is suspect: it is more likely that women [...]
work longer hours than the converse. Deletion of [...]
has a minimal effect on the equations.

9. Other Variables

Three other variables were included in the equations:

1. Twenty percent of the married women who [...]
out of work in 1967. We used a dummy variable [...]



TABLE 8

Earnings Functions of Black Women

MSP with Children                 Never Married

Var      b      t                 Var      b      t

{7 


- 02 


C 

- 48 


s 


095 

11 2 

s 

110 

37 



005 

08 

* 

004 

0 1 

4 


001 

03 

a* 

- 0003 

-02 

K! 


.006 

1 4 

^3 

001 

02 

1 


- 006 

-12 

h 

- 02 

- 05 



- 005 

-09 


001 

1 1 



0005 

1 3 

itr 

0006 

1 4 



008 

1 9 

ect 

003 

04 

Mil 

- 0002 

-05 

Ml 

- 001 

-1 8 

5 


002 

09 

res 

001 

02 


II 

40 

loc 

23 

27 

J 

Hrs 

- 30 

-74 

In Hrs 

- 13 

-07 

a 

Wks 

08 

22 

In Wks 

03 

02 

i 

e 

005 

06 

K 




2 

39 



46 


, \ 


550 


N 

70 



Note: MSP = white married women, spouse present. See table 4 for key to symbols.

[...] working in 1966 stopped working in 1967, and 0 otherwise.** This
variable had a negative sign, since it indicated a shorter current job
experience compared with the prospective work interval of others who
continued to work in 1967 (the completed interval of those dropping out
is not longer than the interval of stayers). In effect, women who dropped
out of the labor force in 1967 had wage rates about 5 percent lower than
women who continued working, given the same characteristics and
histories.** The proportion of dropouts is somewhat larger at lower
education levels.

2. The size of community in which the respondent lived at age 15 had
a positive effect on earning power of married women but no effect on
that of single women.

3. Duration of current health problem in months was used as a measure
of health levels. It is an imperfect measure for retrospective purposes and
has a very small negative effect on the wage rate.

10. Black Women

Regressions for black MSP (table 8) show experience coefficients about
[...] size of the corresponding white population. Home time or
depreciation coefficients are not significant, neither are the children


Not shown in the tables.

Without standardization, women who had dropped out had wage rates about 10
percent lower than women who continued working.






variables. The implication is that there is less investment on the job [...]
though black women spent more time than white women in the labor
market. They had more and younger children, on average. The other
variables behave comparably with those in the white regressions, except
that hours of current work and location at age 15 show stronger [...].
In contrast to white women, the size of community of residence [...]
has a positive effect for never-married women as well. Again, the
experience coefficients are smaller for black single women than [...].
Perhaps contrary to expectations, neither health problems nor [...]
withdrawal from the labor force in 1966 differ for black as compared to
white married women with children, spouse present. Rates of return to
schooling appear, if anything, to be higher for black women [...]


V. Lifetime Participation and the Simultaneity Problem

The earnings function, as we estimate it, relates wages [...]
investments in schooling and on-the-job training and to [...]
additional variables already discussed.
The interpretation of some of the independent variables [...]
affecting earning power may be challenged on the grounds [...]
just as well be viewed as effects rather than causes of [...].
Presumably, women with greater earning power have [...]
aspirations and work commitments than other women [...]
lifetimes. Hence, what we interpret as an earnings function [...]
read with causality running in the opposite direction [...]
function. This argument is most telling for concurrent [...]
last year's hours and weeks worked in relation to last [...].
But these variables are of only marginal importance in [...]
of married women. All other independent variables [...]
the dependent variable (current wage rate), which [...]
function interpretation less vulnerable, though not entirely [...]
a serial correlation between current and past work experience [...]
and past earning power. Since lifetime work experience [...]
on prior wage levels and expectations, our experience [...]
part, determined as well as determining. If so, the residual [...]
is correlated with the experience variables, [...]
coefficients which we interpreted as investment ratios [...].
How serious this problem is for our analysis depends [...]
of individual correlations between current and past [...]
expectations of earning power and on the strength of effects [...]
on subsequent work histories of individuals. Of course, [...]
grouped these correlations and effects are likely [...]
educated women tend to have higher wage rates [...]
women throughout their working lives (see, for [...]
[...] table 3 shows, they spend a larger fraction [...]





labor force. Table 3 also shows that married mothers who currently do
not work spent, on average, less of their lifetime working than those who
currently work.

One econometric approach to an estimation of the earnings function in
the presence of endogeneity of "independent" variables is the two-stage
least-squares (2SLS) approach. We estimate work experience as a variable
dependent on exogenous variables, some of which are in the earnings
function and others outside of it. In effect, we estimate a "lifetime
labor-supply function." The second step is to replace the work-experience
variables ($e$) in the earnings function by the estimated work experience ($\hat{e}$)
from the labor-supply function. Parameter estimates in this revised
earnings function are theoretically superior to the original, simple
least-squares estimates.*

Our application of a 2SLS procedure is far from thorough, for two
reasons:

1. It is difficult to implement it on the segmented function, since each
of the segments would have to be estimated by exogenous variables. For
this purpose we aggregate years of work experience and compare the
reestimated earnings function with the original, using aggregated
experience.

2. One of the variables in our lifetime labor-supply function is the
number of children, which is not exogenous. In principle, we should
expand the equation system to three to include the earnings function, the
labor-supply function, and the fertility function. At this exploratory level
we prefer not to do it, particularly since the fertility function would be
estimated by the same variables as the labor-supply function.
The supply function obtained for all white MSP women was

$$\frac{e}{e_p} = .514 + .020\, S_F - .0064\, S_M - .062\, N_c,$$

with $t$-values of (5.1), (1.8), and (12.0) for the three slope coefficients,
where $e$ is total years of work; $e_p$ is "potential job experience," that is,
years since school; $S_F$ is education of wife; $S_M$ is education of husband;
and $N_c$ is number of children. The addition of earnings of husband
reduced the coefficient of $S_M$ to insignificance without changing the
coefficient of determination, which was $R^2 = .14$.

Estimated values of the numerator ($\hat{e}$) are used to reestimate the earnings
function. A comparison of 2SLS and OLS estimates of the earnings
function is shown in table 9. If anything, the reestimated function shows
more positive coefficients for (total) experience and stronger negative
coefficients for home time. The children variable becomes even less
significant (in terms of $t$-values) than before. The reestimation leaves our
conclusions, based on the OLS regressions, largely intact.
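The two-step logic can be sketched on synthetic data. The example below is a deliberately simplified illustration, not a replication: it uses one endogenous regressor and one excluded exogenous variable, all variable names and coefficient values are hypothetical, and the direction and size of the OLS bias are a feature of the simulated data, not a claim about the survey data.

```python
import random


def slope_intercept(x, y):
    """Simple least-squares regression of y on x; returns (slope, intercept)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


rng = random.Random(0)
n = 20000
true_coef = 0.5  # hypothetical effect of experience on the log wage

z = [rng.gauss(0, 1) for _ in range(n)]  # exogenous variable outside the wage equation
u = [rng.gauss(0, 1) for _ in range(n)]  # unobserved earning power (wage residual)
v = [rng.gauss(0, 1) for _ in range(n)]  # other influences on experience

# Experience depends on z and, because of simultaneity, on u as well.
e = [zi + 0.8 * ui + vi for zi, ui, vi in zip(z, u, v)]
ln_w = [true_coef * ei + ui for ei, ui in zip(e, u)]

# OLS: regress the log wage directly on observed experience.
b_ols, _ = slope_intercept(e, ln_w)

# Stage 1: the "labor-supply function", experience on the exogenous variable.
pi_hat, alpha_hat = slope_intercept(z, e)
e_hat = [alpha_hat + pi_hat * zi for zi in z]

# Stage 2: regress the log wage on estimated experience.
b_2sls, _ = slope_intercept(e_hat, ln_w)
```

Because estimated experience is built only from the exogenous variable, it is uncorrelated with the wage residual in this simulation, so the second-stage coefficient stays near the assumed value while the direct OLS coefficient does not.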


Since $\hat{e}$ is a function of exogenous variables, it is not correlated with the stochastic
term in the reestimated earnings function.





TABLE 9

Earnings Function, WMSP Women, OLS and 2SLS

             OLS             2SLS            OLS             2SLS
Var        b       t       b       t       b       t       b       t

C        -.20            -.06             .19             .26
S         .069   12.8     .063   12.0     .053    9.4     .048    8.5
e         .010    3.2     .012    2.7     .008    2.8     .010    1.9
h_1      -.008   -3.0    -.015   -7.7    -.007   -1.9    -.013   -5.5
h_2       .0006   0.2    -.006   -2.3     .001    0.5    -.006   -1.9
e_3       .009    3.2     .009    3.5     .009    3.4     .010    3.7
tr                                        .005    2.2     .006    2.2
cert                                      .18     5.1     .18     5.1
hlt                                      -.0003  -1.3    -.0003  -1.4
res                                       .001    1.3     .021    1.4
loc                                       .044    2.8     .042    2.5
ln Hrs                                   -.11    -5.0    -.11    -4.9
ln Wks                                    .03     1.5     .03     1.6
N_c                                      -.010   -1.3     .003    0.3

Note: WMSP = white married women, spouse present; tr = months of training; cert = certification
(dummy); see table 4 for key to other symbols.


VI. Prediction 

A test of the predictive power of the earnings function was performed on a
small sample of women who did not work in 1966 but were found in the
same first NLS survey to have returned to work in 1967. They were not
included in our analyses, but their life histories and 1967 wage rates are
available. The latter were predicted with several variants of the earnings
function and compared to the reported wage rates. On average, the
prediction is quite close, and the mean-square error is even smaller,
relative to the variance of the observed wage rates, than the residual
variance in the regressions.* In other words, the predictive power
outside the data utilized for the regressions is no smaller than within the
regressions. The test, however, is weak, because the sample is so small
(45 observations). Similar tests will be performed on larger samples of
women who return to the labor market in subsequent surveys.
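The comparison described here sets the out-of-sample mean-square error, as a share of the variance of observed log wages, against the unexplained share (one minus the coefficient of determination) of the fitted regressions. A minimal sketch of the first quantity follows; the five wage observations and predictions are made-up numbers, and `relative_mse` is an illustrative helper.

```python
def relative_mse(actual, predicted):
    """Mean-square prediction error divided by the variance of the actual values."""
    n = len(actual)
    mean_actual = sum(actual) / n
    variance = sum((a - mean_actual) ** 2 for a in actual) / n
    mse = sum((a - p) ** 2 for a, p in zip(actual, predicted)) / n
    return mse / variance


# Hypothetical log wage rates for a holdout sample and their predictions.
actual_ln_w = [5.0, 5.2, 5.4, 5.1, 5.3]
predicted_ln_w = [5.1, 5.2, 5.3, 5.1, 5.3]

holdout_share = relative_mse(actual_ln_w, predicted_ln_w)

# Unexplained share within a regression whose R-squared is, say, .28.
in_sample_share = 1.0 - 0.28
```

In this made-up case the holdout share (0.2) is below the in-sample unexplained share (0.72), which is the kind of outcome the text reports for the returning workers.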


VII. Earnings Inequality and the Explanatory Power of
Earnings Functions 

As table 10 indicates, the earnings function is capable of explaining 25-30
percent of the relative (logarithmic) dispersion in wage rates of white
married women and about 40 percent of the inequality in the rather small
sample of wage rates of single women in the 30-44 age group who worked
in 1966. The earnings function is thus no less useful in understanding the
structure of women's wages than it is in the analysis of wages of males.

* The (squared) correlation between predicted and actual wage rates was .37. The
mean of actual rates was 5.196, with a σ = .335; the mean of predicted wages was 5.187,
with a σ = .204.








TABLE 10

Earnings Inequality and Explanatory Power of Wage Functions, 1966

Group                 σ²(ln W)   R²_W   σ²(ln Y)   R²_Y   σ²(ln H)       N

Married women by
education (yrs):
  <12                   .17      .21      .81      .76      .64        435
  12-15                 .18      .17      .92      .78      .74        622
  16+                   .17      .16      .77      .74      .60         83
  Total                 .22      .28      .97      .78      .75      1,140
Single women            .30      .41      .62      .66      .32        138
Married men             .32      .30      .43      .50      .11      3,230

Note: σ²(ln W) = variance of (log) wages; σ²(ln Y) = variance of (log) annual earnings;
σ²(ln H) = variance of (log) annual hours of work; R²_W = coefficient of determination in
wage rate function; R²_Y = coefficient of determination in annual earnings function.


The dispersion of hours worked during the survey year is much greater
among married women, σ²(ln H) = .75, than among men, σ²(ln H) = .11.
The (relative) dispersion in annual earnings of women is, therefore,
dominated by the dispersion of hours worked. This factor is also important
in the inequality of annual earnings of single women and of men of
comparable ages, but much less so. It is not surprising, therefore, that the
inclusion of hours worked in the earnings function raises the coefficient of
determination from 28 percent in the hourly-wage equation to 78 percent
in the annual-earnings equation of married women, from 41 percent to
66 percent for that of single women, and from 32 to 50 percent for that of
men.
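The dominance of hours can be checked with a short variance decomposition. The sketch assumes that annual earnings are the product of the hourly wage and annual hours, so that the variance of log earnings equals the two log variances plus twice their covariance. The figures are the table 10 entries for the three main groups, read with the decimal placement .22, .97, .75 and so on, which is an interpretation of a degraded table; the dictionary and variable names are illustrative.

```python
# (variance of ln W, variance of ln Y, variance of ln H), as read from table 10.
groups = {
    "married women": (0.22, 0.97, 0.75),
    "single women": (0.30, 0.62, 0.32),
    "married men": (0.32, 0.43, 0.11),
}

decomposition = {}
for name, (var_w, var_y, var_h) in groups.items():
    # Assumed identity: var(ln Y) = var(ln W) + var(ln H) + 2 cov(ln W, ln H).
    implied_twice_cov = var_y - var_w - var_h
    hours_share = var_h / var_y
    decomposition[name] = (implied_twice_cov, hours_share)
```

On these readings the implied covariance term is essentially zero in every group, and hours account for about three-quarters of the variance of log annual earnings for married women, about half for single women, and about a quarter for married men.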

The lesser inequality in the wage-rate structure of working married
women than in the structure of male wages is probably due to lesser
average, and correspondingly lesser variation in, job investments among
individuals. At the same time, the huge variation in hours, reflecting
intermittency and part-time work as forms of labor-supply adjustments,
creates an annual earnings inequality among women which exceeds that
of men. However, the meaning of that inequality, both in a causal and
in a welfare sense, must be seen in the family context. As was shown
elsewhere (Mincer 1974), the inclusion of female earnings as a component
of family income narrows the relative inequality of family incomes
compared with that of incomes of male family-earners.

VIII. Some Applications
1. The Gap

To compare wage rates of women with wage rates of men, we analyzed
earnings of men from the Survey of Economic Opportunity (SEO) for
the same year (1966). We find that the average wage rate of white married
men, aged 30-44, was $3.18, compared with $2.09 for white married
women and $2.73 for white single women in our NLS data.
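The wage ratios discussed in this section follow directly from the three average wage rates of $3.18, $2.09, and $2.73. A short check, with illustrative variable names:

```python
import math

wage_married_men = 3.18
wage_married_women = 2.09
wage_single_women = 2.73

# Ratios of men's to women's average hourly wage rates, in percent.
ratio_to_married_women = 100 * wage_married_men / wage_married_women
ratio_to_single_women = 100 * wage_married_men / wage_single_women

# The same gaps in logarithmic terms, the scale used by the earnings function.
log_gap_married = math.log(wage_married_men / wage_married_women)
log_gap_single = math.log(wage_married_men / wage_single_women)
```

Rounded, these give the 152 percent and 116 percent ratios cited in the text; in logs the gaps are about .42 and .15.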








TABLE 11

Experience and Depreciation Coefficients, 1966, Ages 30-44

            Married Women        Single Women         Married Men
Var          b        M           b        M           b        M

S           .063    11.3         .077    12.5         .071    11.6
ê           .012     9.6
e                                .026    15.6         .034    19.4
e²                              -.0006   258         -.0006   409
e_3         .009     3.2         .009     8.0
h_1        -.015     6.7
h_2        -.006     3.5

Sources: Women, NLS, 1967; men, SEO, 1967.

Note: S = years of schooling; h_1 = home time following birth of first child; h_2 = other home time;
e = years of work experience since completion of schooling; e_3 = current job tenure; ê = 2SLS estimate of
total work experience; b = regression coefficient; M = means.


TABLE 12

Effect of Work Experience on Wage Rates

                   Relative Contribution of     Percent of Wage Gap
                                                     Explained
                   Actual        Men's
                   Experience    Experience
                     (1)           (2)            (3)        (4)
Married women       +.02          +.26            45         42
Single women        +.32          +.33             7         40
Married men         +.42          ...             ...        ...





We inquired to what extent the larger wage ratio (152 percent) of
married men to married women and the smaller one (116 percent) of
married men to single women can be explained by differences in work
histories and by differences in job investment and depreciation. For this
purpose we estimated a single earnings function of men, aged 30-44, in
SEO. The coefficients and means of the variables for these men are shown
in table 11, which also gives the NLS estimates for both married and
single women.

Note that married men and married working women have just about
the same average schooling, while never-married women are somewhat
better educated (by 1 year, on average). The coefficients of schooling are
somewhat lower for married women but higher for single women. The
big differences are in years of work experience since completion of
schooling. These are 19.4 for men, 15.6 for single women, and 9.6 for
married women. The coefficients of initial experience are .034 for men,
.026 for single women, and about half as much for married women.

Multiplying the coefficients by the variables (table 11) and summing
yields contributions of postschool investments to the (log of) wage rates,
as shown in table 12. These differences, roughly 40 percent between
husbands and wives and 10 percent between married men and single
women, are about 70 percent of the observed difference in wage rates
between married men and married women and a half of the difference
between married men and single women.
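The multiplication described above can be sketched in a few lines. This is only an illustration: the coefficients and means are transcribed from table 11, the dictionary keys are labels chosen here (not the authors' notation), and schooling is left out because only postschool terms enter the contribution.

```python
# Coefficients (b) and means (M) transcribed from table 11.
# Key names are labels chosen for this sketch; schooling is omitted because
# only experience and home-time terms enter the postschool contribution.
TABLE_11 = {
    "married women": {"e_hat": (0.012, 9.6), "e3": (0.009, 3.2),
                      "h1": (-0.015, 6.7), "h2": (-0.006, 3.5)},
    "single women": {"e": (0.026, 15.6), "e_sq": (-0.0006, 258.0),
                     "e3": (0.009, 8.0)},
    "married men": {"e": (0.034, 19.4), "e_sq": (-0.0006, 409.0)},
}


def postschool_contribution(terms):
    """Sum coefficient * mean over the experience and home-time terms."""
    return sum(b * m for b, m in terms.values())


contributions = {group: postschool_contribution(terms)
                 for group, terms in TABLE_11.items()}

for group, value in contributions.items():
    print(f"{group}: {value:+.2f}")
```

The results agree with column 1 of table 12 to within the rounding of the published coefficients.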

If one prefers to be agnostic about the human-capital approach, one 
can treat the earnings function simply as a statistical relation and the 
regression coefficients as average “effects” of work experience and of
nonparticipation on wages, without reading magnitudes of investment or
depreciation into them. In that case we may ask how much the sex
differential in wage rates would narrow if work experience of women were
as long as that of men, but the female coefficients remained as they are.
A multiplication of the female coefficients by the male variables in table 11
yields the following answers: for married women, 45 percent of the gap
would be erased; for single women, only 7 percent of the much smaller
gap (table 12, col. 3). The answer is similar for married women if the
converse procedure is used, that is, when the work experience of women is
multiplied by the male coefficients (table 12, col. 4). For single women,
the reduction of the gap is larger than in the first procedure.
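A rough sketch of the first procedure follows. Several choices are assumptions made here, since the text does not spell them out: men's experience replaces women's, home time is set to zero, current-job tenure keeps its female mean, and the gap is measured as the relative wage difference (wage ratio minus one).

```python
# Postschool coefficients and means transcribed from table 11.
MALE_WAGE = 3.18            # white married men, SEO
MALE_EXPERIENCE = 19.4      # mean years of work experience, men
MALE_EXPERIENCE_SQ = 409.0  # mean of squared experience, men


def married_women():
    """Return (actual, counterfactual, wage) for married women."""
    actual = 0.012 * 9.6 + 0.009 * 3.2 - 0.015 * 6.7 - 0.006 * 3.5
    # Assumption: men's experience replaces women's, home time falls to
    # zero, and current-job tenure keeps its female mean.
    counterfactual = 0.012 * MALE_EXPERIENCE + 0.009 * 3.2
    return actual, counterfactual, 2.09


def single_women():
    """Return (actual, counterfactual, wage) for single women."""
    actual = 0.026 * 15.6 - 0.0006 * 258.0 + 0.009 * 8.0
    counterfactual = (0.026 * MALE_EXPERIENCE
                      - 0.0006 * MALE_EXPERIENCE_SQ + 0.009 * 8.0)
    return actual, counterfactual, 2.73


def share_of_gap_erased(actual, counterfactual, wage):
    """Share of the male-female wage gap closed by the counterfactual."""
    # Assumption: the gap is the relative wage difference (ratio minus one).
    relative_gap = MALE_WAGE / wage - 1.0
    return (counterfactual - actual) / relative_gap


married_share = share_of_gap_erased(*married_women())
single_share = share_of_gap_erased(*single_women())
print(f"married women: {married_share:.0%}, single women: {single_share:.0%}")
```

Under these assumptions the counterfactual contributions match column 2 of table 12, and the shares come out in the mid-40s percent for married women and in single digits for single women, in line with column 3. Exact agreement is not to be expected from rounded inputs.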

We believe, however, that the weight of the empirical analysis of 
female earnings supports the view that the association of lower coefficients
with lesser work experience is not fortuitous: a smaller fraction of time and
energy is devoted to job advancement (training, learning, getting ahead)
per unit of time by persons whose work attachment is lower. Hence, the
45 percent figure in the explanation of the gap by duration-of-work
experience alone may be viewed as an understatement.

Indeed, comparing the annual earnings of year-round working women
and men in the 30-40 age groups, Suter and Miller found a female-to-male
earnings ratio of 46.7 percent. However, the ratio rose to 74 percent
for women in this group who worked all their adult lives. The same
comparison for high school educated persons yielded 40.5 as against
74.9 percent.¹⁹ Thus lifelong work experience reduces the wage gap by
51 or 58 percent, respectively.
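The 51 and 58 percent figures follow from the earnings ratios just quoted. A short check, where the gap is taken to be 100 minus the female-to-male ratio (the function name is ours):

```python
def gap_reduction(ratio_all, ratio_lifelong):
    """Share of the earnings gap (100 - ratio) closed by lifelong work."""
    gap_all = 100.0 - ratio_all
    gap_lifelong = 100.0 - ratio_lifelong
    return (gap_all - gap_lifelong) / gap_all


all_women = gap_reduction(46.7, 74.0)     # all year-round workers
high_school = gap_reduction(40.5, 74.9)   # high school educated persons

print(round(100 * all_women), round(100 * high_school))  # prints: 51 58
```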

¹⁹ Suter and Miller (1971, table 1). Their figures are not quite comparable
with ours: their male data come from the Current Population Survey (CPS),
and ours from SEO. They compare full-time earnings rather than wage rates,
and they compare men and women without regard to marital status.

At this stage of research we cannot conclude that the remaining
(unexplained) part of the wage gap is attributable to discrimination, nor,
for that matter, that the “explained” part is not affected by
discrimination. More precisely, we should distinguish between the concepts of
direct and indirect effects of discrimination. Direct market discrimination
occurs when different rental prices (wage rates) are paid by employers
for the same unit of human capital owned by different persons (groups).
In this sense, the wage-gap residual is an upper limit of the direct effects
of market discrimination. Indirect effects occur in that the existence of
market discrimination discourages the degree of market orientation in the
expected allocation of time and diminishes incentives to investment in
market-oriented human capital. Hence, the lesser job investments and
greater depreciation of female market earning power may to some extent
be affected by expectations of discrimination.

Of course, if division of labor in the family is equated with
discrimination, all of the gap is by definition a symptom of discrimination.
Otherwise, the analyses of existing wage gaps and of their changes over
time remain meaningful, not tautological.

Our data on work histories show some interesting trends which suggest
a prospective narrowing of the wage differential. Table 3 shows that the
uninterrupted period of nonparticipation which starts just prior to the
birth of the first child has been shrinking when older women are compared
with younger ones. Women aged 40-44 who had their first child in the late
1940s stayed out of the labor force about 5 years longer than women
aged 30-34 whose first child was born in the late 1950s. Family size is
about the same for both groups, but higher for the middle group (35-39)
whose fertility marked the peak of the baby boom. Still, the home-time
interval in that group is shorter (by about 2 years) than in the older
group and longer than in the younger. Thus, the trend in labor-force
participation of young mothers was persistent. If, by the time the
30-34-year-old women get to be 40-44 (i.e., in 1977), they have had 4 years
of work experience more than the older cohort, their wage rates will
rise by 6 percent on account of lesser depreciation and by another 2-4
percent due to longer work experience. Thus, the total observed wage
gap between men and women aged 40-44 should narrow by about
one-fifth, while the gap due to work experience should be reduced by
one-quarter.²⁰


2. The Price of Time and the Opportunity Costs of Children

The loss or reduction of market earnings of mothers due to demands on
their time in child rearing represents a measure of family investment in
the human capital of their children. This investment cost has been
measured by valuing the reduction of market time at the observed wage
rate. As pointed out by Michael and Lazear (1971), this valuation is
incomplete for two reasons. First, if job investments take place at work,
the observed wage rate understates the true foregone wage (gross or
capacity wage) by the amount usually invested during the period when
earnings are foregone. Second, as is clear from earnings-function analysis,
the reduction of market time in turn reduces future wage rates because of
a depreciation in earning power during the period of nonparticipation.
The present value of future earnings lost through depreciation is a
component of the opportunity cost of time, hence of children.²¹

²⁰ Two opposing biases mar this conjecture. The shorter home-time interval
for younger women is an average duration for those who already returned
to work. It will lengthen with the passage of time as additional women
return to the labor force. It can be shown, however, that the apparent
trend is genuine. At the same time, the assumption of unchanged
job-investment behavior leads to an understatement.

The data and the estimated wage functions permit a tentative, perhaps
only an illustrative, empirical assessment of the opportunity cost of
women’s time and of children. Specifically, the marginal opportunity
cost per hour of a year spent at home, rather than in the market,
consists of (1) the gross wage rate ($W_g$), that is, the observed but foregone
wage ($W$) augmented by currently foregone investment costs, and (2) the
present value of the reduction of the future gross wage through current
depreciation.²²

1. We can estimate $W_g$ since $W = W_g(1 - k)$, and $rk$ is estimated in
the earnings function by $a_1$, the coefficient of work experience ($e_1$)
preceding the interruption: $k = a_1/r$, where $r$ is the rate of return.

2. The present value of the reduction in $W_g$ due to depreciation, using
$r$ as the discount rate, is $(d/r) \cdot W_g$, where $d$ is the (depreciation)
coefficient of home time in our wage equations.²³
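The two components can be combined in a short calculation. The sketch below is illustrative only: the function name is ours, the depreciation coefficient is entered as a positive rate, and the example inputs are the pooled married-women values quoted earlier (wage of $2.09, experience coefficient .012, depreciation coefficient .015) rather than the education-specific inputs behind table 13. The 10 percent rate is the discount rate stated in the footnote.

```python
def marginal_opportunity_cost(wage, a1, d, r=0.10):
    """Marginal opportunity cost per hour of a year spent at home.

    wage : observed wage rate W
    a1   : coefficient of work experience preceding the interruption (r * k)
    d    : depreciation coefficient of home time, entered as a positive rate
    r    : rate of return, also used as the discount rate
    """
    k = a1 / r                                # share of capacity wage invested
    gross_wage = wage / (1.0 - k)             # W_g, from W = W_g * (1 - k)
    depreciation_loss = (d / r) * gross_wage  # present value of lost future wage
    return gross_wage + depreciation_loss


# Illustrative inputs (assumption: pooled values, not those of table 13).
cost = marginal_opportunity_cost(wage=2.09, a1=0.012, d=0.015)
print(f"marginal cost per hour: ${cost:.2f}")
print(f"observed wage as share of marginal cost: {2.09 / cost:.0%}")
```

With these inputs the observed wage is somewhat under four-fifths of the marginal price of an hour.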

The estimates of marginal opportunity costs of a year (in dollars per
hour) are shown in panel I of table 13 for three education groups of white
mothers, aged 35-39. In panel II we calculate total opportunity
expenditures incurred during the nonparticipation period following the
birth of the first child. This is the period for which the earnings functions
show significant depreciation coefficients. The length of the period
depends, in part, on the number of children. Though interpreting all of the
foregone earnings in this period as an opportunity expenditure on children
may be an overstatement, we impose an opposite bias by ignoring
subsequent periods of nonparticipation²⁴ which may also be child
induced. Figures in panel II are the marginal costs per hour (per year)
multiplied by $h$, the duration of home time. Figures in panel III are
average opportunity expenditures per child ($N$ in each group). Since $h$
is in years, the dollar figures in panels II and III should be multiplied
by annual hours of work. For example, with 1,500 hours of work per year,
the opportunity investment expenditures per child range from about
$8,000 spent in 8.8 years by mothers with less than high school education
to $17,000 spent in 5.2 years by mothers with college education or more.
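The scaling from panel I to panels II and III can be written out as below. The function name and the example inputs are hypothetical and serve only to show the arithmetic; they are not figures from table 13.

```python
def opportunity_expenditures(marginal_cost, home_years, children,
                             annual_hours=1500):
    """Total and per-child opportunity expenditures in dollars.

    marginal_cost : marginal price of an hour (panel I), dollars
    home_years    : h, duration of home time in years
    children      : N, number of children
    annual_hours  : hours of work per year used to scale to dollars
    """
    total = marginal_cost * home_years * annual_hours  # panel II, in dollars
    per_child = total / children                       # panel III, in dollars
    return total, per_child


# Hypothetical inputs, chosen only to illustrate the calculation.
total, per_child = opportunity_expenditures(marginal_cost=2.50,
                                            home_years=8.0, children=3.0)
print(f"total: ${total:,.0f}; per child: ${per_child:,.0f}")
```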


²¹ As Robert Willis suggested to us, this is strictly correct for the excess
of depreciation during home time over the depreciation at other times. As
we stated earlier, we believe that the latter is negligible in our age groups.

²² Note that we are looking at household productivity as the return, the
purpose of reducing market work, not as a negative element in costs.

²³ A 10 percent discount rate was used in these calculations.

²⁴ Inclusion would lead to a 20-25 percent increase in expenditures for the age group






Only panel I represents the marginal price of time. Note that the
observed wage rate²⁵ represents 80 percent of the marginal price of an
hour below college levels and only 60 percent at higher levels. The same
proportions hold in the other two panels. However, figures in these
panels are not prices but expenditures which depend on both the price of
time and the number of children and the average home-time interval
per child. Both of these variables can be viewed as responses to the
marginal price of time. As the table indicates, observed wage rates and, even
more so, marginal prices of time (panel I) increase with education. Lesser
fertility and closer spacing of children are the responses:²⁶ both numbers
of children and interval of home time per child diminish. Consequently,
the differences in total expenditures by education level are reduced. While
the marginal price of time of the highest education group is three times as
high as that of the lowest, the expenditures per child are a little over
twice as high, and total expenditures are only 70 percent higher.

Since the opportunity costs of labor-force withdrawal (“home time”)
are not quite the same thing as the opportunity costs of children, we again
caution the reader to view the estimates of table 13 as largely illustrative.
They clearly illustrate the point which the title of this paper intends to
convey: foregone market-oriented human capital of mothers is a part of
the price of acquiring human capital in children, and more generally, a
price exacted by family life. Of course, the greater market specialization,
longer hours, and greater intensity of work and of job training on the part
of husbands and fathers can be viewed as a “price exacted by family
life” in exactly the same sense.

Implicitly, families balance such prices against perceptions of received
benefits.²⁷ Of course, both perceptions of net benefits and prices change.
While perceptions are matters of individual psychology and of cultural
climate, the marginal opportunity cost of time has risen secularly with the
rise in real wages and with the growth of human capital. It is natural for
economists to connect to this basic fact both upward trends in labor-force
participation of women and downward trends in fertility,²⁸ changes in
the family, and even some of the rhetoric which accompanies these
developments.

²⁵ In principle, wage rates just before the period $h$ are required. The
wage at ages 35-39 represents, on average, a small overstatement: wage
profiles of married women with children are relatively flat in the age span
25-39 within education groups.

²⁶ Direct evidence on closer spacing at higher levels of education is shown
in research for a Columbia Ph.D. dissertation by Sue Ross (1973). In the
NLS data, there is a strong correlation between the length of home time
and the birth interval from oldest to youngest child.

²⁷ Some of these benefits are analyzed in the papers of Lee Benham and
Arleen Leibowitz in this volume.

²⁸ For economic analyses which bear on the upward secular trends in
labor-force participation of married women, see Mincer (1962) and Cain
(1966). For analyses bearing on fertility trends, see “New Economic
Approaches to Fertility,” J.P.E., vol. 81, no. 2, suppl. (March/April 1973).




Appendix 

Note on the Construction of Work-Experience Intervals 

The 1967 NLS survey of women aged 30-44 permits a division of time elapsed
since leaving school into, at most, eight intervals. The following information was
used in constructing these intervals: (a) Dates were available for school leaving
(S), first marriage (M), birth of first child (C), start of first job, return to labor
force after birth of first child, start of current job, and end of last job, if currently
not working. (b) Number of years during which the woman worked at least 6
months between (1) school leaving and first marriage, (2) marriage and birth of
first child, (3) return to labor force after the first child, and (4) the start of current
job.

On this basis, we describe the intervals in the order of their chronological
placement: interval $h_1$ (on average, half a year) is the interval between school
and first job; $e_1$ is the number of years of work between school and marriage. The
placement and continuity of this interval checks rather closely with the data,
though direct statements are absent; $e_2$ is years worked (similarly defined) between
first marriage and birth of first child; $h_2$ is the residual home time, given
information on the length of interval between first marriage and birth of first child. The
assumption of continuity and order of placement of $e_2$ and $h_2$ are somewhat
arbitrary. They are justified by evidence of frequent identity of job $e_1$ and $e_2$ and
the plausibility of $h_2$ starting during pregnancy. Indeed, $h_2$ is a fraction of a year,
on average; $h_3$ is the uninterrupted interval of home time following the birth of
the first child. It is placed by direct information; $e_3$ is years of work and $h_4$ the
residual amount of time in the interval between returning to the labor force at
the end of $h_3$ and start of current job. However, neither $e_3$ nor $h_4$ needs to be
continuous. The succession of $h_4$ after $e_3$ is more plausible than the converse. Also
($e_3 + h_4$) is, on average, about 3 years altogether; $e_4$ is clearly defined and placed
as the current job interval.

In tables 2 and 3 we aggregate ($e_1 + e_2$) = $e_1$, ($h_2 + h_3$) = $h_2$;
the other intervals are correspondingly renamed.

In the regressions we added $h_1$ to $h_3$ to get $h_2$, other home time. Separately, or
together, these intervals are quite short and show little effect in our analysis.



Comment 


Otis Dudley Duncan 

University of Michigan 


For readers, other than economists specializing in studies of human
capital, the most controversial sections of Mincer and Polachek’s paper
are likely to be the introductory statements of the motivation of the
research and the authors’ calculation which suggests that not less than half
the wage gap between married women and men is due to sex differences
in length and continuity of work histories.

To their initial rationalization of the division of labor within the family
in terms of “complementarity and substitution relations in the household
production process” and “absolute and comparative advantages due to
differential skills and earning powers,” the authors quickly add the
disclaimers that sex linkage of intrafamily role differentiation is “subject to
technological and cultural changes” and that such changes along with
“discriminatory attitudes ... in the labor market” are outside the scope
of the study. Most feminists and many sociologists would presumably
respond that with this much of the problem out of scope, what remains is
likely to be misleading.

In technical terms, the issue is whether the “statistical residual” $u$ in
the general specification of the model is indeed uncorrelated with the
vector $e$ so that the coefficients of $e_i$ (experience segments) in the earnings
function are unbiased estimates of the investment in human capital
appropriate to the wife’s role in the household. Mincer and Polachek do
address the problem of bias in their somewhat cursory treatment of
two-stage least-squares estimation of the earnings function. But there the
concern is with possible simultaneity of wage levels and experience
variables. The more serious question may be whether such institutional
and cultural factors as discrimination and employers’ unwillingness to
invest in training female workers, all taken to be beyond the scope of the
study and presumably captured only in the residual $u$, act as common
causes of wage levels and work experience.
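The concern about common causes can be illustrated with a small simulation. Everything in it is hypothetical: the unobserved factor, the parameter values, and the linear form are chosen only to show how a factor that lowers both experience and wages, and is left in the residual, pushes the estimated experience coefficient away from its true value.

```python
import random


def ols_slope(x, y):
    """Slope from a bivariate least-squares regression of y on x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    return sxy / sxx


rng = random.Random(0)
TRUE_RETURN = 0.02  # hypothetical true effect of a year of experience

experience, log_wage = [], []
for _ in range(20000):
    z = rng.gauss(0, 1)                  # unobserved common cause (hypothetical)
    e = 10 - 3 * z + rng.gauss(0, 2)     # the factor shortens work experience
    w = TRUE_RETURN * e - 0.10 * z + rng.gauss(0, 0.2)  # and lowers wages
    experience.append(e)
    log_wage.append(w)

slope = ols_slope(experience, log_wage)
print(f"true return: {TRUE_RETURN:.3f}, estimated: {slope:.3f}")
```

Because the omitted factor is negatively related to both variables, the estimated coefficient in this setup overstates the true return to experience.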






Indeed, in the remarks appended to the calculations on the wage gap,
Mincer and Polachek come perilously close to a premature closure of
their conceptual scheme. Having attributed half the gap to sex differences
in work experience, they go on to observe that the lower coefficient for that
experience among women may also be attributable to lower work
attachment on the women’s part. But the sex difference in work
experience and the sex difference in coefficient for work experience provide
an exhaustive decomposition of the sex difference in wages, net of initial
endowment as of the beginning of the work history. Hence, the paper
seems to be saying that women get about what they deserve in the job
market.¹

Many technical and conceptual improvements are required if the
strategy opened up by this paper is to be productive. Mincer and Polachek
seem amply aware of the frailty of their estimates, due to sample size
and crudity of measurement. The hypercritical reader, however, will not
fail to note various places where the authors seemingly read their tables a
little too smoothly.

On the conceptual side, much work remains to be done before a
serviceable model emerges from this kind of work. Treatment of the
“activities, and characteristics of other family members,” mentioned at
the outset, is underdeveloped in the present formulation, while the
labor-supply function is admittedly inadequate and a fertility function is lacking.
More directly to the point of the inquiry at hand, one is disappointed to
find no effort to include the quality of the work experience, and this is
perhaps a more serious omission than the parallel inattention to quality
of schooling. One indication that investigation of this facet of the problem
could be productive is given by table 4.7 in Dual Careers (Parnes, Shea,
Spitz, and Zeller 1970, p. 114), wherein it appears that the effect of the
rate of lifetime labor-force participation on the wage rate is substantially
greater for white-collar than for blue-collar workers among currently
employed women. In this connection Mincer and Polachek do investigate
the interaction of work experience and schooling with respect to wage
rate. Close inspection of their tables 5 and 6 turns up some disconcerting
variations in coefficients by years of schooling. At this point errors in the
data and misspecifications of the model are hard to tell apart.

One can surmise that when all the truly endogenous and relevant
causes and effects of the earnings of women are explicitly represented in
the model, it will either be underidentified or else have been broadened
to include the “sociological” exogenous variables ruled out of scope in
this initial and productive foray. In trying to get the model properly
specified, however, one need not expect too much help from the discipline
of sociology as such, geared as it is to the production of dialectic rather
than discovery.

¹ The authors’ own views on the points at issue are made much clearer in
their revision, which was not available when this comment was prepared.



Home Investments in Children 


Arleen Leibowitz 

National Bureau of Economic Research and Brown University


By the time children enter first grade, significant differences in verbal
and mathematical competence exist among them.¹ These differences
reflect variations in (1) inherent ability, and (2) the amounts of human
capital acquired before the children reach the age of six.² The stocks of
acquired human capital reflect, in turn, varying inputs of time and other
resources by parents, teachers, siblings, and the child. The process of
acquiring preschool human capital is analogous to the acquisition of
human capital through schooling or on-the-job training.

Assuming a constant rental rate for human capital, earnings can be
interpreted as a measure of capital stocks at later ages. The IQ also can be
interpreted as such a measure of human capital stocks. It is related to
some commonly used inputs of human capital, for it is well known that
measured IQ is not independent of years of schooling acquired before the
age of testing. At preschool ages IQ measures should be related to human
capital inputs in early childhood as well as to inherent genetic ability.
Viewing measured ability as an index of the stock of human capital puts
a different light on earnings functions which include ability and schooling.
If contemporaneous ability and schooling measures are used to predict
earnings (as in Hansen, Weisbrod, and Scanlon 1970), it is not surprising
to find that earnings are more closely related to an ability measure, which
is an index of human capital stocks, than to years of schooling, which is
merely a partial measure of one of the inputs. In this respect
family-background variables included in an earnings function may be interpreted
as proxies for early investments in human capital if characteristics of the
father and mother are systematically related to the investments in time
and goods that they make in their children.

This research has been supported by grants to the National Bureau of Economic
Research from the Office of Economic Opportunity and the National Institutes of Health.
A National Science Foundation grant to the National Bureau of Economic Research
funded the data preparation. I gratefully acknowledge the very competent research
assistance of David L. Lindauer and Arun K. Mukhopadhyay. For helpful comments
on a previous draft of this paper I thank Charles Betsey, Barry Chiswick, John Hause,
Jacob Mincer, and Finis Welch.

¹ This is one of the important findings of the Coleman Report (see Mosteller [1972,
p. 49]).

² Genetic factors cannot account for the entire difference, as studies of identical twins
show. In one such study, identical twins raised apart showed an average difference in
IQ of 14 points (Kagan 1969, p. 275).


I. A Causal Model


Consider a model in which earnings depend upon human capital stocks 
which are built up over the life cycle through schooling, on-the-job 
training, and home investment 

By adapting Ben-Porath’s well-known model of the acquisition of
human capital over the lifetime through investments, we postulate a
variant of the Ben-Porath production function for human capital
(Ben-Porath 1967, p. 360):

( 1 ) 


where $Q_t$ = gross additions to human capital stock in time $t$, $K_t$ = the
stock of human capital existing at $t$, $s_t$ = the proportion of $K_t$ allocated to
producing $Q_t$, $0 \le s_t \le 1$, $D_t$ = the quantity of goods inputs allocated
to producing $Q_t$, and $\beta_0$ = the Hicks neutral-efficiency parameter for
producing human capital; the remaining parameters of the production
function lie strictly between 0 and 1. The rate of change in the
capital stock is given by

$$\dot{K}_t = Q_t - \delta K_t,$$

where $\delta$ is a constant depreciation rate. The stock at time $T$ can be
defined as

$$K_T = K_0 + \int_0^T \dot{K}_t \, dt,$$

where $K_0$ = the initial stock or inherent genetic ability.
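A discrete-time sketch of the accumulation rule may help fix ideas. The path of gross additions, the depreciation rate, and the initial stock below are hypothetical values chosen for illustration; the gross additions are simply taken as given rather than derived from the production function.

```python
def capital_path(k0, gross_additions, delta):
    """Discrete-time version of dK/dt = Q_t - delta * K_t.

    k0              : initial stock K_0
    gross_additions : sequence of Q_t, one value per period
    delta           : constant depreciation rate
    """
    path = [k0]
    for q in gross_additions:
        k = path[-1]
        path.append(k + q - delta * k)  # net change is Q_t minus depreciation
    return path


# Hypothetical inputs: constant gross additions of 0.5 for ten periods.
path = capital_path(k0=1.0, gross_additions=[0.5] * 10, delta=0.05)
net_changes = [b - a for a, b in zip(path, path[1:])]

# The terminal stock equals the initial stock plus the accumulated net changes.
print(f"K_T = {path[-1]:.3f}, K_0 + sum of changes = {1.0 + sum(net_changes):.3f}")
```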

The marginal-cost curve which corresponds to human capital
production function (1) is shown by Ben-Porath to be


MC, = 

^oVi 


fil ^0/ 




where $\beta = (\beta_1 + \beta_2)$. Thus, the marginal cost of acquiring human
capital is a function of existing stocks if $\gamma_1 \ne \gamma_2$. If $\gamma_1 > \gamma_2$,
marginal cost rises with the level of existing stocks, and human capital
increases the productivity of time in the market more than the productivity
of time in producing new capital. However, if $\gamma_1 < \gamma_2$, the marginal
cost of producing new capital falls with existing stocks. The “demand price”
depends, however, only on the rental price for human capital, the rate
of interest and depreciation, and on the time remaining in the working
lifetime. Thus, we would expect persons with greater initial levels of
human capital to demand more additions to the stock if $\gamma_1 < \gamma_2$.

Figure 1 helps us to visualize the process. Parents’ genetic
characteristics determine the heredity of the child. I have shown (Leibowitz 1972)
that the quantity of time devoted to children is positively related to
parents’ education, and there is evidence that the quality of time inputs is
also positively related to education (Schoggen and Schoggen 1968). Thus,
parents’ genetic endowment and their education determine the quality
and quantity of time inputs to the child. Parents’ attributes also determine
family income, which affects the amount and quality of time and goods
inputs. These inputs comprise home investment, which, along with
heredity, determines IQ. Final schooling level is determined by family
income and stocks of human capital. Home investments may affect
capital stocks in a manner not reflected in measured IQ. Thus, a variable
such as the education of a mother may affect the schooling level both
directly and through heredity. Income, which is a rent on the stock of
human capital, depends on the four major sources of capital: home
investment, measured ability, final schooling level, and postschool
investment. This model can be written as a recursive system:

$$IQ = f_1(G, I_1),$$

$$I_2 = f_2(IQ, I_1, Y_f),$$

$$Y = f_3(I_1, I_2, I_3, IQ),$$

where $G$ = genetic factors, $I_1$ = home investment, $I_2$ = schooling
investment, and $I_3$ = postschool investment.




Clearly, in this system variables which are endogenous in one equation
are predetermined in equations of higher number. This is true because
the endogenous variables are determined at various early stages of the life
cycle. That is, childhood IQ is determined prior to schooling, which is
determined prior to earnings. If we assume that disturbances from each
pair of equations are uncorrelated in the probability limit, this system
may be estimated consistently by the use of ordinary least squares.
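Equation-by-equation least squares on a recursive system can be illustrated with simulated data. The sketch below assumes linear forms for the three functions, independent disturbances, and arbitrary parameter values; none of these come from the paper, and the small solver is written out only to keep the example self-contained.

```python
import random


def ols(columns, y):
    """Least-squares coefficients (intercept first) via the normal equations."""
    rows = [[1.0] + list(r) for r in zip(*columns)]
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * v for r, v in zip(rows, y)) for i in range(k)]
    aug = [xtx[i] + [xty[i]] for i in range(k)]
    for col in range(k):  # Gauss-Jordan elimination with partial pivoting
        pivot = max(range(col, k), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(k):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [aug[i][k] / aug[i][i] for i in range(k)]


rng = random.Random(1)
n = 5000
draw = lambda: [rng.gauss(0, 1) for _ in range(n)]

# Exogenous variables and mutually independent disturbances (assumptions).
G, I1, Yf, I3 = draw(), draw(), draw(), draw()
u1, u2, u3 = draw(), draw(), draw()

# Hypothetical linear versions of f1, f2, f3, generated in recursive order.
IQ = [0.6 * g + 0.4 * i + u for g, i, u in zip(G, I1, u1)]
I2 = [0.5 * q + 0.3 * i + 0.2 * y + u for q, i, y, u in zip(IQ, I1, Yf, u2)]
Y = [0.2 * a + 0.4 * b + 0.3 * c + 0.1 * q + u
     for a, b, c, q, u in zip(I1, I2, I3, IQ, u3)]

b1 = ols([G, I1], IQ)          # IQ equation
b2 = ols([IQ, I1, Yf], I2)     # schooling equation
b3 = ols([I1, I2, I3, IQ], Y)  # earnings equation

print("IQ equation:       ", [round(b, 2) for b in b1])
print("schooling equation:", [round(b, 2) for b in b2])
print("earnings equation: ", [round(b, 2) for b in b3])
```

With independent disturbances, each equation's estimates land close to the parameters used to generate the data, even though IQ and schooling appear as regressors in later equations.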

The rest of this paper reports an investigation of the possible returns to
home investment, concentrating on the following questions:

1. Do home investments add to preschool stocks of human capital, as
measured by IQ?

2. Is the amount of schooling achieved affected by early stocks of
human capital and by later home investments?

3. Do home investments affect earnings if other forms of human capital
are held constant? What is the bias to the coefficient on schooling
(the rate of return to schooling) if home investments are omitted
from the earnings function?

4. Does an early measure of ability affect earnings if schooling and
home investments are held constant? What is the bias to the
coefficient on schooling if ability is omitted from the earnings
function?

To investigate these questions a unique data set called the Terman
sample is used; it is described in Section II. In Sections III-V, the
estimation of the childhood human capital, education, and earnings
functions is discussed.


II. The Nature of the Data³

In 1921, Lewis M. Terman, a Stanford University psychologist, initiated a
study to evaluate the physical, mental, and personality traits of California
school children who scored in the top 1 percent of the national IQ
distribution, and to follow those children as far into adult life as possible
to see if high IQ was a good predictor of success in later life. (The data
generated will be referred to hereafter as the Terman sample.) When
Terman began to test the children, he selected 1,528 of them with
IQ scores of 140 or above, corresponding to the top 1 percent of the
intelligence distribution.

The original sample consisted of 857 boys and 671 girls. The larger
proportion of males among the group can probably be accounted for by
the larger variability in tested IQ among boys than among girls.


³ The data description in this section draws heavily upon discussion in Terman and
Oden (1959). I am indebted to Susan Crayne of the National Bureau of Economic
Research for devising an ingenious method for decoding the column binary tape of
data and for doing the decoding.




The method of selection was not free from bias. Budget constraints
precluded testing every child in the California schools. Rather, in grades
three through eight of schools in urban areas, each classroom teacher
listed the three brightest children in the class and the youngest child.
These children were further screened by a group IQ test, and the
“promising” candidates were given the Stanford-Binet test. The original
criterion for inclusion in the study group was a Binet IQ score of 140 or
above, although 65 pupils who tested between 135 and 139 were also
included. High school students nominated by their teachers as being
bright were chosen if their score on the Terman group test fell within the
top 1 percent of the distribution. From later evaluations, it was estimated
that 90 percent of the eligible children had been included in the sample.
The mean IQ of the students on the Stanford-Binet test was 151.5 for
boys and 150.4 for girls at the time they were included in the study.
Average age of Binet-tested pupils was 9.7 years and that of group-tested
high school students was 15.2 years at the time they were selected.

The sample is thus not in any sense representative of California school
children in the 1920s. The children were atypical physically and socially
as well as intellectually. They tended to be taller and stronger and to
mature earlier than their classmates. Fathers of the students had a median
12.4 years of schooling and the mothers, 11.7 years, nearly 4 years above
the average for their generation in the United States.

The achievement of sample members also was above average. About
70 percent of the students in the sample finished college, as contrasted
with 8 percent of the United States population in their cohort. The
average earnings of males in the sample in 1960 exceeded those of men of
equivalent age and years of schooling by 30-50 percent.

At the outset, in 1921, in addition to IQ scores, many data about the
home backgrounds of the students were collected. Information on the test
group members was updated in 1929, 1940, 1950, 1954, and 1960 to
include current data on schooling, earnings, occupation, and data on
other variables. Although the data collection spanned almost 40 years, the
retention rate in the sample was very high. Of the original 1,528* sample
members, 104 (6.8 percent) died by 1955; after 1928 only 28 were lost
track of until, following 1945, six more were not heard from.


III. Determinants of Ability

Our discussion up to this point has emphasized that ability as measured
by IQ or other similar tests is not only the result of inherited attributes
but also of acquired human capital. Although schooling level may affect
measured ability, this will not be the case in our analysis, where ability is
the childhood IQ score determined in 1921-22. The IQ measure is the
score on the Stanford-Binet or Terman group test (TGT).

* Regressions in this paper are based on the 821 males and 643 females for whom
complete childhood records were available. Sample sizes may be further reduced in
certain regressions due to missing data.

Several measures of home investment are readily available in the data.
The respective educations of the father and the mother are important
indices of the quality of the time spent with the child and should be
positively related to IQ. In addition, as I have shown elsewhere (Leibowitz
1972), education is also positively related to the quantity of time spent with
the child. Since mothers’ time expenditures on children exceed those of
fathers by at least a factor of 4,* we would expect the significance and size
of the coefficient of mother’s education to exceed that of father’s education.
However, education of the parents may be a proxy for the quality of
genetic inheritance, if intelligence of the parents and their own education
are positively correlated, and if intelligence can be transmitted genetically.
If education is acting as a proxy for heredity, the coefficient of each
parent’s education on child’s IQ should not differ significantly. A comparison
of the size and significance of these two variables will indicate
whether parents’ schooling is influencing IQ directly through the quality
and quantity of time spent with children or whether it is a proxy for
inherited ability.

A more direct measure of the quantity of time spent with children in
preschool investment results from the parents’ estimate in 1922 of how
much home instruction they provided their child. Two dummy variables
were generated: HOMETR2 = one if parents conducted “an appreciable
amount of instruction along particular lines,” and zero if they did not;
HOMETR1 = one if parents spent a “considerable number of hours, but
chiefly reading, telling stories, and writing,” and zero if they did not do so.
The omitted category covered “no instruction, other than usual amount
of reading and telling stories.”
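
The coding of the two home-instruction dummies can be illustrated with a short sketch. The integer category codes and the function name below are hypothetical, introduced only for illustration (the text describes the three categories in words only); the names HOMETR1 and HOMETR2 follow the text.

```python
# Sketch of the dummy-variable coding described above.
# ASSUMPTION: the integer category codes are hypothetical labels chosen
# for this illustration; the text gives only verbal descriptions.
USUAL_READING_ONLY = 0   # omitted (reference) category
MANY_HOURS_GENERAL = 1   # "considerable number of hours ..."      -> HOMETR1
PARTICULAR_LINES = 2     # "instruction along particular lines"    -> HOMETR2


def home_training_dummies(category):
    """Return the pair (HOMETR1, HOMETR2) for one child's category code."""
    valid = (USUAL_READING_ONLY, MANY_HOURS_GENERAL, PARTICULAR_LINES)
    if category not in valid:
        raise ValueError("unknown home-instruction category: %r" % (category,))
    hometr1 = 1 if category == MANY_HOURS_GENERAL else 0
    hometr2 = 1 if category == PARTICULAR_LINES else 0
    return hometr1, hometr2
```

Because the omitted category is coded (0, 0), each dummy's coefficient is read as a difference from children who received no instruction beyond the usual reading and storytelling.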

It is a consistent finding of psychological studies that first children and
children from smaller families score higher on IQ tests and achieve more
(for instance, get more schooling). The analysis of family investment
presents a rationale for this phenomenon, since the more children there
are, the less time input parents have available per child. For the same
reason children of lower birth order can be expected to receive more time
inputs. Both the number of siblings and birth order have been included
as proxies for home investments.

In addition, the regressions included a dummy variable equal to one
if the IQ test was the Terman group test rather than the Stanford-Binet
test. About 33 percent (268) of the male subjects and 26 percent (168) of
the females had taken the TGT, in which average scores were 8.5 points
below the Stanford-Binet version of the IQ test. Terman’s analysis shows that
the TGT-tested students were at least as select a group as the Binet-tested
ones, but scores were shifted down. The assumption is made that the two
tests are identical in all other respects. Means and standard deviations of
the variables are presented in Appendix C.

* This is based on 1967 data (Leibowitz 1972). It appears unlikely that in 1922 fathers
spent more time with children than mothers did.

A. Estimation for Males

In table 1 the regression of family-investment variables on IQ measured
at age 11 is presented for the 821 males in the sample. Both home-investment
variables were positive, with HOMETR2 having a stronger
positive impact on IQ than HOMETR1. Thus, the quality of time inputs
by parents did raise measured IQ, and special instruction had more of an
impact than generalized time inputs. Since this holds the quantity of
time inputs constant, the education variables can be taken to measure the
quality of time. The mother’s education is a very significant predictor of
IQ, while the coefficient for father’s education is not significantly different
from zero. This suggests, as argued above, that education is a proxy for
home investment and not solely for inherited factors. Psychologists have
also shown (Kagan and Moss 1959) that correlation between child’s IQ
and maternal education is greater than that between child’s IQ and
paternal education, while the correlations between child’s IQ and those
of mother and father do not differ significantly.

Birth order is represented by the variable BORDER, which was coded
1 for a first-born child, 2 for the second-born, and so on. This variable
thus contains some information about number of siblings as well as about
birth order. The negative effect of birth order, while quantitatively small,
is quite significant. If, instead of birth order, number of siblings is used,
its coefficient is significantly different from zero only at the 10 percent
level of significance. If both are included in the equation, birth order’s
coefficient remains significantly different from zero, while number of
siblings does not. Estimated family income in 1922 (YK) did not prove to
be a significant predictor of childhood IQ, as seen in table 1, column 2.
When the equations are based on the 553 boys who took the Stanford-Binet
test, thus allowing us to dispense with the TGT dummy variable,
the coefficients are substantially the same (see col. 3, table 1).

B. Estimation for Females

The same variables which accounted for 18.6 percent of the variance in
IQ of males in the sample account for only 13 percent of the variance of
females’ IQ (see table 1, col. 4). Apart from the TGT dummy and family
income, none of the variables meets conventional levels of significance.
Neither the direct measures of the quantity of time inputs (HOMETR1
and HOMETR2) nor the parents’ education variables have coefficients
significantly different from zero. Birth order and number of siblings have
no significant impact on measured IQ. The only other variable which was
significantly related to IQ was YK, a measure of relative family incomes,
generated from data on the father’s occupation in 1922. (The construction
of this latter variable is described in Appendix A.)

When the equation is estimated for only those subjects who took the
Stanford-Binet test, the only significant variable is family income (table 1,
col. 7). However, when the sample covers only the girls who took the
Terman group test (as in table 1, col. 8), mother’s education and
HOMETR2 are positively related to childhood IQ, as is the case for males,
while family income is not significantly related to IQ. It should be noted
that the girls chosen by the TGT were older, and probably constituted a
more selective group than the girls chosen by the Stanford-Binet test.
The ratio of boys to girls in the TGT group was 1.83 and in the Stanford-Binet
group 1.16 (Terman 1925, p. 560). The greater selectivity of the
older group is consistent with the fact that IQ of girls falls more than that
of boys over time (Sontag, Baker, and Nelson 1958, pp. 22-32). The TGT
group had high abilities which endured at least until their adolescence.

Although a drop in IQ on retest is expected on statistical grounds due
to regression toward the mean, girls showed greater decreases in IQ
than boys when their IQs were retested after an interval of several years.
In this sample, 54 children younger than age 13 in 1927-28 were again
given the Stanford-Binet test. The 13-point average decrease in IQ for
girls was significantly different from the three-point average drop for
boys.*
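
The size of this sex difference relative to its sampling variability can be worked out directly from the figures given in the text and in the accompanying footnote (an approximate standard deviation of the difference of two points). Treating the ratio of the difference to its standard deviation as the yardstick of significance is an assumption of this sketch; the source does not spell out the test that was used.

```python
# Figures quoted in the text and its footnote.
girls_avg_drop = 13.0    # average IQ decrease for girls, in points
boys_avg_drop = 3.0      # average IQ decrease for boys, in points
sd_of_difference = 2.0   # approximate standard deviation of the difference

# ASSUMPTION: significance is judged by the ratio of the difference
# to its standard deviation.
difference = girls_avg_drop - boys_avg_drop
ratio = difference / sd_of_difference

print(difference, ratio)
```

A ten-point gap is five times its standard deviation, which is why the difference is described as significant even though only 54 children were retested.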

These results suggest that the kind of intelligence that endures through
adolescence (as was the case with the TGT-selected girls) is systematically
related to home investments. However, in the younger group the factors
which account for their scoring in the top 1 percent of the IQ distribution
are more evanescent and are not related to home investments. Home
investments were related to older girls’ IQ differences that were stable
over time, but only income was related to younger girls’ IQs, which faded
over time.

IV. Determinants of Schooling Level

The model postulates that final schooling level depends on ability, 
family income, and home investments. If human capital is not equally 
productive in the market and in the acquisition of new human capital, 
measures of human capital stock (such as childhood IQ) should be 
related to years of schooling.

* The approximate standard deviation of the difference is two (Terman, Burks, and
Jensen 1930, p. 25). Four hundred older students also were retested in 1927-28, but the
difference by sex was not significant (ibid., p. 33).





The impact of family income on schooling attainment may be quite
attenuated in this sample. First, all sample members were residents of
California and had access to the “free” state university system, which is of
high quality. Second, because of their demonstrated scholastic aptitude,
the students without sufficient family resources to finance the indirect
costs of schooling would have been eligible for scholarships. Two alternative
estimates of parental income were constructed (as described in
Appendix A). Further, additional numbers of siblings in a family are
assumed to have the effect of reducing the amount of family support for
schooling and to indicate lower levels of home investments. The effects of
the quantity of preschool home investments (the HOMETR variables)
should have been captured in the childhood IQ measure, but measures of
home investments made after the childhood IQ test was taken can be
expected to have a positive impact on schooling levels. Later home
investments may be assumed to be positively correlated with early home
investments as well as with parents’ education. If the constructed income
variable measures family income imperfectly, father’s education may be
a proxy for income as well as for home investment. (Definitions of
schooling levels and their frequencies are given in Appendix B.)

A. Estimation for Males 

Table 2 presents the equations estimating education levels for the 780
males for whom we had 1940 data and the 781 males for whom we had
complete 1950 data.

The regressions for both years show that childhood ability is positively
related to schooling level. This fact indicates that the kind of human
capital that IQ represents enhances productivity in acquiring schooling
more than it does productivity in the labor market. This is consistent with
the general finding of psychologists that the one thing IQ predicts best
is success in school.

Both the education of the mother and of the father are positively and
significantly related to schooling achieved, and the coefficients are not
significantly different from each other. The effect of other children in the
family (SIBS) is negative and significant at the 10 percent level in 1940.
By 1950, the subjects were nearly 40 years old, and the effect of siblings in
restricting resources available for schooling seems to have leveled off as
other sources of funding were developed.

The quantity-of-home-investment variables are not significantly
different from zero at the 10 percent level. Since these variables were
important determinants of IQ, their negative signs may indicate that
intensive training may temporarily boost IQ (as measured at age 11)
but that this increase fades as time passes. Thus, the quantity-of-home-investment
variables are correlated with a transitory increase in IQ





TABLE 2

Years of Schooling of Males in Terman Sample

                                  1940                1950
Independent Variable           (1)       (2)       (3)       (4)

EDMOT                        0.124     0.131     0.134     0.141
                            (3.05)    (3.18)    (3.35)    (3.47)
EDFAT                        0.110     0.122     0.149     0.163
                            (3.51)    (3.22)    (4.82)    (4.35)
CIQ                          0.021     0.023     0.024     0.025
                            (2.16)    (2.29)    (2.46)    (2.57)
TGT                          0.753     0.765     0.500     0.511
                            (3.54)    (3.59)    (2.38)    (2.43)
HOMETR1                        ...    -0.192       ...    -0.124
                                      (0.80)              (0.53)
HOMETR2                        ...    -0.293       ...    -0.248
                                      (1.13)              (0.96)
SIBS                        -0.118    -0.123    -0.058    -0.063
                            (2.01)    (2.09)    (1.01)    (1.08)
YK                             ...    -0.782       ...    -0.929
                                      (0.58)              (0.69)
C                            10.03     10.38      9.39      9.85
R²                           0.078     0.080     0.102     0.104
Number of observations         780       780       781       781

Note: t-values are in parentheses.


which has faded by the time schooling decisions are made.* Family
income was not significantly related to schooling levels.

In summary, these regressions indicate that the schooling level for
males is positively related to ability, that larger numbers of siblings may
cause schooling to take longer to complete due to the difficulty of financing
it (or the obligation to put younger brothers and sisters through school),
but the final level of schooling is independent of family size. Finally, the
levels of parents’ education, representing the quality and quantity of home
investments, have a positive effect on schooling levels, while the quantity
of preschool investments does not. Mother’s and father’s education have
equal impacts on schooling, while mother’s education was much more
important in determining IQ. Father’s education may be a proxy for
income, although a more direct measure of income was not significantly
related to schooling levels.


B. Estimation for Females

In contrast to males, schooling levels of females are not significantly
related to ability (see table 3, cols. 1 and 4). They are, however, related

* This transitory effect on IQ of intensive preschool programs has also been seen in
programs outside the home, such as Head Start. Finis Welch pointed out this analogy to
me.









TABLE 3

Years of Schooling of Females in Terman Sample

                                    1940                       1950
Variable                      (1)      (2)      (3)      (4)      (5)      (6)

EDMOT                       0.163    0.180    0.161    0.158    0.180    0.152
                           (4.67)   (2.89)   (3.77)   (4.73)   (3.01)   (3.70)
EDFAT                       0.085    0.085    0.112    0.073    0.043    0.113
                           (2.73)   (1.52)   (3.55)   (2.43)   (0.79)   (3.70)
CIQ                         0.011   -0.011    0.014    0.009   -0.019    0.012
                           (1.28)   (0.36)   (1.51)   (1.15)   (0.63)   (1.44)
TGT                         0.669      ...      ...    0.491      ...      ...
                           (3.47)                     (2.65)
SIBS                       -0.136   -0.152   -0.146   -0.126   -0.131   -0.143
                           (2.42)   (1.62)   (2.08)   (2.34)   (1.47)   (2.12)
YK                          1.524    0.682    0.076    1.705    1.034    0.083
                           (1.30)   (0.31)   (1.30)   (1.51)   (0.48)   (1.44)
FIRSTBORN                   0.349    0.513    0.274    0.232    0.340    0.164
                           (1.97)   (1.60)   (1.26)   (1.36)   (1.10)   (0.79)
C                           9.311   13.637   10.004    9.817   14.978   10.531
R²                          0.166    0.177    0.153    0.154    0.145    0.152
Number of observations        606      158      448      615      158      457

Note: t-values are in parentheses.


to parents’ schooling levels. Although the difference between the coefficients
on mothers’ and fathers’ education is greater for the girls than for
the boys, this difference is not significant at the 5 percent level. As with
males, the more children in the family, the less schooling achieved, but
for women the negative effect of a larger family on schooling attainment
is not overcome by 1950. Girls who were the oldest child in their family
got more schooling, as seen by the positive effect of the variable FIRSTBORN.
For boys, once number of siblings was held constant, birth order
did not matter. Family income and quantity of home investment in preschool
years are not strongly related to years of schooling achieved.

In table 3, columns 2 and 5 refer to the girls selected in high school by
the TGT and columns 3 and 6 refer to the Stanford-Binet-selected girls.
The mother’s education seems to be relatively more important in
determining schooling attainment of TGT girls, while father’s education,
family income, and number of siblings are relatively more important
determinants of schooling for the Stanford-Binet group.

In summary, there is some evidence here that a mother’s characteristics
have a stronger impact on schooling achieved by females than by males,
particularly for the girls chosen in high school. Further, number of
siblings and family income are more closely related to females’ schooling
levels than to those of males, while ability is more strongly related to the
males’ schooling levels than to those of females. If we apply Becker’s
analysis (1967, p. 17), these results suggest that for girls the supply curve
of funds shifts relative to a comparatively stable demand curve for
schooling, while the opposite is true for males. This implies that males
face greater equality of opportunity for schooling than do females.

C. Total Effect of Mother's Education

The total effect of mother’s education on schooling achieved is composed
of the direct effect and the indirect effect via IQ. For a male, each year of
mother’s education results in an addition of .14 of a year of schooling in
1940 and .15 of a year in 1950. Thus, the effect of having a mother who
is a college graduate compared to one who is a high school graduate is to
increase schooling attainment by .6 of a year.

For girls who took the TGT, each additional year of a mother’s
schooling resulted in .18 of a year of schooling attained in 1940 and
.17 of a year in 1950. For the girls who were selected through the Stanford-Binet
test, an additional year of mother’s schooling added .16 of a year
to schooling attained by 1940 and .15 of a year to schooling attained by
1950.
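
The decomposition into a direct effect and an indirect effect via IQ can be sketched as follows for the males. The direct coefficients and the CIQ coefficients are those of table 2, columns 1 and 3. The effect of a year of mother's schooling on childhood IQ comes from table 1, which is not reproduced here, so the value of 0.75 IQ points used below is purely hypothetical and serves only to show the mechanics; the function name is likewise illustrative.

```python
def total_effect(direct, ciq_coef, edmot_on_iq):
    """Direct effect plus the indirect effect that runs through childhood IQ."""
    return direct + ciq_coef * edmot_on_iq


# HYPOTHETICAL: IQ points gained per year of mother's schooling.
# This is an illustrative value, not an estimate reported in the paper.
edmot_on_iq = 0.75

# Table 2: EDMOT and CIQ coefficients for males, 1940 (col. 1) and 1950 (col. 3).
effect_1940 = total_effect(0.124, 0.021, edmot_on_iq)
effect_1950 = total_effect(0.134, 0.024, edmot_on_iq)

# College graduate versus high school graduate: four more years of schooling,
# valued at the rounded total effects quoted in the text.
extra_years = 4
gain_1940 = extra_years * 0.14
gain_1950 = extra_years * 0.15

print(round(effect_1940, 2), round(effect_1950, 2))
print(round(gain_1940, 2), round(gain_1950, 2))
```

Under this assumed IQ effect the totals round to .14 and .15, and four extra years of mother's schooling translate into .56 to .60 of a year, that is, roughly .6 of a year.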


V. Determinants of Income 

A. Schooling and Postschool Investments

According to the model presented in figure 1, income is derived from the
rent on different forms of human capital: ability and home, school, and
postschooling investment. The relation to be estimated is of the form
\(Y = \beta_1 + \beta_2 I_2 + \beta_3 I_3 + \beta_4 (I_3)^2 + u\), where \(Y\) = log of earnings in a
given year, \(I_2\) = years of schooling, \(I_3\) = years of experience, and
\(u\) = other income-affecting factors which are uncorrelated with the
investment variables.

Using the approximation that \(\log(1 + r) \approx r\), and assuming that
rates of return are constant over time, the coefficient \(\beta_2\) can be interpreted
as equal to \(rK\), where \(r\) is the rate of return to investments made in
schooling and \(K\) is the ratio of investment costs to gross potential earnings
during the schooling period.*

A common assumption in interpreting human capital earnings
functions is that \(K = 1\), since opportunity costs probably account for the
major part of investment costs and part-time earnings are assumed to
offset tuition charges. Under these conditions, \(\beta_2\) can be interpreted as
the rate of return to schooling. The inclusion of the number of years of
experience and its square results from a Taylor series expansion of a
series of on-the-job investments whose ratio to earnings declines linearly
with time (see Mincer 1973, pp. 3-6).
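
To make the functional form concrete, the following sketch evaluates the earnings function and locates the peak of the quadratic experience profile. The coefficients are those of column 1 of table 5 (1950). The function names are illustrative, the intercept is set to a placeholder because only differences in log earnings are used, and the implied peak is simple arithmetic on those coefficients rather than a figure reported in the paper.

```python
def log_earnings(schooling, experience, b1, b2, b3, b4):
    """Y = b1 + b2*I2 + b3*I3 + b4*I3**2, with Y the log of earnings."""
    return b1 + b2 * schooling + b3 * experience + b4 * experience ** 2


def experience_at_peak(b3, b4):
    """Experience at which the quadratic profile peaks (requires b4 < 0)."""
    if b4 >= 0:
        raise ValueError("the profile has an interior maximum only if b4 < 0")
    return -b3 / (2.0 * b4)


# Table 5, column 1: schooling, experience, and experience squared.
B2, B3, B4 = 0.062, 0.131, -0.0018
B1 = 0.0  # placeholder intercept; it cancels in the difference below

# One more year of schooling at a fixed (hypothetical) 10 years of experience.
gain_from_one_year = (log_earnings(17, 10, B1, B2, B3, B4)
                      - log_earnings(16, 10, B1, B2, B3, B4))
peak = experience_at_peak(B3, B4)

print(round(gain_from_one_year, 3), round(peak, 1))
```

The gain in log earnings from an additional year of schooling equals the schooling coefficient whatever the level of experience, which is what allows that coefficient to be read as a rate of return when K = 1.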

The parameters of the above equation were estimated using data on
income earned by men in the sample in 1939, 1949, and 1959. Since
labor-supply data were never obtained, the earnings functions could not
be estimated for women in the Terman sample. Only males who were no
longer students and who were not unemployed were included in order to
achieve as nearly as possible a sample of men who were employed full
time.

* This approach is fully developed by Becker and Chiswick (1966).

The dependent variable in 1939 was the full-time monthly wage or
salary. For 1949 and 1959, the dependent variable was annual earned
income. Years of schooling were assigned to correspond to rather detailed
descriptions of schooling level given in the questionnaires (see Appendix
B). Experience was calculated as: age - age at high school graduation -
(final education level - 12). If we employ the useful assumption that
\(K = 1\), the regression results presented in column 1 of tables 4-6 imply a
rate of return to schooling of between 5.6 and 6.7 percent. This is low
compared with other estimates of rates of return to college graduates.
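
The experience measure defined in this paragraph is a simple subtraction, sketched below. The function name and the example ages are hypothetical; only the formula comes from the text.

```python
def years_of_experience(age, age_at_hs_graduation, final_education_level):
    """Age minus age at high school graduation minus schooling beyond grade 12."""
    return age - age_at_hs_graduation - (final_education_level - 12)


# Hypothetical example: a 39-year-old who finished high school at 18
# and completed 16 years of schooling.
example = years_of_experience(39, 18, 16)
print(example)
```

Each year of schooling beyond high school therefore reduces measured experience by one year for a man of given age.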

Rates of return calculated for graduate training are usually lower than
those for bachelor of arts degree holders, and 44 percent of this sample
had more than 16 years of schooling. There is also reason to believe that
the assumption that \(K = 1\) is less tenable for this sample than for the
average college graduate, since 24 percent of the Terman undergraduates
had scholarships or fellowships amounting to $200 or more and 10 percent
had received $1,000 or more in support of their studies. If \(K < 1\),
\(\beta_2 < r\); thus coefficients on years of schooling appear to be underestimates
of the rate of return to education for this sample.
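
Since the schooling coefficient equals rK, the rate of return implied by a given coefficient can be backed out for any assumed K. In the sketch below the coefficient .062 is the 1950 estimate from column 1 of table 5, while K = 0.9 is a hypothetical value chosen only for illustration; the paper does not estimate K.

```python
def implied_rate_of_return(beta2, k):
    """Solve beta2 = r * K for r, given the cost ratio K."""
    if k <= 0:
        raise ValueError("K must be positive")
    return beta2 / k


beta2 = 0.062                                        # table 5, column 1
r_if_k_is_one = implied_rate_of_return(beta2, 1.0)   # the usual assumption
r_if_k_is_less = implied_rate_of_return(beta2, 0.9)  # HYPOTHETICAL K

print(round(r_if_k_is_one, 3), round(r_if_k_is_less, 3))
```

Any K below one raises the implied rate of return above the estimated coefficient, which is the sense in which the coefficients understate the return for this sample.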

TABLE 4 

Earnings Functions for Males in 1940








TABLE 5

Earnings Functions for Males in 1950

                                   Log of Annual Earnings ($100)
                                       (1)         (2)         (3)

Years of schooling in 1950           0.062       0.064       0.064
                                    (6.14)      (6.29)      (5.79)
Years of experience in 1950          0.131 [illegible]       0.128
                                    (2.39)      (2.52)      (2.34)
(Years of experience)^2            -0.0018     -0.0019     -0.0017
                                   (-2.19)     (-2.32)     (-2.06)
Mother’s education                     ...      -0.026         ...
                                               (-2.56)
Father’s education                     ...     0.00075         ...
                                                (0.08)
YK                                     ...       0.655         ...
                                                (2.00)
CIQ                                    ...         ...     0.00049
                                                            (0.20)
TGT                                    ...         ...      -0.033
                                                           (-0.52)
Intercept                             1.02       0.567       0.905
R²                                   0.061       0.073       0.062

Source: Calculated from the Terman sample; includes 731 observations.
Note: t-values are in parentheses.




TABLE 6

Earnings Functions for Males in 1960

                                   Log of Annual Earnings ($100)
                                       (1)         (2)         (3)

Years of schooling in 1950           0.067       0.067       0.075
                                    (6.81)      (6.63) [illegible]
Years of experience in 1960          0.097       0.104       0.069
                                    (1.44)      (1.54)      (1.31)
(Years of experience)^2             -0.001      -0.001      -0.001
                                   (-1.45)     (-1.54)     (-1.18)
Mother’s education                     ...      -0.016         ...
                                               (-1.58)
Father’s education                     ...       0.001         ...
                                                (1.05)
YK                                     ...       0.263         ...
                                                (0.84)
CIQ                                    ...         ...       0.002
                                                            (0.86)
TGT                                    ...         ...      -0.096
                                                           (-1.54)
Intercept                            1.722       1.411        1.53
R²                                   0.091       0.097       0.097

Source: Calculated from the Terman sample; includes 701 observations.
Note: t-values are in parentheses.













There is, however, a striking constancy in the schooling coefficients
from decade to decade. The experience variables are not significant in
explaining 1940 incomes (the subjects were, on the average, 29 years old).
At this stage in the life cycle, experience may be exerting a linear rather
than parabolic effect on earnings, as indicated in column 1a, table 4.

B. Home Investments

What is the effect of expanding this earnings function to include home
investment variables? The equation to be estimated is of the form

\[ Y = \beta_1 + \beta_2 I_2 + \beta_3 I_3 + \beta_4 (I_3)^2 + \beta_5 I_1 + v, \qquad (5') \]

where \(I_1\) = the amount of home investment and all other variables are
defined as above.

It is clear that if home investments affect income and if home investment
and schooling are positively correlated, then the simple regression of
income on schooling results in a coefficient on schooling which is biased
upward from \(\beta_2\) by a factor equal to \(\beta_5 r_{I_1,I_2}\), where \(r_{i,j}\) is the regression
coefficient of variable \(i\) on variable \(j\) in the sample. However, if home
investments do not directly influence income, but affect it only indirectly
by influencing schooling levels, then the coefficient is not biased
upward from \(\beta_2\).
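
The bias term described above is an exact property of least squares and can be checked numerically. The sketch below drops the experience terms for brevity and uses a small set of made-up observations; the data and all function names are hypothetical. It compares the schooling coefficient from a regression on schooling alone with the one obtained when home investment is also included.

```python
def mean(values):
    return sum(values) / len(values)


def cov(xs, ys):
    """Sample covariance (population form; the scale cancels in the ratios)."""
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / len(xs)


def short_and_long(y, i2, i1):
    """Return (b2_short, b2_long, b5, r12) from least-squares fits with intercepts."""
    s22, s11, s12 = cov(i2, i2), cov(i1, i1), cov(i1, i2)
    sy2, sy1 = cov(y, i2), cov(y, i1)
    det = s22 * s11 - s12 ** 2
    b2_short = sy2 / s22                      # Y on schooling only
    b2_long = (sy2 * s11 - sy1 * s12) / det   # schooling, home investment held constant
    b5 = (sy1 * s22 - sy2 * s12) / det        # home-investment coefficient
    r12 = s12 / s22                           # regression coefficient of I1 on I2
    return b2_short, b2_long, b5, r12


# HYPOTHETICAL observations, for illustration only.
schooling = [12, 12, 14, 16, 16, 18, 20]
home_investment = [1.0, 2.0, 2.0, 3.0, 4.0, 4.0, 6.0]
log_income = [3.1, 3.3, 3.4, 3.6, 3.9, 3.8, 4.4]

b2_short, b2_long, b5, r12 = short_and_long(log_income, schooling, home_investment)
bias = b5 * r12
```

In any sample the short-regression coefficient equals the long-regression coefficient plus this bias term, so the bias vanishes when home investment has no direct effect on income or is uncorrelated with schooling.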

Estimates of equation (5) for 1940, 1950, and 1960 are presented in
column 2 of tables 4-6. Father’s education was not significantly related
to earnings in any one of the three years. Mother’s education was
negatively related to earnings in all 3 years, and in 1950 this coefficient
was significantly different from zero. Mother’s education was positively
correlated with the two family-income measures (correlation with
YK = .43, correlation with family income adequacy = .16). Both family
income measures were quite significantly related to 1940 income. In
1950, only YK was significantly related to income, and by 1960 neither
income measure was. Thus, the effect of parents’ income on earnings
attenuated over the life cycle. Family-income variables are related to
quality of college attended, and this may, in turn, affect earnings (see
Wachtel 1973). Of those who had “abundant incomes,” 63 percent
attended Stanford; in contrast, only 27 percent of those with “limited”
incomes and 36 percent of those with “adequate” incomes did so. Greater
percentages of the lower-income groups attended state colleges rather than
the higher-quality state universities.

Mother’s education and YK are positively correlated with own years of
schooling (ρ = .23, ρ = .[illegible]7, respectively) and FAMINC is negatively
related (ρ = -.02). The net effect of the home-background variables is
to increase the coefficient on years of schooling from .056 to .063 in 1940,
although the YK variable is significantly related to 1940 income. The
years-of-schooling coefficient is virtually unchanged by the inclusion of
background variables in 1950 and 1960.

These findings provide some evidence against the hypothesis advanced
by Bowles (1972) that specification errors and errors in measurement of
family-background variables lead (because these variables are positively
correlated with schooling level) first, to overestimating the importance of
schooling; and second, to underestimating the importance of background
variables in the earnings function.

Bowles maintains that because data on parents’ education, occupations,
and incomes are obtained from questionnaires administered many years
after the fact, “the degree of error in the measurement of the father’s
occupation and education variable greatly exceeds that in the respondent’s
own years-of-schooling variable” (1972, p. S227). Consequently, the
importance of background variables will be underestimated, while that of
the respondent’s own schooling will be overestimated. The Terman data
allow a simple test of the proposition that if background and the sample
member’s own data were subject to similar levels of error, the background
variables would increase in significance while the member’s own variables
would fall. In the Terman sample, data on parents’ education and home
investment were supplied by parents when the child was 11 years old.
Yet, in spite of the significance of some of these variables, the importance
of the years-of-schooling variable was not reduced.

C. Ability 

Using the same reasoning applied to the home-investment variables, we
can say that if ability affects earnings and if it is positively correlated with
schooling achieved, omitting this variable in the earnings function results
in biasing upward the rate of return to schooling.

Income in 1940, 1950, and 1960 is not significantly related to IQ,
holding schooling and experience constant (see col. 3 of tables 4-6).
The IQ variables remain insignificant if both home investment and IQ
variables are included in the earnings function, while the coefficients on
the schooling and experience terms are virtually unchanged. We conclude
that childhood ability has very little independent effect on earnings. It is
to be expected that earnings are related to contemporaneous stocks of
human capital. In this sample, years of schooling in 1940 is no doubt a
better index of human capital in 1940 than is IQ measured in 1922.

Furthermore, it must be recalled that due to the nature of the sample,
we are considering marginal differences in ability within the top 1 percent
of the ability distribution. Yet, IQ scores varied for this group from 135
to 200, while a standard deviation for the IQ distribution is 15 points and
IQ was a significant determinant of schooling.




With a less restricted sample, Griliches and Mason (1972), who used
data on United States military veterans who were younger than 35 years,
also found the inclusion of an ability measure did not greatly change the
coefficient of schooling. Using the change in schooling level after military
service, they concluded that the bias in this coefficient due to the omission
of an ability variable (AFQT) was only about 12 percent. In their sample,
in contrast with this study, the ability variables were significant, although
they did not greatly change the schooling coefficient.

Several other studies have also found relatively small bias due to
omission of ability variables. Ashenfelter and Mooney (1968, p. 86), in a
study of past recipients of Woodrow Wilson fellowships, found that the
“inclusion of an ability measure affected the estimates of the coefficients
for the other education related variables only in a very marginal fashion.”
Weisbrod and Karpoff (1968), roughly controlling for ability in a sample
of employees of the American Telephone and Telegraph Company,
estimate that only one-fourth of the earnings differences between high
school and college graduates is attributable to nonschooling factors rather
than to differences in schooling level.

Taubman and Wales (1972) found, after testing several ability measures,
that in the NBER-Thorndike sample of United States Air Force volunteers
for certain programs, only the omission of mathematical ability affected
the schooling parameters. They estimate that the bias to the education
coefficient from omitting the mathematical-ability variable was about
25 percent for 1955 incomes and 15 percent for 1969 incomes. Using
unpublished data from Dael Wolfle on Minnesota high school graduates,
Taubman and Wales found a bias of “no more than four per cent at the
various educational levels” from omitting ability measures from the
earnings function.

John Hause (1972) found that the effect of ability on log of income
remained small at all schooling levels when he used data from Project
Talent, which tested a sample of high school students and later surveyed
their earnings. He calculated upward bias in the education coefficient, if
ability is omitted, at 13-18 percent in Daniel Rogers’s (1969) data based
on a sample of Connecticut school boys, and at 3-11 percent in the
Thorndike data.

VI. Summary

Because of the unique nature of the sample, the findings on the Terman
data cannot be readily generalized to the population as a whole. However,
several conclusions can be drawn:

1. It has been demonstrated that home investments do increase
measured stocks of childhood human capital. Even within a sample of
very able children, home-investment variables were positively and
significantly related to a measure of human capital, IQ, for boys and for a
subset of older girls. A mother’s education was significantly related to
IQ, while the father’s was not, thus indicating that home investments
rather than wholly genetic factors underlie the relationship.

2. Education achieved by age 29 and age 39 by men and women in the
Terman sample did depend on parents’ education and on family size, but
not on a more direct measure of the quantity of preschool time inputs or
on family income. Boys’ achievement also depended on ability, indicating
greater equality of opportunity for boys to get schooling than for girls.

3. Men’s earnings at ages 29, 39, and 49 were strongly related to
schooling and experience variables. The addition of home-investment
and ability variables did not significantly decrease the coefficients on
schooling and experience, although family income was positively and
significantly related to earnings in the early years of the life cycle.


Appendix A 

Measures of Family Income 

Two measures of family income were examined in this study. One was a subjective
measure; the other was constructed from data on fathers’ occupations in 1922.

A. The subjective measure, family income adequacy, resulted from the answer
given in 1950 to the following question: “How adequate were family finances
during your childhood and youth?” Answers were coded as follows:


Coding                           Frequency (in %)

1 = limited or very limited           20.9
2 = adequate                          75.0
3 = abundant                           4.1


B. An alternative estimate of 1922 family income was derived from the father’s
1922 occupation, reported as one of six categories. Estimates of income for various
occupation groups were derived from King (1923, p. 111), since the United
States Census did not collect income data by occupation until 1940. King’s
estimates were based on average earnings for a sample of 436 employed males
between December and February 1922.

The ratio of income in each category to the income of professional workers is
presented below:


Terman Category                                         YK

1. Professional                                        1.00
2. Semiprofessional, higher business                   0.90
3. Clerical, skilled trades, retail business           0.82
4. Farming and agriculture                             0.78
5. Semiskilled trades, minor clerical, or business     0.72
6. Slightly skilled trades, little training            0.64






Appendix B 


TABLE B1

Years of Schooling in the Terman Sample, Description, and
1950 Frequencies for Males and Females

Years of                                             Males           Females
School
Coded    Highest Level of Schooling              Number     %     Number     %

10       High school, 1-3 years                       7    0.9         9    1.4
12       High school, 3 years plus business
         or trade school, high school
         graduate                                    59    7.2        63    9.8
13       High school graduate plus 1-2
         years of art, music, business,
         normal, technical, or trade
         school, or less than 2 years of
         college                                     65    7.9        54    8.4
14       High school graduate plus pharmacy,
         chiropractic, accounting, or
         nursing school or 2-3 years
         of college                                  86   10.5        70   10.9
15       High school plus 3-4 years of art
         or music school or college                   5    0.6         1    0.2
16       B.A. or LL.B. or D.D.S. without
         B.A.                                       200   24.3       186   28.9
17       Masters' degrees or 5-6 year
         engineering or architecture
         degrees                                    138   16.8       213   33.1
18       D.D.S. with B.A., M.B.A.,
         graduate work of 2 years                    20    2.4         1    0.
19       J.D. or LL.B. with B.A.                     87   10.6         2    0.
20       Ph.D., Ed.D., LL.D., etc.                   74    9.0        13    2.1
21       M.D.                                        45    5.5         6    0.
         Incomplete information                      36    4.4        25    3.







Appendix C 


TABLE C1

Means and Standard Deviations—Males

Variable          Mean     Standard Deviation

A. 1922 data—821 observations
CIQ             148.88          10.36
EDMOT            11.74           2.71
EDFAT            12.36           3J3
SIBS              1.85           1.58
ORIGTEST          0.33           0.47
BORDER            1.89           1.24
HOMETR1           0.19           0.40
HOMETR2           0.15           0.36
YK                0.88           0.09
FAMINC            1.82           0.48

B. 1940 data—724 observations
LWAGE40           5.60           0.67
ED40             16.11           2.65
EX40              8.82           3.90
EX40SQ           92.92          75.63
YK                0.88           0.09
FAMINC            1.81           0.48

C. 1950 data—742 observations
LINC50            4.30           0.81
ED50             16.38           2.59
EX50             18.18           4.27
EX50SQ          348.69         155.79

D. 1960 data—699 observations
LINC60            4.91           0.60
EX60             28.16           4.31
EX60SQ          811.51         243.27


TABLE C2

Means and Standard Deviations—Females

Variable          Mean     Standard Deviation

A. 1922 data—643 observations
CIQ             148.33           9.96
EDMOT            11.88           2.67
EDFAT            12.09           3.55
SIBS              1.88           1.60
ORIGTEST          0.26           0.44
BORDER            1.98           1.33
HOMETR1           0.19           0.39
HOMETR2           0.13           0.34
YK                0.89           0.09
FAMINC            1.83           0.51

B. 1940 data—606 observations
ED40             15.34           2.12

C. 1950 data—615 observations
ED50             15.50           2.03










Comment 


Frank P. Stafford 

University of Michigan


The Leibowitz study is important because, through the use of
home-investment variables, it goes beyond the usual practice of recording
that parental education and income should in some fashion influence the
lifetime-earnings potentials of individuals. As the study suggests, data
requirements for research on home investments are rather demanding
because such final output measures as adult earnings are not often
available along with measures of home and school investments. Moreover,
the position advanced and given considerable empirical support is that
the family investments end up as being a type of intermediate good. The
intermediate-good argument is crucial to the analysis, for it suggests that
simply taking measures of human capital for adults (education and labor
market experience) and tacking on variables for home investments may
suggest only a minor influence of the latter, which would likely be
subsumed in measures of adult skills.

While the author has demonstrated that home investments substantially
affect adult earnings, there are several theoretical and empirical problems
with the current effort which may be remedied in future research.

It would seem that to pursue seriously the specification of a dynamic
accumulation model starting at childhood (even given an exogenous
family size and earnings potential of the parents) would require something
more than Ben-Porath’s model of investment in human capital. If
K_0 is the initial condition of “inherent genetic ability,” it seems implausible
that preschoolers choose, or that parents can make them choose, a
path of training so as to maximize the present value of net earnings. To a
large extent, the investments in preschoolers are organized by the
parents, and this preschool investment determines an intermediate good.
This intermediate good then contributes to the learning process at the
point when the child begins to make independent investment decisions,
with his parents’ resources defining a financial constraint or marginal cost
of funds. A variety of specifications, such as having the intermediate good
an argument in the production function or altering the efficiency
parameter, could be used. This general specification would be more in keeping
with the empirical work in the paper and would separate conceptually the
role of parents and children in determining educational attainment.

A more difficult theoretical problem is hidden in the recursive model
used. While a model of this type has much to recommend it on grounds of
estimation, the critical factors influencing home investment in children
include number of children in the family and the quantity and quality of
home investments as well as other training given to particular children.
That is, the process of investment in children is bound up with the family
decision on number of children. On a descriptive level it may be sufficient
to treat the number of children as exogenous and observe how variations
in parental input influence educational attainment, given family income.
Yet, the decision making by the family requires a delicate balancing of
number of children and lifetime family income by virtue of the very high
(dynamic) shadow price of the wife’s withdrawal from the labor force,
particularly for highly educated young women. Although these decisions
can be portrayed by a complex model, there is much merit in a less
ambitious (realistic?) approach which takes the family size and home
investments per child as given. The point of reference to a model with a
demographic sector is to emphasize that the approach in the Leibowitz
paper must be utilized with some uneasiness insofar as it glosses over a
large and important question of family behavior: How do parents decide
number of children and home inputs per child?

On the empirical side, the major data limitations are that in the Terman
sample there is only minimal variation in several of the critical variables
(IQ has a mean of 149 with a standard deviation of only 10) and that the
variables measuring home training, HOMETR1 and HOMETR2, are
rather vague. It seems tenuous to argue that the difference between (1)
“no instruction, other than usual amount of reading and telling stories”
and (2) “considerable number of hours, but chiefly reading, telling stories
and writing” or between (1) and (3) “an appreciable amount of instruction
along particular lines” really constitutes a measure of the quantity of time
inputs by the parents. Yet, this is in fact assumed because use of these
variables, it is argued, “holds the quantity of time inputs constant, [and]
the education variables can be taken to measure the quality of time.”
This is all the more difficult to believe since recent work (Leibowitz 1972,
Hill and Stafford 1973) strongly suggests that quantity of time inputs to
children by more-educated women are greater than for less-educated
women within a lifetime income group.

Without distinguishing between quality or quantity of home inputs per
child, the finding, verified in other studies, that education of the wife is
much more important than education of the husband (or IQ of the
mother) in determining IQ of the child implies that there is a large
nongenetic component to IQ. More significantly, this nongenetic
component which can be related to home investment influences educational
attainment. This is a first major step in the direction of showing that
home investments constitute an intermediate form of human capital
which influences education. The second step is the demonstration that
the IQ and parental-education variables (home investment proxies)
influence educational attainment. The third step is to show that education
influences earnings but that the home investment variables have no direct
influence on earnings. This finding corroborates recent work for a
national probability sample wherein each year of a father’s education
results in a .3 increase in the years of schooling of his sons, but the earnings
of sons are not directly influenced by education of the father (Johnson
and Stafford 1973, pp. 145 and 151).

A secondary theme developed in the paper is the issue of upward bias
in educational variables owing to measurement error attributable to
imperfect memory or subjective recall in background variables. With
minor error in the background variables because the information was
obtained from parents when the child was 11 years old, it is found that the
influence of the child’s own schooling is not appreciably lowered. The
results are interpreted as contradicting the contention that most research
understates the importance of background variables (Bowles 1972).
Aside from the issue of measurement errors, the model used here specifies
that parental variables influence home investment and therefore have a
major, though indirect, influence on adult earnings. This and other
research (Griliches and Mason 1972, Johnson and Stafford 1973)
constitute an empirical link in understanding the role of intergenerational
influences on the income distribution.

To conclude, let me mention two topics on which this study has a
bearing. The observed positive correlation between spouses’ education
can be interpreted as a mutual consumption choice and the return to
education of women is, under this view, partly realized in this form
(though Veblen interpreted this consumption as a vestigial form of
predatory activity). In the Leibowitz study, one of the major returns to
education of women is in the form of home investments in children. An
interesting study would be to develop value-added figures and combine
these with earnings data to calculate returns to schooling for women. This
would show, I believe, that earnings differentials between educated men
and women are considerably offset by nonmarket productivity differences.
The efficacy of public policy as a vehicle for altering the income
distribution is limited since, it is said, income is so strongly influenced by
parental background that opening better schooling opportunities will not
have any appreciable effect on earnings for those born into low-income
families. In this light, do the different forms of income maintenance,
allowing more home time by the mother, result in larger home
investments in the children? If the answer is yes, there is reason to believe that
public policy can influence what appears to be an important source of
human capital formation.



Effects of Child-Care Programs on 
Women's Work Effort 


James J. Heckman 

Columbia University and National Bureau of Economic Research


In recent years, Congress has considered a variety of work-subsidy
programs designed to encourage work among welfare recipients. Many of
these programs would subsidize individuals only if they work some
minimum number of hours. Commonly used techniques cannot give
direct answers to relevant policy questions since a tied offer is involved,
and hence the offer cannot be treated as a simple wage change. The
essence of the problem involves utility comparisons between two or more
discrete alternatives. Such comparisons inherently require information
about consumer preferences in a way not easily obtained from ordinary
labor-supply functions.

To make such comparisons, I present a method for directly estimating
consumer indifference surfaces between money income and nonmarket
time. Once these surfaces are determined, they can be used to compare a
variety of alternative programs to investigate whether or not there is
scope for Pareto-optimal redistribution of income transfers and for
improving the general level of welfare of the community at large without
reducing the welfare of individuals receiving income transfers. Knowledge
of these indifference surfaces allows us to estimate reservation wages, to
estimate the value of nonworking women’s time (Gronau 1973), labor
force participation functions, hours-of-work functions, and welfare losses
due to income tax programs (Harberger 1964). I demonstrate that
estimation of indifference surfaces allows us, at least in principle, to relax
the conventional assumption that the wage rate is independent of hours
of work. The separation of preferences and constraints allows us to
estimate labor-supply functions for individuals affected by welfare
systems and progressive income taxation.

This research was sponsored by National Science Foundation and Office of Economic
Opportunity grants to the National Bureau of Economic Research. I gratefully
acknowledge Ralph Shnelvar’s skilled programming assistance. Orley Ashenfelter,
Betsey, Jacob Mincer, Marc Nerlove, Mel Reder, James Smith, Finis Welch, and
Robert Willis provided useful comments. I thank Elisabeth Parshley for her assistance.
I retain full responsibility for all errors in the paper. This paper has not undergone
staff review and is not an official publication of NBER, NSF, or OEO.

The methodology presented here is similar in spirit to the pioneering
work of Wald (1940). In his neglected paper, Wald suggested that
information from different price situations can determine a family of
indifference curves up to a second-order approximation. In this paper, I
depart from Wald’s methodology and suggest a plausible estimable
specification for the marginal rate of substitution function between goods
and leisure, the parameters of which are estimated by maximum-likelihood
methods.

The particular focus in this paper is on the effect of work-related
child-care programs. However, the methodology is more general and may be
applied to a variety of work-subsidy programs.


I. An Anatomy of Proposals for Child-Care Programs

Proposals designed to relieve work-related child-care expenses have
received the most attention. The Nixon administration has promoted
several measures of this type. In 1971, an administration bill (HR 1) was
introduced as “workfare” legislation designed to vacate welfare rolls by
providing child care to working women. In 1972, tax laws were modified
to give generous deductions for work-related child-care expenses if a
woman worked 30 hours a week or more. Since some work requirement
seems likely in any future legislation, I confine my analysis to such
programs.

If a child-care program gives a woman with a child an hourly
supplement for each hour she works, and she is free to spend it on any
child-care source, the supplement is equivalent to a wage change of equal
magnitude on both her hours of work and her decision to work. Given
reliable estimates of work participation and hours-of-work functions, it is
straightforward to make projections of the labor-supply effects of
alternative programs.

If a cash grant or tax rebate is given to working women with children,
it is no longer possible to proceed so simply. Given an agreed minimum
number of hours which defines the condition of working, the wage rate for
hours worked in excess of the minimum is not affected by such offers
although the wage rate for hours worked below the minimum is increased.
For currently working women who work in excess of the minimum, there
are only income effects. If nonmarket time is a normal good, such offers
will diminish their hours of work. Women working below the minimum
will either increase their work to the minimum number of hours or will
be unaffected by the program. For nonworking women market work is
more attractive, and some portion of them will be expected to commence
working.

[Fig. 1. Axes: money income and leisure.]

Knowledge of the reservation wage (i.e., the minimum wage that a
woman will accept to go to work) does not enable us to form estimates of
program labor-supply effects. To see this, consider the familiar
labor-leisure diagram reproduced in figure 1.

Since we only consider wage change, we may compress all other goods
into money income and construct an indifference curve between money
income and nonmarket time or leisure. That there are many uses of
nonmarket time is inessential to the argument as long as time is
transferable among its uses.

Suppose that a woman (or family unit) can enjoy TC units of
consumption if she does not work and that her wage rate is independent
of her hours of work so that the market opportunity line is MC. The
individual (or family) is on indifference curve AC if the woman does not
work. As drawn, the slope of the indifference curve at zero hours of work
(i.e., the reservation wage) exceeds the offered wage rate (given by the
slope of MC) so that the woman does not work. Suppose the government
gives the woman supplement SC but requires her to work at least T*T
hours to qualify as working. As the figure is drawn, the woman will not
take the offer. Her effective budget constraint becomes M'DEC, and the
government subsidy does not improve her (or her family’s) welfare.
Raising the size of the subsidy or reducing the extent of required effort
may reverse this conclusion.
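
The break that a tied offer puts into the budget line can be made concrete with a small sketch. Everything in it is an illustrative assumption rather than part of the analysis above: the function name `money_income`, the flat wage, and the dollar and hour figures are all hypothetical.

```python
def money_income(hours, wage, nonlabor_income, supplement, min_hours):
    """Money income at a given number of hours under a tied work subsidy.

    The lump-sum supplement is paid only if hours reach the required minimum,
    so the budget line jumps at min_hours but keeps slope `wage` elsewhere.
    """
    income = nonlabor_income + wage * hours
    if hours >= min_hours:
        income += supplement
    return income


# Hypothetical figures, chosen only for illustration.
WAGE, NONLABOR, SUPPLEMENT, MIN_HOURS = 2.0, 3000.0, 500.0, 1000.0

for hours in (0.0, 999.0, 1000.0, 1500.0):
    print(hours, money_income(hours, WAGE, NONLABOR, SUPPLEMENT, MIN_HOURS))
```

Above the minimum an extra hour still adds only the wage, which is why women already working more than the minimum face a pure income effect; the whole gain from the program sits in the jump at the hours requirement.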

In practice, tax rebates may be more complicated. If the “work
requirement” becomes a minimum-earnings requirement which must be
attained to be able to deduct child-care expenses, T* will depend on the
wage rate, shifting to the right for higher-wage women and making the
program differentially more attractive to them, thus working in an
opposite direction to the Nixon “workfare” proposals, ceteris paribus.
Furthermore, since high-income families are more likely to itemize deductions,
tax rebates for child care will tend to favor such families, although surely
in the presence of these rebates more low-income families would find it
profitable to itemize their deductions.

Suppose that the government operates on the supply side of the market
by reducing the price per unit quality of work-related child care.¹
Remembering that the composite commodity theorem (Hicks 1946)
allows us to aggregate all goods whose relative prices do not change into
money income, a reduction in the price of work-related child care is
equivalent to a shift in the indifference map of figure 1 from AC to A'C.²
With lower price per unit quality, consumers can attain the same basket
of market goods (including the quality of their work-related child care)
with lower money income. Hence, A'C lies below AC but the two curves
meet at C because at this point no work is undertaken and the expenditure
on work-related child care is zero. Its price is irrelevant to the height of
the curve at this point but a reduction in the price of work-related child
care reduces the slope of the curve at C (i.e., the reservation wage).³

¹ I ignore the practical difficulties in enforcing child-care provision solely for working
time.

² This analysis assumes that individuals are free to choose the quality per unit hour of
their child care. If for some reason the quality per hour is not an object of choice, the
appropriate wage for labor-supply decisions is the wage net of hourly child-care costs.
Only in this case can child care be treated as a cost of work. Since fixity here, as elsewhere,
is unreasonable to impose a priori, I do not pursue this approach. However, I mention
this case because it is tempting to treat child-care expense as a cost. Throughout this
paper I assume that quality can be varied. That only rarely do children go unsupervised
when a mother works is a statement about consumer preferences and not one about the
need to consume a minimum amount of child-care quality. Using these expenditures as
exogenous variables in regression analysis of labor supply when in fact they are an object
of choice would bias the resulting labor-supply estimates.

³ The proof of these propositions is relatively straightforward and is deleted for the
sake of brevity. Note that, in the text, I only argue that the reservation wage for AC
exceeds that for A'C. In general, the slope of AC might not exceed the slope of A'C for
any arbitrary hours-of-work position unless total expenditure on child-care quality rises
with increasing hours of work along any iso-utility curve or, what amounts to an
equivalent proposition, that work and quality per hour of purchased child care are
Hicks-Allen substitutes.




Since the reservation wage has decreased, more women will now work.
If the price per unit quality is reduced for all child care, whether or not
it is work related, the two curves no longer intersect at C, but if the
expenditure on nonwork-related child care is small the assumption of
intersection may remain plausible.

An important question for policy purposes is whether women receiving
grants should be free to spend those grants on any form of child-care
service. In President Nixon’s veto message (December 10, 1971) an
unrestricted voucher scheme was proposed. However, the Secretary of the
Department of Health, Education, and Welfare testified that the
administration favored monitoring voucher use so that only “authorized”
quality sources could be used for expenditure of child-care vouchers
(March 27, 1972).⁴ This distinction is quite important. It is well known
that over 80 percent of working women with small children do not use
formal day-care services (Low and Spindler 1968; Ruderman 1961).
Since families are free to resort to the market, the implication of this fact
is that the price per unit quality of these informal sources is less than that
available in the market. If individuals are given cash grants or wage
supplements and simultaneously are allowed to spend them only in the
formal market, two offsetting effects are at work for women who have
informal child-care sources available. On the one hand, a wage or income
subsidy for working raises the attractiveness of work effort. On the other
hand, the price change shifts the indifference curves (e.g., from A'C to AC
in figure 1) and tends to offset work incentives. A currently working
woman who uses informal arrangements may either remain unaffected
by the program or switch into the “formal” program. For policy purposes
it is important in assessing revenue costs to know how many will, in fact,
switch over to formal sources.


II. Estimating Indifference Curves

If we knew a consumer’s system of indifference curves and how this
system shifts in response to variations in the price of child care, we could
answer the policy questions raised in Section I. Since ordinary labor-
supply functions are derived from indifference curves and their parameters
may be used to estimate such curves, why bother to directly estimate
them? My answer comes in four parts.

⁴ Again, I abstract from the very real problem of enforcing quality standards.

The first is that direct estimates of indifference curves allow us to derive
the parameters of both hours-of-work and decision-to-work functions
from a common set of parameters, enabling us to present a unified
framework in which to interpret these two aspects of work behavior as well as
giving an economy in parameters to be estimated. Second, by estimating
indifference curves we can combine observations on all women, whether
or not they work, and can avoid both the extrapolation error involved
in using observations on working women to estimate the indifference
surfaces appropriate to all women and the censored-sample problem that
arises from estimating labor-supply parameters on a subsample of working
women. A sample is censored when data are missing for some observations
for reasons related to the model in question. Thus, for example, when a
woman does not work we do not know what wage she would earn were
she to work. The reason she does not work is that her reservation wage
(i.e., the slope of the indifference curve at the zero hours-of-work position)
exceeds the (unknown) available market wage. Estimates of wage functions
and labor-supply functions based on subsamples of working women lead
to biased parameter estimates (see Aigner 1971; Heckman 1972; Gronau
1973). Third, by estimating indifference surfaces we can directly estimate
reservation wages and can use these estimates as Gronau (1973) has done
to estimate the value nonworking women place on their time. These
estimates provide a “reasonableness” criterion with which to judge
labor-supply estimates apart from the usual sign restrictions on income and
substitution effects. We can further judge the desirability of any
labor-supply specification by asking what reservation wage the functional form
implies.
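
The censored-sample problem can be illustrated with a small simulation. This is only a sketch under assumptions that are not in the paper: log wage offers and log reservation wages are drawn as independent standard normal variables, a woman works when her offer exceeds her reservation wage, and the name `simulate_selection` is hypothetical.

```python
import random
import statistics


def simulate_selection(n=20000, seed=0):
    """Compare mean log wage offers for all women with those observed for workers."""
    rng = random.Random(seed)
    all_offers = []
    observed = []
    for _ in range(n):
        log_offer = rng.gauss(0.0, 1.0)        # log of the market wage offer
        log_reservation = rng.gauss(0.0, 1.0)  # log of the reservation wage
        all_offers.append(log_offer)
        if log_offer > log_reservation:        # she works, so her wage is observed
            observed.append(log_offer)
    participation = len(observed) / n
    return statistics.mean(all_offers), statistics.mean(observed), participation


population_mean, workers_mean, participation = simulate_selection()
print(population_mean, workers_mean, participation)
```

Because only offers that exceed the reservation wage are recorded, the mean wage among workers lies above the mean offer in the whole population, which is the sense in which estimates from the working subsample are biased.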

A fourth reason for directly estimating indifference curves is that the
procedure allows for the separation of preferences from constraints. The
ability to make this separation is less important if we are willing to make
the conventional assumption that wage rates are independent of hours of
work (e.g., Kosters 1966) but becomes quite important when we
acknowledge the existence of progressive taxation, welfare regulations, and time
and money costs of work. Thus, for example, even if the pretax wage is
independent of hours of work, progressive income taxes create breaks in
the budget constraint, since tax rates are set on income intervals and
change discontinuously at boundaries. While it may be reasonable to
assume smoothness in tastes, the resulting labor-supply functions may be
quite intractable either to specify or to estimate (see Wales 1973). My
procedure supplies a natural solution to this problem, and it admits of
ready generalization to data from welfare populations which face similarly
distorted budget lines.
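
The breaks that a bracketed tax schedule puts into the budget constraint can be seen in a short sketch. The schedule `BRACKETS`, the wage, and both function names are hypothetical and serve only to illustrate the point; they do not describe any actual tax law.

```python
def after_tax_income(hours, wage, brackets):
    """After-tax earnings when each slice of income is taxed at its bracket rate.

    brackets: list of (lower_bound, marginal_rate) pairs sorted by lower_bound.
    """
    gross = wage * hours
    tax = 0.0
    for i, (lower, rate) in enumerate(brackets):
        upper = brackets[i + 1][0] if i + 1 < len(brackets) else float("inf")
        if gross > lower:
            tax += (min(gross, upper) - lower) * rate
    return gross - tax


def marginal_net_wage(hours, wage, brackets):
    """Slope of the budget line at the given hours: wage times (1 - marginal rate)."""
    gross = wage * hours
    rate = 0.0
    for lower, bracket_rate in brackets:
        if gross >= lower:
            rate = bracket_rate
    return wage * (1.0 - rate)


# Hypothetical schedule: 0% up to 2,000, 20% up to 6,000, 40% above.
BRACKETS = [(0.0, 0.0), (2000.0, 0.2), (6000.0, 0.4)]
WAGE = 4.0

for hours in (400.0, 1000.0, 2000.0):
    print(hours, after_tax_income(hours, WAGE, BRACKETS),
          marginal_net_wage(hours, WAGE, BRACKETS))
```

The pretax wage is the same at every level of hours, yet the net wage falls each time earnings cross a bracket boundary, so the budget line is piecewise linear with a kink at each boundary.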

A. Estimable Indifference Curves

Any indifference curve may be characterized as a locus of points or as an
envelope of tangents. For my purposes, the latter characterization is
more convenient. Given any initial consumption position, the reservation
wage is the slope of the indifference curve at zero hours of work. If leisure
is a normal good, higher initial endowments lead to higher reservation
wages (Hicks 1946, pp. 28-29). We have seen that higher prices for
child-care quality lead to higher reservation wages if a woman uses some child
care when she works.

We may write the marginal rate of substitution function (or slope of the
indifference curve) at a given level of prework income $Y$ as

$$m = m(p_q, Y, h), \tag{1}$$

where we ignore variations in the price of other goods and where $h$ is
hours of work and $p_q$ is the price per unit quality of child care. A consumer
possesses a family of indifference curves indexed by level indicator $Y$, the
no-work level of money income or consumption. We know that
$\partial m/\partial Y > 0$ (normality of nonmarket time) and that
$\partial m/\partial p_q > 0$ at the zero hours-of-work position. Moreover, it
is plausible that this inequality remains valid for all values of $h$.⁵ From
diminishing marginal rate of substitution between goods and time we know
that $\partial m/\partial h > 0$.

If a consumer faces a parametric wage $W$, at initial income position $Y$,
she works if $W > m(p_q, Y, 0)$. If this inequality applies, the equilibrium
position is characterized by

$$W = m(p_q, Y^*, h^*), \tag{2}$$

where $Y^*$ is a level index appropriate to the equilibrium indifference
curve and is nothing more than the amount of money income that would
make the consumer indifferent to a choice between working $h^*$ hours at
wage rate $W$ to gain total resources $Wh^* + Y$ or not working and
receiving income $Y^*$. As things stand, without knowledge of $Y^*$, we cannot
deduce the equilibrium relationship between $W$ and $h$. However, given
$Y^*$, we know that equilibrium also requires that

$$Wh^* + Y = \int_0^{h^*} m(p_q, Y^*, h)\,dh + Y^*. \tag{3}$$

This equilibrium condition states that we may imagine moving the
consumer to her final equilibrium position in one of two ways: giving her
a flat hourly wage rate of $W$ or giving her payment $Y^* - Y$ to be added
to her initial resource endowment $Y$ and compensating her by amount

$$\int_0^{h^*} m(p_q, Y^*, h)\,dh,$$

which is nothing more than the area under her (real) income-compensated
supply curve for labor. Given $W$, $h$, $Y$, and $p_q$ we may solve out for $Y^*$.

⁵ Strictly speaking, it is possible that beyond certain values of $h$ this expression becomes
negative, but for this to occur would require decreasing quality expenditure on child
care as hours of work increase, holding the consumer at the same level of utility (see
n. 3 above).




From equation (2), if $m$ is monotonic in $Y^*$ we may solve for $Y^*$ as a
function of $p_q$, $W$, and $h$: $Y^* = g(p_q, W, h)$; using this value in equation
(3), we implicitly define the labor-supply function by

$$Wh + Y = \int_0^{h} m[p_q, g(p_q, W, h), h']\,dh' + g(p_q, W, h).$$

B. A Particular Functional Form for m

A wide variety of functional forms may be used to specify $m$.⁶ One
plausible specification, arrived at after considerable empirical
experimentation, is to write

$$\ln m = \alpha_0 + \alpha_1 \ln p_q + \alpha_2 Y + \alpha_3 h + \alpha_4 Z + u, \tag{4}$$

where, as before, $p_q$ is the price per unit quality of child care, $Y$ is the
prework level of income, $h$ is hours of work, and $Z$ is a vector of constraints
to be discussed more fully below. A random variable designated “$u$,”
with zero mean and variance $\sigma_u^2$, reflects variation in preferences for work
among individuals. The previous analysis leads me to the prediction that
$\alpha_2 > 0$ (normality of leisure) and $\alpha_3 > 0$ (diminishing marginal rate of
substitution between goods and leisure) and to the presumption that
$\alpha_1 > 0$.

Manipulation along the lines discussed in section A shows that the
resulting labor-supply function is implicitly defined by

$$u = \ln W - \alpha_0 - \alpha_1 \ln p_q - \alpha_3 h - \alpha_4 Z - \alpha_2\left[Wh + Y - \frac{W}{\alpha_3}\left(1 - e^{-\alpha_3 h}\right)\right]. \tag{5}$$

This labor-supply curve can become backward bending beyond a certain
value for hours worked.⁷
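
The manipulation leading to equation (5) can be sketched as follows. This is a reconstruction from equations (2) through (4), not the author's own working; the subscript in $p_q$ and the dummy variable $h'$ are notational assumptions.

```latex
\begin{align*}
% Equilibrium condition (2) with the functional form (4), evaluated at Y^* and h:
\ln W &= \alpha_0 + \alpha_1 \ln p_q + \alpha_2 Y^* + \alpha_3 h + \alpha_4 Z + u, \\
% so along the equilibrium indifference curve, for 0 <= h' <= h:
m(p_q, Y^*, h') &= W e^{-\alpha_3 (h - h')}, \\
% area under the compensated supply curve:
\int_0^{h} m(p_q, Y^*, h')\,dh' &= \frac{W}{\alpha_3}\left(1 - e^{-\alpha_3 h}\right), \\
% condition (3) then gives the level index:
Y^* &= Wh + Y - \frac{W}{\alpha_3}\left(1 - e^{-\alpha_3 h}\right).
\end{align*}
% Substituting Y^* into the first line and solving for u yields equation (5).
```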

This particular functional form is offered as a starting point. If it is
grossly inconsistent with the data, either $\alpha_2$ or $\alpha_3$ will be negative, and the
implied reservation wages, predicted from estimates of equation (4), will
be unreasonable.
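
To see how the specification turns a wage into a work decision and an hours choice, the implicit relation can be solved numerically. The sketch below assumes the reading of equations (4) and (5) in which the bracketed term is $Wh + Y - (W/\alpha_3)(1 - e^{-\alpha_3 h})$; the function names, the bisection method, the cap on hours, and the parameter values in `EXAMPLE_PARAMS` are all hypothetical and are not estimates from the paper.

```python
import math


def reservation_wage(params, price, prework_income, z=0.0, u=0.0):
    """Marginal rate of substitution at zero hours, from the log-linear form for m."""
    a0, a1, a2, a3, a4 = params
    return math.exp(a0 + a1 * math.log(price) + a2 * prework_income + a4 * z + u)


def supply_residual(hours, wage, params, price, prework_income, z=0.0, u=0.0):
    """Implicit labor-supply relation written so that it equals zero at chosen hours."""
    a0, a1, a2, a3, a4 = params
    # Level index Y* implied by the compensated-supply-curve area.
    y_star = (wage * hours + prework_income
              - (wage / a3) * (1.0 - math.exp(-a3 * hours)))
    return (math.log(wage) - a0 - a1 * math.log(price) - a3 * hours
            - a4 * z - a2 * y_star - u)


def hours_supplied(wage, params, price, prework_income, z=0.0, u=0.0,
                   max_hours=8760.0):
    """Zero if the wage does not exceed the reservation wage; else solve by bisection."""
    if wage <= reservation_wage(params, price, prework_income, z, u):
        return 0.0
    lo, hi = 0.0, max_hours
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        # The residual falls as hours rise when a2 > 0 and a3 > 0.
        if supply_residual(mid, wage, params, price, prework_income, z, u) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Hypothetical values of (a0, a1, a2, a3, a4), chosen only so the example runs.
EXAMPLE_PARAMS = (0.0, 0.0, 1e-4, 5e-4, 0.0)

print(reservation_wage(EXAMPLE_PARAMS, price=1.0, prework_income=5000.0))
print(hours_supplied(3.0, EXAMPLE_PARAMS, price=1.0, prework_income=5000.0))
print(hours_supplied(1.5, EXAMPLE_PARAMS, price=1.0, prework_income=5000.0))
```

Bisection is used only because the residual is monotone in hours under the sign restrictions; any one-dimensional root finder would serve. A wage below the implied reservation wage returns zero hours, which is the decision-to-work side of the same parameter set.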

* One method u to let the data determine the functional form, in the manner of Box 
and Ttdewell (1962) and Box and Cox (1964) I did not pursue that approachm this paper 

† Straightforward differentiation of eq. (5) shows that

$$\frac{\partial h}{\partial W} = \frac{1/W - \alpha_2\left[h - (1/\alpha_3)\left(1 - e^{-\alpha_3 h}\right)\right]}{\alpha_2 W\left(1 - e^{-\alpha_3 h}\right) + \alpha_3}.$$

Clearly, $1/W$ is nonnegative, the denominator is positive, as is the first expression in
brackets. To see why the latter assertion is true, note that when $h = 0$ the first
bracketed expression is zero and that the partial of that bracketed expression with
respect to $h$ is $(1 - e^{-\alpha_3 h})$, which is clearly nonnegative since $\alpha_3$ is positive
and $h > 0$. Thus, it is possible, for suitably large values of $h$, for
$\partial h/\partial W$ to become negative.
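The sign change described in this footnote can be checked numerically. The sketch below evaluates the derivative obtained by differentiating eq. (5) with u held fixed. The function name `dh_dW` and the parameter values are hypothetical, chosen only so that the sign flips within a plausible range of annual hours.

```python
import math

def dh_dW(W, h, a2, a3):
    """Slope of the labor-supply curve implied by eq. (5), holding u fixed."""
    # First bracketed expression: zero at h = 0 and increasing in h.
    bracket = h - (1.0 - math.exp(-a3 * h)) / a3
    numerator = 1.0 / W - a2 * bracket
    denominator = a2 * W * (1.0 - math.exp(-a3 * h)) + a3
    return numerator / denominator

# Hypothetical values (not estimates): wage in dollars per hour, h in hours.
W, a2, a3 = 2.30, 5e-4, 5.1e-4
slope_low_hours = dh_dW(W, 500.0, a2, a3)    # forward-sloping region
slope_high_hours = dh_dW(W, 2500.0, a2, a3)  # backward-bending region
```

With these assumed values the slope is positive at 500 hours and negative at 2,500 hours; with a much smaller $\alpha_2$ the bend may lie beyond any feasible number of hours.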




C. The Empirical Specification

To estimate the relevant parameters, we must specify the time dimension
of the decision period and the vector of constraints (Z) which affect the
marginal rate of substitution between goods and leisure. For the purposes
of this paper, I use hours worked in a given year. In my judgment, this
dimension is the appropriate one for analyzing the labor-supply behavior
of women with children (the target population for child-care programs),
since we are interested in knowing how they will respond to programs at
the age when they have their children. I simplify the analysis by assuming
that variations in the wife's labor supply do not affect the husband's
earnings.*

We may follow a well-established tradition in the literature and postulate
a one-period static model, or we may imagine the individual woman
(or household) maximizing her lifetime utility function at each age of her
life, subject to the constraints reflecting past decisions and chance events.
The traditional static model may easily be shown to be a special case of a
more general life-cycle model.

To simplify exposition and maintain comparability with previous
studies in labor supply, I assume a one-period framework. After presenting
the essential features of my procedure under this assumption, I then show
how to incorporate more general intertemporal considerations.

In specifying the list of variables (Z) that would plausibly affect the
marginal rate of substitution between goods and leisure, the presence of
children of different ages and the price per quality unit of child care are
the most obvious candidates.

Since children require time and effort, it is likely that the presence of
children, especially young children, makes market work less attractive.
If a mother works and values her child's environment during her working
hours, she will make child-care arrangements. The effect of children on
the indifference system allows us to estimate what compensation is
necessary to induce the woman to work and how the presence of children
affects the hours worked by women.

An important empirical fact is that most working women with small
children do not pay for their child care, or if they do pay they use
informal sources and pay less than women using formal sources. Some
economists have interpreted this fact as evidence of failure in the
day-care market.¹⁰

* This assumption is widely used (see Gronau 1973; Bowen and Finegan 1969), but it
is clearly open to the criticism that the family labor supply is jointly determined. The
assumption is equivalent to fixing the husband's nonmarket time institutionally or
excluding it altogether from household preferences.

* For example, if children require at least some minimum amount of the mother's time,
then presence of a child, viewed solely as a reduction in available time without
affecting utility flow, would raise the reservation wage as long as consumption is a
normal good.

I interpret these facts differently. It is plausible that some
women have low-cost, nonmarket substitutes available to care for their
children. Potential sources of low-cost child care are relatives living
nearby, husbands with flexible working hours, and friends and neighbors.
The availability of these low-cost sources lowers the reservation wage and
makes it more likely that women having such sources available will work.

Unfortunately, it is not possible to directly measure the price per unit
quality women must pay for their child care. It is plausible to argue that
all families can resort to the market to purchase units of child-care
quality but that some families have informal sources available at a lower
price per unit quality than is available in the market. Measuring quality
by expenditure, we may normalize the market price at unity.¹¹

Suppose the distribution of informal prices among the population at
large is given by

$$\ln p_q = X\beta + e \qquad (6)$$

where $e$ is a disturbance with zero mean and variance $\sigma_e^2$ and where $X$ is
a vector of variables assumed to affect the price per unit quality of child
care. If we assume that the family selects its source of child care by the
rule min (0, $\ln p_q$), the marginal rate of substitution function at the
prework level of utility for families using informal sources ($\ln m_I$) may be
written as

$$\ln m_I = \alpha_0 + \alpha_1 X\beta + \alpha_2 Y^* + \alpha_3 h + \alpha_4 Z + u + \alpha_1 e, \qquad (7)$$

while the function for women using formal sources ($\ln m_{II}$) may be
written as

$$\ln m_{II} = \alpha_0 + \alpha_2 Y^* + \alpha_3 h + \alpha_4 Z + u \qquad (8)$$

where a restriction on the distribution of $e$ arises from the condition that

$$\ln p_q < 0 \iff X\beta + e < 0. \qquad (9)$$

It is possible to estimate $\beta/\sigma_e$ from data on whether or not a family uses
informal child care when the wife works. However, further information
is required to estimate the coefficient $\alpha_1$ alone, although it is possible
to estimate $\alpha_1\sigma_e$. Since $\sigma_e$ is known to be positive, it is still possible to test
the hypothesis that $\alpha_1$ is positive.
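The selection rule and condition (9) can be illustrated with a short sketch. It assumes, purely for illustration, that the disturbance e is normally distributed, so that the probability of using informal care depends only on the normalized index built from $\beta/\sigma_e$. The function names and the example inputs are hypothetical.

```python
import math

def std_normal_cdf(z):
    """Standard normal distribution function via the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def prob_informal(x, beta_over_sigma):
    """P(X*beta + e < 0) when e is assumed normal with s.d. sigma_e.

    Only beta / sigma_e enters, which is why the choice data alone
    cannot separate beta from sigma_e.
    """
    index = sum(xi * bi for xi, bi in zip(x, beta_over_sigma))
    return std_normal_cdf(-index)

def effective_log_price(ln_p_informal):
    """Rule min(0, ln p_q): the market price is normalized at unity (log 0)."""
    return min(0.0, ln_p_informal)

# Hypothetical family: a constant and one price shifter.
p = prob_informal([1.0, 1.0], [0.2, -0.9])
```

A family whose informal log price is negative uses the informal source and faces that price; otherwise it buys in the market at a log price of zero.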

¹⁰ In particular, Nelson and Krashinsky (1972) argue this position.
¹¹ This procedure is clearly open to debate but is conventional. It assumes that quality
is an objective characteristic and that units of quality are available at prices independent
of the level of quality or the number of children.

** To see this, note that we can estimate $\alpha_1\beta$ from eq. (7). Suppose, as is
asserted in the text, we know $\beta/\sigma_e$. Then, from the "intercept term" we can estimate
$\alpha_1\sigma_e$, and from the variance we can also estimate $\alpha_1\sigma_e$ if $u$ is independently distributed
from $e$, but we cannot determine $\alpha_1$ independently from $\sigma_e$ unless we assume further
information. One possible assumption is that $\sigma_e = 1$.



Fig. 2 (axes: money income; leisure)

Suppose that we can successfully estimate the relevant parameters of
equations (7) and (8), including $\alpha_1$. We can use these functions to
address the problem, raised in Section I, of restricted vouchers. If a
government program raises the wage rate of a woman with a child but
insists that she use formal child-care sources, it is equivalent to a
wage change for a woman already using formal sources but involves a
change in $p_q$ to unity ($\ln p_q$ to zero) for a woman not using these sources.
From estimates of the indifference surfaces we can determine whether or
not (on average) a woman using informal sources would switch to formal
sources, because we can estimate the equilibrium position for her hours of
work in the case where she gets a higher wage but must pay a higher price
for child care and the case where she abstains from the program. Having
determined her equilibrium hours of work and wage rate in each situation,
we can use equations (7) and (8) to determine the $Y^*$ (the amount of
money income without work that would put the woman at the same level
of utility as working would) implicit in each situation. Since $Y^*$ serves
as an index of utility, the situation with the higher $Y^*$ is the preferred
situation and will be chosen. This approach is exact only if a woman
spends nothing on child care when she does not work; it is approximately
correct if child care is a small fraction of family expenditure when the
woman does not work.

To see this more explicitly, consider figure 2, which is similar to
figure 1. Consider a woman facing budget constraint MB and working
TH hours. The government raises her wage but simultaneously increases
the price of work-related child care. In terms of this diagram, the effect
of the price change is to shift "preferences" as shown by the dashed lines
and to raise the wage so that the new budget line becomes M'B. A new
equilibrium position emerges along isoquant Y*, using the "new" system
of preferences. Using the estimated labor-supply curves, we can deduce
that TH* hours would be worked. Further, using the new equilibrium
values, we can deduce level Y* from equation (8). Then we can determine
which situation is preferable since it is assumed that child-care expenditure
is a negligible portion of consumer expenditure if a woman does not work,
so that the effect of changes in the price of work-related child care is to
pivot the indifference map around the intercept on the TT' axis. As the
diagram is drawn, the woman abstains from the program since Y* lies
below Y'. Precisely the same methodology may be used in analyzing
tied offers.
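The comparison of utility indexes described above can be sketched in a few lines. At an interior equilibrium the wage equals the marginal rate of substitution, so the log-linear form of equations (7) and (8) can be inverted for the prework income level. The function names, the parameter values, and the two equilibrium positions below are hypothetical inputs chosen for illustration; in practice the hours would first be solved from the labor-supply function.

```python
import math

def implied_y_star(W, h, ln_pq, u=0.0,
                   a0=-0.3, a1=0.143, a2=2.2e-5, a3=5.1e-4, a4z=0.243):
    """Invert ln W = a0 + a1*ln_pq + a2*Y* + a3*h + a4*Z + u for Y*.

    Parameter defaults are illustrative placeholders, not estimates.
    """
    return (math.log(W) - a0 - a1 * ln_pq - a3 * h - a4z - u) / a2

def preferred(y_star_abstain, y_star_program):
    """The situation with the higher Y* (utility index) is chosen."""
    return "program" if y_star_program > y_star_abstain else "abstain"

# Hypothetical equilibria: informal care at a log price of -0.5 versus
# the program wage with formal care (log price normalized to zero).
y_abstain = implied_y_star(W=2.30, h=1500.0, ln_pq=-0.5)
y_program = implied_y_star(W=2.40, h=1500.0, ln_pq=0.0)
choice = preferred(y_abstain, y_program)
```

In this assumed case the higher child-care price outweighs the wage gain, so the index is lower under the program and the woman abstains.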

D. Life-Cycle Modifications

Assume an additively separable lifetime-preference function which may
be represented as the (time preference) discounted sum of utility functions
at each age.* In this specification, utility at one age is not directly affected
by variation in the consumption of time and goods at other ages. However,
given a lifetime budget constraint, increasing consumption at one age
reduces resources available at all other ages.

Within the context of this model, it is heuristically convenient (and
formally correct) to imagine the consumer maximization process as a
two-stage affair: subject to any income allotment, the family maximizes
its utility within each period. To determine its allotment among periods,
the family allocates income to equalize the marginal contribution to
lifetime utility of an additional dollar of income in all periods if it is free
to transfer resources among periods at a given borrowing and lending
rate.

To account for the mobility of funds among periods, we must allow for
saving or dissaving in a given period. For example, by borrowing against 

* This specification is widely used in the optimal-growth literature (e.g., Shell 1967)
and the operational-utility literature (e.g., Strotz 1957).




future income, households may enhance current resources. By saving,
households transfer some of the current flow of resources to future time
periods. Accordingly, the budget line in the conventional labor-leisure
diagram (see fig. 1) must be shifted up (down) by the amount of net
debt accumulation (saving) in the period. If saving is set at zero, the
traditional one-period model retains its validity in a life-cycle context.*

Assuming that there is no saving or borrowing is restrictive but traditional.
It simplifies the analysis but is not necessary to it. Since the
value of saving at any age depends on all prices, wages, and exogenous 
incomes, including the current wage, the current wage enters the analysis 
in two ways: as a direct determinant of work effort through the slope of 
the budget constraint, as in the traditional analysis, and as a determinant 
of the level of the given period’s budget constraint through its effect on 
saving. Thus, in addition to the usual income and substitution effects 
confined to one period, there is an intertemporal income-transfer effect 
of wage change. 

Introducing intertemporal considerations alters the simplicity of some
of the previous analysis. In particular, consider the effect of a government
program which raises the wage rate for women at an age at which children
are present and simultaneously requires that the woman use approved
child-care sources. In a one-period context we are able to solve the implicit
level of utility, indexed by Y*, under each program and are able
to conclude whether or not a woman participates in such a program. In
an intertemporal context we lose this simple feature because it is possible
for Y* to be lower in one time period but for the resulting earnings flow
to be sufficiently higher to more than compensate the woman by enhancing
her future utility. Thus, we encounter an intertemporal index-number
problem.

To illustrate this point, consider figure 3. The dotted indifference
curve represents the goods-leisure trade-off at a given period. The solid
curves are the indifference system after the price per unit quality is raised.
Consider a woman initially in equilibrium at point B', where TK is the
flow of earnings from other sources minus saving in the period. Holding
saving fixed, raise the wage so that the new equilibrium is B''. As drawn,
B'' is at a lower level of current utility than B'. Yet, if earnings are
increased sufficiently, the woman might still opt for the program, since
she can transfer resources to future time periods and enjoy greater utility.
If this is the case, the budget line shifts down by the amount of saving
* For a proof of these propositions, see Heckman (1971) where the Strotz "utility tree"
is applied to an intertemporal labor-supply problem. The assumption of additive
separability yields a traditional labor-leisure diagram for each period in the consumer's life
cycle. The level of the budget constraint in each period is determined, in part, by
exogenous income flows in that period and the net transfer of resources into or out of the
period.



[Figure; axis label: money income]

and the woman works more since leisure is a normal good in each time
period. Note, however, that if saving is held fixed, and the wage increase
leads to a higher level of utility in this period, the possibility of transferring
funds among periods enhances the attractiveness of the program so that a
woman will definitely choose to participate in it and thus we can estimate
a lower bound on the number of program participants. Note, too, that
we can estimate an upper bound: take all women who are made worse
off in this period and assume they participate in the program. Solving
their new equilibrium values, allowing for a transfer of resources among
periods, we can solve for their new earnings. If we also estimate their
expenditure on child care, their earnings net of child-care expenditure
(in the new program) must increase for resources to be freed to transfer
to subsequent periods. Then we know that only women who have such
options will participate in the program, although clearly not all of these
women will participate.

The only necessary modification in the previous empirical specification 
is the introduction of a savings function which is subtracted from the




exogenous income flow Y. In the absence of data on saving, we are forced 
to estimate an implicit savings function. 


III. Estimates

Estimates of the one-period model developed in Section II are presented
below. The estimation of the complete life-cycle model is left for another
occasion.*

Before presenting the estimates, it is useful to stress the restrictive
nature of some of our assumptions. The assumption that the wage rate is
independent of hours of work ignores the effect of progressive taxation.
However, as noted in Appendix B, this defect can be remedied. A
potentially more serious problem is our treatment of employer behavior.
If firms incur fixed costs per worker, they are not indifferent to alternative
combinations of men and hours per man which yield the same total
man-hours (Rosen 1968). Under these circumstances, firms might pay
higher hourly wage rates for longer weekly hours, at least up to certain
levels of weekly hours worked. Thus, it is possible that the equilibrium
condition invoked in Section II, that wage rates equal the marginal rate
of substitution for working women, might not apply. This consideration
creates a potentially important complication which is ignored in this
paper.

As Gronau notes, both the presence of fixed costs of work and the
possibility of endogeneity of husband's earnings create potentially important
biases (Gronau 1973, p. S177). These factors could be accommodated
within our framework but only at much higher computational
cost.

The data used to form the estimates are from the National Longitudinal
Survey (NLS) of 1966 for women aged 30-44. The functions are estimated
on a subsample of women who were married, spouse present, with at least


* An attempt was made to estimate a full-fledged life-cycle model on a cross section of
data. In principle, this is a simple exercise. In practice, since the necessary savings data
are missing, it is necessary to specify a savings function, which depends on current and
future wage rates and exogenous income flows and on current-asset variables. The
additional wage and income variables introduced in eq. (5) as determinants of saving led
to unstable estimates. In fact, plausible alternative specifications for the savings function
lead to underidentification for some parameters. These results suggest that we are asking
too much of the data from one cross section. When the second-year data from the National
Longitudinal Survey are available, it will be possible to estimate a separate savings
function and get reliable estimates of the complete model.

* If there were a fixed standard work week, the analysis of Section II could be
generalized to accommodate this complication. For those working fixed hours, and not
moonlighting, the marginal rate of substitution evaluated at the standard work week
(which depends, among other things, on the income flow from work and length of time
at work) is less than the potential moonlighting wage, while for moonlighters the opposite
is true. Thus, as shown in Appendix B, it is possible to gain some information about
consumer preferences even when the marginal equality conditions no longer apply.




one child under age 10. This subsample is chosen for two reasons. The
principal reason is that it is a population obviously affected by child-care
programs.* Previous studies (Ruderman 1968; Bowen and Finegan
1969; Gronau 1973) suggest that only younger children exert an important
retarding effect on women's work effort. A second reason for this
choice of sample is closely related to the first. The NLS data give information
on the availability of informal sources of child care, defined as
care by relatives or zero (dollar)-cost day care. A unique feature of these
data is that, in addition to questions asked on the type of child care used
by working women, questions concerning the availability of informal
sources were asked of nonworking women as well. It is computationally
and conceptually more difficult to incorporate in our sample women for
whom child care is an irrelevant issue.

In Appendix A I present an estimation method that incorporates the
information that (1) the reservation wage is less than the offered wage
and (2) the price per quality unit for child care is less than that available
in the market. Both types of sample-inclusion criteria create the potential
of censoring bias, but this is avoided by the procedure used.

By the construction of the sample, the presence of at least one child
younger than age 10 is assured. To allow for scale effects, I created
separate variables for number of children aged 0-3, 4-6, and 7-10. Other
children variables led to unstable estimates. Husband's earnings are
added to net worth multiplied by an estimable flow coefficient to approximate
the flow of exogenous income relevant to a one-period model.
Michael (1973) has argued that education may affect the household's
ability to organize and produce its final consumption. To accommodate
this hypothesis, I present estimates which measure the effect of the
education (in number of years) of the wife on the marginal rate of
substitution between goods and leisure. I contrast these results with more
conventional estimates which exclude education from household
preferences.

In order to include in the sample nonworking women for whom
wage data are missing, it is also necessary to estimate a function for the
market wages of all women. Its specification is kept simple. The natural
logarithm of wages is postulated to depend on education and labor-market
experience.* I explicitly allow for correlation between the disturbances
of the preference function and the wage function.

A novelty of the procedure used is the explicit estimation of an informal
child-care price function (discussed in Section II). Previous studies (e.g.,

* However, the most relevant policy population is women on welfare who have young
children. Small sample size and ignorance of the features of the welfare system
confronting poor mothers precluded estimation on this sample.

* For a discussion of the experience variable, see the paper by Mincer and Polachek
in this volume. I aggregated their segments of experience into a total-experience variable.




Bowen and Finegan 1969) have shown that children aged 14-18 serve as
stimulants to married female labor-force participation, and it is plausible
that they supply a ready source of child care. Similarly, it is plausible
that relatives living in the home, husbands with flexible working hours,
and friends serve as low-cost sources of child care (see Low and Spindler
1968; Ruderman 1968). Accordingly, these determinants of the quality-adjusted
price of day care are incorporated in the estimation.

To capture the effect of relatives on the price of child care, I created a
dummy variable which assumes the value of one when a sister, parent, or
grandparent lives in the household. The number of children aged 14-18
is included as a variable. As a crude measure of the flexibility of the
husband's hours and his availability for child care, I use the number of hours
he works in a week. As a measure of the availability of low-cost care
from friends and relatives living nearby, I use two variables: a dummy
which assumes the value of one when a woman has lived in a Standard
Metropolitan Statistical Area (SMSA) all her life and a continuous
variable measuring the number of years a woman has lived in an SMSA.
Both are expected to depress the price of child care.

The estimates are presented in table 1. Column 1 reports estimates of
the parameters for whites obtained by excluding the effect of education
from household preferences, while column 2 shows the effect of including
education. Comparable results for blacks are in columns 3 and 4.

Consideration of column 1 coefficients for whites will illustrate the
meaning of the coefficients. A unit increase in the logarithm of the
quality-adjusted price of child care divided by its standard error raises
the marginal rate of substitution by 14.3 percent. Each $10,000 of
exogenous income raises the marginal rate of substitution by 22 percent.
Each 1,000 hours worked raises the marginal rate of substitution by
51 percent. An additional child aged 0-3 raises the marginal rate of
substitution by 24.3 percent; an additional 4-6-year-old child raises the
marginal rate of substitution by 8.4 percent, while an additional child
aged 7-10 raises it only 2 percent. The coefficient on assets is a number
converting the stock of net worth into a flow yield. Thus, if a family has
$10,000 worth of assets, the estimated income flow is $380. This flow is
added to other exogenous income flows in calculating the effect of income
on the marginal rate of substitution.
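The income-flow arithmetic in this paragraph can be made concrete with a short sketch. The flow rate of 0.038 is simply the ratio implied by the $380 yield on $10,000 of assets quoted above; the function names are hypothetical. The second function rests on a further assumption, namely that the quoted percentages are coefficients on the logarithm of the marginal rate of substitution read as log points, in which case the exact proportional change is exp(c) - 1.

```python
import math

# Flow yield implied by the text: $380 of income per $10,000 of net worth.
ASSET_FLOW_RATE = 380.0 / 10_000.0

def exogenous_income(husband_earnings, net_worth, flow_rate=ASSET_FLOW_RATE):
    """Husband's earnings plus the imputed flow from net worth (dollars/year)."""
    return husband_earnings + flow_rate * net_worth

def exact_percent_change(log_coefficient):
    """Exact percentage change implied by a coefficient in a log equation."""
    return 100.0 * (math.exp(log_coefficient) - 1.0)

# Husband earning $5,500 with $5,000 of assets, the case used later in the text.
income_flow = exogenous_income(5500.0, 5000.0)

# If 0.243 is the log-point coefficient for a child aged 0-3 (an assumption),
# the exact proportional effect is somewhat larger than 24.3 percent.
child_effect = exact_percent_change(0.243)
```

For small coefficients the log-point reading and the exact change nearly coincide, which is why the approximation is commonly used.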

Similarly, the coefficients for the quality-adjusted price of child care
show the effect of the associated variables on percentage changes in this
price divided by the standard error of the price distribution. Older
children, relatives living in the home, and length of residence in the SMSA
all have the expected negative effects on this price while the other
coefficients cannot be deemed significant by conventional standards.

The estimates for the market-wage function show the effect of the
associated variables on the natural logarithm of market wages. The
population correlation coefficient measures the interequation residual
correlation between the market-wage function and unmeasured tastes
for work.

In comparing columns 2 and 1, it is seen that while conventional
significance tests suggest that education should be included in household
preferences, its inclusion alters few coefficient estimates. The only notable
exception is that the effect of education on market wages is increased
and is not significantly different from the effect of education on reservation
wages. This suggests that for white women with at least one child younger
than age 10, better-educated women do not work in the labor force as
much as less educated women. For comparable black women, better-educated
women are more likely to work.

The most useful way to summarize these numbers is to consider their
potential application. Making the bold assumption that the preference
parameters for married black women with spouse present are those
appropriate to all black women, we can investigate the possibility of using
tied child-care offers to alleviate welfare dependency. Consider a group
of black women currently eligible for an annual welfare payment of
$3,000. For the sake of illustration, we assume that each woman has
three children, one in each of our estimated age intervals. We assume
that each woman uses informal child-care sources and that she has one
child from 14 to 18 years old, one older relative living in the home, and
that she has lived in her current SMSA for 8 years. We further assume
that each woman has no asset income and 10 years of schooling.

Each woman has a choice of working or taking welfare. The geometric
mean indifference curve appropriate to potential welfare recipients is
sketched as AA' in figure 4. Fifty percent of the potential welfare recipients
have an indifference curve emanating from A (= $3,000) which lies
below AA'. Twenty-five percent of the group have an indifference curve
starting at A which lies below AC. This latter group may be termed the
most "work prone" because they require less compensation to be induced
to work. Again, solely for the sake of argument, suppose that each woman
has an identical wage of $1.50 per hour which she can earn in the market.
More realistically, we might assume a distribution for wages as well.

In this extreme example not all women will choose to go on welfare.
The few who are not on welfare will be observed to work long hours at
low wages. Again, solely for the sake of argument, suppose no women
work. Suppose the government offers an unrestricted "day-care" voucher
to a woman if she works 1,500 hours. From budget line OB we can see
that if the woman were to work 1,500 hours at $1.50 per hour she would
earn $2,250. The minimum compensation beyond her own earnings that
would be necessary to induce half the women to leave welfare is the
distance (= $3,300). Note, however, that if a $2,560 child-care
subsidy were offered (= B'B'') 25 percent of the women would leave





TABLE 1 (Continued)

NUMBER OF OBSERVATIONS

                                               Whites    Blacks
Do not work, would use formal sources‡           248        87
Do not work, would use informal sources          148        62
Work and use formal sources                      104        45
Work and do not use formal sources               293       187
Total                                            793       381

Sources: Data are from 1967 National Longitudinal Survey of Work Experience of Women aged 30-44.
For a description of that data, see Dual Careers (Parnes, Shea, Spitz, and Zeller 1970).

Note: Numbers in parentheses are asymptotic normal statistics from the information matrix.

* = 1 if yes, 0 if otherwise.

† = 1 if sister, parent, grandparent; 0 if otherwise.

‡ Formal sources are defined as paid care at child-care centers and care at a nonrelative's home. Informal
sources include all other sources. Nonworking women are classified into two groups: those looking for work
(group B) and those not looking for work (group C). Everyone in the first group was asked whether or not
child-care arrangements would be necessary and what type of arrangements would be made if they were
necessary. This allowed for a classification broadly consistent with the classification of actual use for working
women. Information on the group not looking for work is more scarce. To be included in the sample they
had to answer "yes" or "it depends" to a hypothetical question about whether they would work if a job were
offered to them (Parnes et al. 1970, question 30a, p 2o0) and a subsequent question on need for child care
and what form would be used (question 31). To determine whether the composition of the sample was biased
by using this inclusion criterion, separate discriminant functions were fitted by race for all married women
with at least one child younger than 10 years old in group C. Respondents were distinguished from
nonrespondents by a set of variables including the number of children in different age intervals, assets, age and
education of wife, and husband's earnings. The only significant discriminants for both races were assets
and the number of children aged 0-3 years. Both variables tended to be larger for nonrespondents.
Accordingly, the coefficient estimates reported in this table may be biased. In my opinion the added information
about potential child-care sources for nonworking women was worth the risk of bias.


welfare voluntarily, and the welfare cost to society at large would diminish
by $440 per person who leaves welfare.* Clearly, in this case there is
scope for using tied offers to reduce welfare dependency. The
transfer-minimizing policy is Pareto optimal and is straightforward to determine.
Note, too, that we can determine what proportion of women will work
beyond the minimum number of hours needed to qualify for the program.
Consider a woman whose indifference curve is AC. At 1,500 hours of work,
the slope of the indifference curve (at B'') exceeds $1.50, so that if she
were given slightly more than $2,540 she would leave welfare and work


* Remember that it was assumed that all women were on welfare. In reality, only some
fraction of this 25 percent would be.

* For each value for hours worked, h, a child-care bonus B determines a proportion
of welfare clients P(B) who will leave welfare, given that bonus. Clearly, B lies between
zero and $3,000, the welfare payment. The mean transfer to the group is P(B)B +
[1 - P(B)]($3,000). Minimizing mean transfer is equivalent to minimizing group
transfer since I assume no discrimination within the group is possible. For each h, there
will exist a B which minimizes mean transfer, and the minimum mean transfer for all h
can be determined. Thus, knowing the distribution of preferences, it is possible to
simultaneously determine B and h. Under general conditions, there exists a unique
Pareto-optimal policy for tied welfare offers which can be estimated from the data. Note
that, in the example, it is assumed that no one works initially. To determine the
Pareto-optimal tied offer if some work, it is necessary to subtract the transfers paid to those who
would have worked without a child-care bonus. Note that Pareto optimality is used here
in the restricted sense of improving community welfare from a current position; it is
not the global optimality concept of general equilibrium theory.
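The mean-transfer expression in this footnote lends itself to a small worked sketch. The code below evaluates P(B)B + [1 - P(B)]($3,000) over a short schedule of bonuses and picks the minimizer. The three schedule points use figures quoted in the text (no one leaves with no bonus, 25 percent leave at $2,560, half leave at $3,300); treating them as a complete schedule for a fixed hours requirement, and the function names, are assumptions made for illustration.

```python
WELFARE_PAYMENT = 3000.0  # annual welfare payment used in the example

def mean_transfer(bonus, share_leaving, welfare_payment=WELFARE_PAYMENT):
    """P(B)*B + [1 - P(B)] * welfare payment, per member of the group."""
    return share_leaving * bonus + (1.0 - share_leaving) * welfare_payment

def best_bonus(schedule, welfare_payment=WELFARE_PAYMENT):
    """Return the (bonus, share_leaving) pair with the lowest mean transfer."""
    return min(schedule,
               key=lambda pair: mean_transfer(pair[0], pair[1], welfare_payment))

# (bonus, share leaving welfare) pairs for the 1,500-hour requirement.
schedule = [(0.0, 0.0), (2560.0, 0.25), (3300.0, 0.50)]

chosen = best_bonus(schedule)
saving_per_leaver = WELFARE_PAYMENT - chosen[0]
```

On these three points the $2,560 bonus gives the lowest mean transfer, and each woman who leaves welfare costs $440 less than the welfare payment, matching the figure in the text.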




Fig. 4

Fig. 5 (axis label: money income, thousand dollars)

«etJy iKwri The i«K» ih« MwUd i»t 
herK*ervadonw«r«( **^^««e* Woflbed wagcofs^ 

As a further illustration of the use of the empirical results, consider a
representative working white woman, using informal child-care sources,
whose husband earns $5,500 per annum. Suppose that she has 12 years of
schooling, assets of $5,000, and two children, one from 0 to 3 years old
and one from 4 to 6. In figure 5, we plot a solid system of indifference
curves appropriate to the case of lower prices for informal child care
drawn on the assumption that the woman has not lived in the SMSA
all her life, that her husband works 40 hours a week, that a relative lives
in the home, and that she has lived in her current SMSA for the last
10 years. Superimposed on this system is a broken-line indifference curve
appropriate to the use of child care purchased in the formal market.
As the lines are drawn, the woman is in equilibrium at point B earning
$2.30 per hour. Suppose the government raises her wage to $2.40 per
hour by providing a 10¢ per hour child-care subsidy conditional on her
use of formal sources. Will she take the offer? In this case, the answer is
no. The dashed "program budget line" lies below the iso-utility contour
for formal arrangements, equivalent in utility terms to the indifference
curve for informal arrangements on which she previously was in
equilibrium. Hence, she abstains from the program. Given the parameters of
the distributions of the curves, we can make similar statements for groups
as a whole and hence can estimate the proportion of working women
currently using informal child-care sources who will switch to formal
sources if only restricted vouchers are available.*


IV. Summary and Suggested Extensions 

The economics of tied work payments and methods for estimating the
effect of such payments on labor supply have been discussed in this paper.
It is important to distinguish the conceptually easier problem of modeling
the response to tied offers from the more demanding problem of
providing reliable estimates of the appropriate behavioral functions. It has
been shown that knowledge of consumer preferences is necessary to
estimate program effects, and methods have been suggested for
determining these preferences.

By directly estimating indifference curves, hours of work and
work-participation equations have been derived from a common set of
parameters. The separation of preferences from constraints allows us to estimate
the labor-supply parameters of individuals from data generated by
nonstandard constraints, such as the broken-line budget constraints resulting
from the tax system, where a tractable labor-supply function does not
exist. At the cost of estimating a savings function, we can embed the
traditional one-period model of labor supply into a life-cycle model. Both
the distribution of tastes for work and the distribution of market wage
rates for the population at large are estimated. The estimates suggest
that wage rates are strongly correlated with preferences for work so that
simple "reduced-form" labor-supply functions obtained by regressing
hours worked on wage rates give biased estimates. In forming estimates, a
statistical procedure is employed which avoids this bias and the censoring
bias discussed above.

** To answer both policy questions fully would require consideration of the
distributions of wage rates, preferences, and prices for child care. Such projections
would be suspect at this stage of this research because the full tax-adjusted
life-cycle model has not been estimated.

Perhaps the most controversial aspect of this study is the treatment of
the choice of the mode of child care. A latent distribution of informal
prices for child care is assumed to exist in the population at large. It is
further assumed that a competitive market in child-care quality exists so
that individuals facing informal prices in excess of the competitive price
use formal child-care sources if they work. The decision to work and the
decision to use formal sources are related in the sense that the price of
child care is a determinant of the decision to work and of the actual hours
worked. This model enables us to use information on hours of work and
the decision to work, in combination with direct information on the form
of child care used by working mothers, to estimate the determinants of
the mode of child care mothers select. The empirical results are
consistent with this model but have not been tested against possible alternative
models.

If families value the development of their children, and the mother's
time is an important input into this development, custodial child care
might be chosen if a mother works a limited number of hours but might be
unacceptable if she works full time. If formal sources provide higher-
quality day care, the decision to work, the length of the work week, and
the choice of the mode of child care are jointly determined and mutually
dependent on the wage rate and the prices of formal and informal care.
Although this alternative model is much more difficult to estimate, it is
conceptually more attractive and contains the present model as a special
case. For these reasons it is an approach that might prove fruitful in future
research on the economics of child care.


Appendix A 

The Estimation Procedure 

The parameters of the marginal rate-of-substitution function (4) are estimated
by a maximum-likelihood technique developed in a previous paper (Heckman
1972). In that paper, I present a method that permits observations on nonworking
women to be pooled with observations on working women to estimate
hours-of-work functions, work-participation functions, and wage functions for all women.
The procedure avoids the bias that results from estimating wage and
hours-of-work functions using standard regression techniques on censored subsamples of
working women.
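
The censoring bias mentioned here can be illustrated with a small simulation. This is only a sketch of the bias itself, not of the maximum-likelihood procedure used in the paper; the data-generating process (standard normal log wages and tastes, a true slope of 1) and the function names are assumptions made for illustration.

```python
import random


def ols_slope(xs, ys):
    """Slope from a simple least-squares regression of ys on xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


def simulate(n=20000, true_slope=1.0, seed=0):
    """Compare the slope on all women with the slope on workers only."""
    rng = random.Random(seed)
    wages, desired_hours = [], []
    for _ in range(n):
        w = rng.gauss(0.0, 1.0)   # standardized log wage (assumed)
        e = rng.gauss(0.0, 1.0)   # unobserved taste for work (assumed)
        wages.append(w)
        desired_hours.append(true_slope * w + e)  # latent desired hours
    full = ols_slope(wages, desired_hours)
    # Keep only women with positive desired hours, i.e. the working subsample.
    workers = [(w, h) for w, h in zip(wages, desired_hours) if h > 0]
    truncated = ols_slope([w for w, _ in workers], [h for _, h in workers])
    return full, truncated


if __name__ == "__main__":
    full, truncated = simulate()
    print("all women:", round(full, 3), "workers only:", round(truncated, 3))
```

In this setup the regression on workers alone understates the wage response, because low-wage women appear in the working subsample only when their unobserved taste for work is high.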

Four distinct subsamples are considered here in contrast to the two subsamples
considered in my previous paper. Some working mothers use informal child-care
sources while others use formal sources. Potentially, the same dichotomy exists
among nonworking women, and a unique feature of the data used here is that for
many nonworking women it is possible to identify which mode of child care they
would use if they were to work.

Letting $W_i$ be the wage rate for the ith woman and using the notation of the
text, woman i works and uses formal sources of child care if

$X_i\beta + \varepsilon_i > 0$ and $\ln W_i > \alpha_0 + \alpha_2 Y_i + \alpha_3 Z_i + u_i$, (A1)

where $X_i\beta + \varepsilon_i$ is the quality-adjusted price of child care. Woman i works and
uses informal sources if

$X_i\beta + \varepsilon_i < 0$ and $\ln W_i > \alpha_0 + \alpha_1 X_i\beta + \alpha_2 Y_i + \alpha_3 Z_i + \alpha_1\varepsilon_i + u_i$. (A2)

For nonworking women who would use formal sources,

$X_i\beta + \varepsilon_i > 0$ and $\ln W_i < \alpha_0 + \alpha_2 Y_i + \alpha_3 Z_i + u_i$, (A3)

while for nonworking women who would use informal sources

$X_i\beta + \varepsilon_i < 0$ and $\ln W_i < \alpha_0 + \alpha_1 X_i\beta + \alpha_2 Y_i + \alpha_3 Z_i + \alpha_1\varepsilon_i + u_i$. (A4)

To circumvent the practical difficulty that wage rates are missing for
nonworking women, I postulate a wage function for the ith individual,

$\ln W_i = m + n k_i + v_i$, (A5)

where $k_i$ includes determinants of market wages such as education and
labor-market experience, and where $v_i$ is a disturbance term with zero mean and
variance $\sigma^2$.

To derive the appropriate sample likelihood function, consider the joint
density of the disturbances $h(u, v, \varepsilon)$. For a working woman who uses formal
sources, the density function for u, v, and $\varepsilon$ for the domain defined by (A1) is

$\dfrac{h(u, v, \varepsilon)}{\Pr(m + nk - \alpha_0 - \alpha_2 Y - \alpha_3 Z > u - v \,\wedge\, \varepsilon > -X\beta)}$, (A6)

where $\Pr(\cdot)$ denotes the probability of event "$\cdot$". Assuming independence between
$\varepsilon$ and other disturbances, but allowing for dependence between u and v, we may
write $h(u, v, \varepsilon) = g(u, v) f(\varepsilon)$, so that the conditional density of u, v, and $\varepsilon$, given
that a woman works and uses formal sources, may be written as

$\dfrac{g(u, v) f(\varepsilon)}{\Pr(m + nk - \alpha_0 - \alpha_2 Y - \alpha_3 Z > u - v)\,\Pr(\varepsilon > -X\beta)}$. (A7)

The density function for observed hours of work, wages, and the event "use of
formal child-care services" is readily derived. To simplify the exposition, I
present only the distribution for u, v, and the use of child care, which is

$\dfrac{g(u, v) \int_{-X\beta}^{\infty} f(\varepsilon)\, d\varepsilon}{\Pr(m + nk - \alpha_0 - \alpha_2 Y - \alpha_3 Z > u - v)\,\Pr(\varepsilon > -X\beta)}$. (A8)


For working women who use informal sources, it is more convenient to work with
the density function

4(u + a,e,», e), (A9) 

which can eaiily be denved from h. For expository ease, it is cmivcment to define 
u + r- Clearly, c u not mdependent of e. This imphes that the conditional 

distnbution of e, a, and the dummy variable associated with the “use of informal 
sources” may be written as 

/V(m + B* - - onXp - ajY — > cAa < -Xfi)' 

We may combine observations described by (A8) and (A10) to form the
conditional likelihood function for working women. To incorporate sample information
for nonworking women, we know that the appropriate probability statement for
nonworking women who would not use formal sources is

$\Pr(m + nk - \alpha_0 - \alpha_1 X\beta - \alpha_2 Y - \alpha_3 Z < c \,\wedge\, \varepsilon < -X\beta)$, (A11)

while for nonworking women who would use formal sources the appropriate
probability statement is

$\Pr(m + nk - \alpha_0 - \alpha_2 Y - \alpha_3 Z < u - v \,\wedge\, \varepsilon > -X\beta)$. (A12)

We may combine observations described by (A8) and (A10) with observations on
nonworking women by weighting the conditional densities by the probability of
the conditioning event.

To form the sample likelihood fiinction in terms of observable variables, 
substitute for u and v from equations (5) and (A5), respecuvely. The Jacobian of 
transformation is jaj + «iW(l — «****)! It is straightforward to verify that 
smee A ^ 0, the transformauon is one-to-one smee Sj > 0 and a^ > 0. Ordering 
the observauons so that the first T^f work and use formal sources, the next 
work and use informal sources, the subsequent do not work and would use 
formal sources, while the last Y'nwi do not work and would use informal sources, 
the sample likelihood may be written as 



n' l«s + fi) f" ■«(«) * 

1.1 J-i,s 

n i«s + - ♦■**01 «) * 

‘•*'irr+i J*" 

Pr{m + nifcj - «o - ajFj - a^Z, < « - v)PT{e > -X,p) 

^wr* ^wi * 

J] fV(m + at, - «o - «2*'i 

— a^Z, < e, — p, As < —X,f) 


where«, i> In W, - ag - a,k, - SI 4 Z, - «a[W,A, + K, - (W,/«j)(l - * ***0]. 
e, w tt, — aiXffi, and In W, — in — at,. In deriving the estimates, I assume 
that the disturbances sue jointly normally distributed, vntb s uncorrelated with 
a and o. The predse funebonsJ forms are well known and are not praented here. 
Note that each term in the third groiqi of products mvolves the product of two 



JOURNAL OF POLrnCAL 
a ewily computed. The J^^’a^/compu- 

Iv be apphcd to more gene conjugent 


‘’’^x’ IUt» «»• «•>»»• o( 1 »»» •t '>““ 

'” - Tirr»rjr:sp- 

!»»-p-'* 

into inte , ,,»broken hne, i intervals lii tj* w,«toet constraint 

the budget line « a n the mte ^ ^ '^S^the slope 

rrsv 

woman to w ,_ ...jgetvation wage „j the known income —. , ^ second 

deternuneo y ^ at the 8 beyond t *i happen* to 

between the >^°^° known for timauon procedure ^^ure, 

the “PP^ man m eqtuUbnuin co"*^’*'"!’,!,^ .egment, while the 

consider a wom* geement ot tne slope of the seg 

" Z t 'ZX^ “ •'“ 

“ ot wuut"'"? “^^'5 wwk- it" “tj” ^oijhitorwuJ » 

IS®-bov. 



Fig. 6 (horizontal axis: Leisure)





Comment 


Sherwin Rosen 

University of Rochester and Harvard University


Heckman has applied sophisticated theory and empirical methods to an
important practical problem. He uses a variant of Tobit analysis to
combine qualitative data on labor-force status with quantitative data on
hours of work. These methods not only take account of the usual
differences in market wage opportunities confronting individuals, but also
account for differences in tastes among them. Few previous labor-supply
studies have attempted to incorporate variations in tastes, and virtually
none has treated the problem with other than ad hoc methods. In addition,
Heckman's statistical approach directly addresses a problem of
interpretation raised by Lewis (1972): Cross-section regressions of labor-force
participation rates on wage rates and other variables do not necessarily
capture underlying substitution effects between leisure and goods. For
example, suppose there are differences in preferences or home productivities
among members of the population under study, over and above those
represented by such exogenous variables as education and number of
children. Then there is a distribution of unobserved reservation wages.
Those whose reservation wage exceeds available market wages choose
not to participate in the labor market. They specialize in home production
and leisure instead. As wages increase, the fraction of the population
satisfying this criterion decreases and labor-force participation rates rise.
Therefore, wage variation in the sample sweeps out a tail of the
underlying distribution of reservation prices. Regression estimates may only
give us back properties of this distribution and may be uninformative
with respect to the more interesting aspects of the structure of preferences.
Heckman explicitly deals with this problem in his analysis. Furthermore,
both labor-force status and hours-of-work data are utilized efficiently in
estimating labor-supply parameters. I know of no other work that
utilizes the sample information in this way.
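
The point that wage variation "sweeps out" the distribution of reservation wages can be made concrete with a short sketch. It assumes, purely for illustration, that log reservation wages are normally distributed with mean `mu` and standard deviation `sigma`; neither the distribution nor the parameter values come from the paper.

```python
import math


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def participation_rate(wage, mu=0.5, sigma=0.4):
    """Share of women whose reservation wage lies below the market wage.

    Assumes log reservation wages are N(mu, sigma^2); both values are hypothetical.
    """
    return normal_cdf((math.log(wage) - mu) / sigma)


if __name__ == "__main__":
    for w in (1.0, 1.5, 2.0, 3.0):
        print(w, round(participation_rate(w), 3))
```

Under these assumptions the participation rate at each wage is just the cumulative distribution of reservation wages evaluated at that wage, so a regression of participation on wages recovers the shape of that distribution rather than a substitution effect.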


Heckman does not rest content at this point. He offers some conceptual
modifications as well. Predictions of labor-supply responses to child-care
programs require more knowledge than the uncompensated supply
function, because these programs introduce a complicated wedge between
average and marginal wage rates. However, no difficulties arise if the
whole preference map or, what amounts to the same thing for prediction
purposes, the utility-compensated supply function for hours of work is
known. The paper gives the impression that parameters of indifference
curves are estimated directly. However, appearances are deceiving. Under
the assumptions of the model, the plain fact is that the observations are
generated by an uncompensated, garden-variety supply function.
Conversely, the data identify an ordinary supply function, not a
compensated one. The latter, as well as indifference-map parameters, always
must be inferred from the former, using the theory of revealed preference.
Since most of this theory is found in rather inaccessible discussions of
integrability and related topics, it is worthwhile to spell out some details
in a simple case.

A decision maker maximizes a quasi-concave utility function
$u = U(x, h)$ subject to the constraint $x = wh + y$, where h is hours of
work, x is all other goods, w is the wage rate (in terms of units of x), and
y is nonlabor income (also in units of x).¹ Necessary conditions are
$U_x = \lambda$ and $U_h = -w\lambda$, where $\lambda$ is the marginal utility of income. These
conditions and the budget constraint imply a set of uncompensated
demand and supply functions:


$x = f(w, y)$ (1)

and

$h = g(w, y)$. (2)

It is equation (2) that generates the labor-supply observations. Equations
(1) and (2) are not independent of each other. Substituting into the
budget constraint and differentiating gives

$w g_w - f_w = -g$ (3)

and

$-w g_y + f_y = 1$. (4)

Define an indirect utility function by substituting the demand and supply
functions into $U(x, h)$: $u = U[f(w, y), g(w, y)] = V(w, y)$. Totally
differentiate $V(w, y)$, $du = (U_x f_w + U_h g_w)\,dw + (U_x f_y + U_h g_y)\,dy$, and
substitute the necessary conditions plus restrictions (3) and (4). Simplification
yields the result

$du = \lambda[g(w, y)\,dw + dy]$. (5)

¹ Alternatively, U(x, h) can be a synthetic utility function, after optimizing out the
child-care quality decision conditional on h. That is the approach followed by Heckman.
Then the quality price of child care is a proper argument of U, but is suppressed in this
discussion.

What combinations of wage rates and nonlabor income result in the
same level of utility when consumption-labor choices are optimal?²
Setting $du = 0$ in (5) and using the fact that $\lambda > 0$, these values must
satisfy

$dy/dw = -g(w, y)$. (6)

Equation (6) defines the marginal rate of substitution between w and y. It
gives the slope of an "indifference curve" between w and y rather than
between x and h. Notice that the slope only depends on $h = g(w, y)$,
which is known and estimated from the data.³ Now treat equation (6)
as a differential equation in y and integrate to obtain

$y = F(w, c)$, (7)

where c is a constant of integration, dependent on the initial state $(w_0, y_0)$.
The conditions under which a solution exists are minimal and almost
certain to be satisfied by any empirically tractable supply function. In
fact, (7) is the equation of the indirect preference map and c is a utility
indicator. $F(w, c)$ is an inverse of the indirect utility function. Once the
uncompensated supply function $h = g(w, y)$ has been estimated, $F(w, c)$
can be obtained without prior knowledge of the utility function by
integration, if not analytically, then with numerical methods on a
computer. Finally, substitute equation (7) into equation (2),

$h = g[w, F(w, c)] = G(w, c)$. (8)

Equation (8) is the utility-compensated supply function for hours of work
and $G_w = g_w + g_y F_w = g_w - h g_y$ is the pure substitution effect.
Alternatively, solving for w in terms of h and c in equation (8) yields the
marginal rate of substitution function, the form used by Heckman. Note
that (8) or its inverse is inferred only from knowledge of the
uncompensated supply function.
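
The numerical route from an estimated uncompensated supply function to the compensated one can be sketched as follows. The linear form h = A + B*w + C*y and its coefficients are hypothetical, chosen only so that the integration of dy/dw = -g(w, y) can be checked against a closed-form solution; nothing here reproduces Heckman's specification.

```python
import math

# Hypothetical uncompensated supply function h = A + B*w + C*y (assumed coefficients).
A, B, C = 1.0, 0.5, -0.05


def supply(w, y):
    """Uncompensated hours of work g(w, y)."""
    return A + B * w + C * y


def indifference_income(w0, y0, w1, steps=1000):
    """Integrate dy/dw = -g(w, y) from (w0, y0) to w1 with classical Runge-Kutta.

    Returns F(w1, c), the nonlabor income that keeps utility at its (w0, y0) level.
    """
    slope = lambda w, y: -supply(w, y)
    dw = (w1 - w0) / steps
    w, y = w0, y0
    for _ in range(steps):
        k1 = slope(w, y)
        k2 = slope(w + dw / 2, y + dw * k1 / 2)
        k3 = slope(w + dw / 2, y + dw * k2 / 2)
        k4 = slope(w + dw, y + dw * k3)
        y += dw * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        w += dw
    return y


def compensated_supply(w0, y0, w1):
    """G(w1, c) = g[w1, F(w1, c)] for the indifference curve through (w0, y0)."""
    return supply(w1, indifference_income(w0, y0, w1))


def exact_income(w0, y0, w1):
    """Closed-form solution of the linear differential equation, used only as a check."""
    particular = lambda w: -A / C - B * w / C + B / C ** 2
    return (y0 - particular(w0)) * math.exp(-C * (w1 - w0)) + particular(w1)


if __name__ == "__main__":
    print(indifference_income(2.0, 10.0, 3.0), exact_income(2.0, 10.0, 3.0))
    print(compensated_supply(2.0, 10.0, 3.0))
```

At the starting point the slope of the compensated function equals B - h*C, which matches the pure substitution effect $g_w - h g_y$ given in the text.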

The discussion above shows how it is possible to learn something about
the utility function purely from observing the uncompensated supply
function. Heckman's procedure is more or less the opposite. He begins
with a functional form restriction on the preference map (i.e., a restriction
on equation [8], his equation [4]) and integrates back to an
uncompensated supply function (his equation [5], comparable with equation [2]
above). It is the latter that is estimated. His restrictions are such that

² The following development is a variant of an argument due to Hotelling (1932).

³ Differentiation of (6) reveals that $d^2y/dw^2 = -\partial h/\partial w + h(\partial h/\partial y)$. That is the
negative of the pure substitution effect and the wage-nonlabor income indifference
curves are concave to the origin.




estimated parameters of the uncompensated supply function exactly
identify parameters of the preference map without further manipulation.
Of course the theory does not specify precise functional forms, and some
prior specifications are necessary for estimation. The compensated and
uncompensated supply functions stand in a one-to-one relationship, and
a functional form restriction on either one implies a corresponding
restriction on the other. Thus, the question boils down to where one should
impose the restrictions in practice. My own preference is to put the
functional form restrictions on the uncompensated supply function and work
back from that to the compensated function. After all, the observations
directly identify only the uncompensated supply function, and, in the
nature of the revealed preference argument, the structure of preferences
only can be inferred indirectly from observed behavior. There really is
no way out of a specification search at the point of observations. We
simply have no way of knowing whether semilog linear preference maps
are appropriate. The proper specification must come from observed,
uncompensated behavior.

Heckman's restriction makes the generating function of the
observations (his equation [3]) nonlinear and difficult to estimate. Computational
difficulties are compounded by incorporating observations on labor-force
nonparticipants. On the other hand, predictions of various program effects
are more easily computed. If one starts from the other end, as suggested
here, estimation is far less costly, and much greater experimental flexibility
is achieved at the stage where that experimentation is most desirable.
But this has its costs too, for preference-map inferences and program
projections become more expensive. Still, if complex numerical
integrations are necessary to obtain compensated response functions, they need
be computed only once, after the functional form issue has been resolved.
Finally, Heckman's arguments in favor of his procedure are slightly more
persuasive when income tax distortions are introduced into the analysis.
However, it remains true that within the received framework the
observations are generated by uncompensated structures. Therefore, most
of the above applies after average wage rates have been replaced by their
appropriate after-tax marginal values.

Let me now turn to the empirical estimates. If nonworking women are
to provide information about the structure of preferences, it is necessary
to know what market wage opportunities they have foregone.
Extrapolations are made from wages of working wives with the same productivity
(education and experience) characteristics. Yet there are theoretical
reasons for supposing these extrapolations are biased, along the lines of
the paper presented by Mincer and Polachek in this volume. Women
who have more-permanent labor-force attachments engage in
productivity-enhancing activities to a much greater extent than those who have
less permanent attachments.




To put it another way, "Are hours 'caused' by wages, or are wages
'caused' by hours?" Heckman's method assumes an answer of "no" to
the second part of this question. But Mincer and Polachek's results suggest
the opposite. Women who greatly value market work or who have
relatively low home productivities have prior expectations leading them
to acquire labor-market skills to a much different degree than women
possessing the opposite characteristics. In practice, the result is that
market wage opportunities of those out of the labor force tend
to be overstated. Estimates of substitution terms in labor-supply equations
are biased upward, resulting in upward biased projections of labor-supply
responses to child-care programs. Heckman has estimated the model
separately for working wives, but the comparisons do not dispose of this
point because the statistical treatment of child-care sources is quite
different.

My final comment concerns an anomaly in the data. The observations
are very thin in the range 0 < h < 600, and almost all women who
engage in market activities work a significant amount of time during the
year. It is problematic how semilog linear reservation wage functions and
normally distributed unobserved differences in preferences and market
opportunities can account for this gap in the observations. If there is a
significant mass of individuals at a boundary solution (h = 0) and the
distributions are smooth, a nontrivial fraction exhibiting interior solutions
close to the boundary is to be expected. This raises some questions
regarding sensitivity of the estimates to alternative distributional
assumptions. Considering the estimation difficulties associated with
normally distributed unobserved "error" components, pursuing other
distributions would appear to be infertile ground for future research.
But there is a much simpler explanation for the gap. There may be
almost no jobs that offer short work schedules: Market wage opportunities
in this range of hours may be nil. The point can be generalized. Most jobs
outside the personal service sector offer rather inflexible work schedules
(the major exception being volunteer work), with hours-of-work
requirements varying from job to job. Insofar as choice of working hours is
intermediated by a commensurate choice of job, there is no reason for a single
hourly wage to clear all markets for jobs. That is, it may be factually
incorrect to assume equality between average and marginal wage rates
in the analysis. Wage-hours regressions are not readily interpreted as
direct estimates of labor-supply functions in this case. They undoubtedly
are related to worker preferences, but contain elements of employer
demand and technological considerations as well. It is impossible to say
at this point what difference such modifications will make to labor-supply
estimates, since a slightly different conceptual framework is required. In
any event, this issue seems well worth pursuing.




In conclusion, Heckman clearly has opened a lot of new territory in the
economics of labor supply. The originality of his effort is all the more
remarkable when one considers the extent to which the literature shows
these grounds have been worked over. Future studies in this area are
bound to be affected for many years to come by Heckman's paper.



Economics of Postwar Fertility In Japan: 
Differentials and Trends 


Masanori Hashimoto 

University of Washington and National Bureau of Economic Research


I. Introduction 

Japan experienced a precipitous decline in fertility during the decade
following the postwar baby boom of 1947-49, and then her fertility rates
leveled off. The rapid decline in fertility paralleled liberalization of abortion
laws and an active campaign to disseminate contraceptive information.
Given these developments and the apparent subordination of women in
Japanese culture, one might be inclined to ascribe the fertility decline to
increased availability of abortions and other means of birth control and
to doubt the explanatory relevance of economic theory which emphasizes
the effects of rising wages and educational attainment of women on
fertility. The conclusion that emerges from this study, however, is that
the basic economic forces identified by the new economic theory of
household decision making have been operating to produce a
considerable part of the observed differentials and trends in Japanese
fertility.¹ Indeed, increased use of abortion and contraceptive devices
appears to have been induced to some extent by economic forces.²

I benefited greatly from comments and suggestions made by Gary S. Becker, Margaret
L. Hashimoto, Masatoshi Kuratani, H. Gregg Lewis, Gail Makinen, Jacob Mincer,
Marc Nerlove, Douglas S. Paauw, Gary Saxonhouse, T. W. Schultz, T. Paul Schultz,
Miron Stano, Kozo Yamamura, and Dennis Zimmerman. Haeng-Ja Song's research
assistance is gratefully acknowledged. Sachio Kohjima of the Bank of Tokyo obtained
some data for me. This paper was written while I was at Wayne State University.
I thank the Economics Department at Wayne State University, where this research was
performed, for providing resources for my research. Finally, I alone am responsible
for the contents of this paper.

¹ Silver (1966) found that births, and to a somewhat lesser extent marriages, responded
positively to both ordinary and Kuznets cycles between 1878 and 1957, and that the
magnitudes of the cyclical responses of births were similar to those in the United States
and Britain.

² It would be premature, however, to look for exact magnitudes of the parameters of
the Japanese fertility function or for a complete explanation of fertility trends. The
statistical model employs only a few key variables and is still formulated in static terms.
The statistical analysis is performed mainly on cross-section data which are highly
aggregated. Further refinements of the model and the data base are both desirable and



POSTWAR FERTILITY IN JAPAN SI 71 

II. Differentials and Trends in Japanese Fertility: Background

The phenomenal growth of the Japanese economy during the postwar
years has been widely noted. Between 1950 and 1969, real gross national
product grew at an average annually compounded rate of close to 10
percent, while the comparative rate for the United States was 4 percent.
Real gross national product per capita grew at 9 percent, contrasted with
2 percent for the United States. The rapid industrialization of the
Japanese economy is documented in the share of agricultural employment,
which fell from about 49 percent in 1950 to 30 percent in 1960 and then
declined to less than 18 percent in 1970. Employment opportunities for
women grew also; the proportion of women in nonagricultural
employment increased from 29 percent in 1950 to 35 percent in 1960 and to 37
percent in 1970.

Against this background of rapid economic development, Japanese
fertility dropped sharply (fig. 1). The crude birth rate, which had been
about 35 in the late 1940s, first dipped below 30 in 1950; 5 years later it
fell below 20 and stayed there. Fertility decline in Japan began about
1920, but the rate of decline accelerated after 1950.³ By the early 1960s
the persistent decline appears to have ended, and then the leveling off
began. A sharp drop in fertility in 1966 in response to the superstition of
Hinoeuma vividly underscores that fertility is very much within the realm
of deliberate control.⁴ Birth rates just before and after 1966 were to some
extent adjustments to the 1966 phenomenon.

Rural regions experienced a more rapid and a more persistent birth
rate decline than industrial areas (see table 1). Only part of this difference
is accounted for by shifts in age composition as women of childbearing
age migrated from rural to urban regions.⁵ Until 1962, the average
number of children ever born by duration of marriage for women married
before they were 30 years old continuously declined for all groups (see
table 2). There was a slight increase between 1962 and 1967 among
women married less than 10 years. But completed fertility measured
approximately by the fertility of women married more than 10 years
continued to decline. Table 3 provides more direct evidence on the
decrease in desired completed fertility during the postwar years, 1950-67.

³ Between the two periods, 1920-22 and 1940-42, the crude birth rate declined from
35.2 to 30.7 (7.1 percent per decade), and the total fertility rate declined from 5.21 in
1920 to 4.11 in 1940 (11.2 percent per decade). Between the 1949-51 and 1959-61
periods, the crude birth rate dropped from 28.8 to 17.2 (40.3 percent), and the total
fertility rate from 3.72 to 1.99 (46.5 percent).

⁴ Hinoeuma takes place every 60 years. The superstition is that a woman born in that
year will "eat men alive," which is to say that she will be very aggressive and no man will
want to marry her.

⁵ According to Japan's Institute of Population Problems, holding constant the age
composition of the population at 1930 values, the birth rate in urban areas (shi) dropped
from 26.4 in 1950 to 17.8 in 1960 and then rose to 19.0 in 1965. The birth rate in rural
areas (gun) continued to drop from 32.7 in 1950 to 19.0 in 1960 and to 17.6 in 1965.
These results were reported in Aoki (1970, table 24).






Fig. 1. Infant death rate is deaths at ages 0-1 per 1,000 live births. Abortion rate is
number of registered abortions per 100 live births. Other vital statistics have their usual
definitions. (Data on births, deaths, and infant deaths are from Japan Ministry of Health
and Welfare [1970]; abortion rate, total fertility rate, and net reproduction rate are from
Mainichi Newspaper Company [1970, p. 246].)


For families already having two children, the proportion desiring no
additional children increased throughout the years, while the proportion
desiring two and three or more additional children continued to decline.
Moreover, the percentage of couples with "no idea" how many
additional children they wanted tended to decline, suggesting an increased







TABLE 2

Average Number of Children Ever Born
(Wife Married before Age 30)

Marriage Duration               Survey Years
(Years)               1940      1957      1962      1967

1-4                   0.98      0.93      0.87      1.06
5-9                   2.34      2.16      1.92      1.95
10-14                 3.55      2.92      2.46      2.28
15-19                 4.39      3.62      2.85      2.66
20+                   5.17      4.72      3.90      3.43
All groups            3.49      2.86      2.37      2.28

Sources: Aoki and Nakano (1967, table 3) and Japan Ministry of Health and Welfare (1968, table 2-B).
Note: Includes married women with husband present.


awareness of family planning in later years. My estimates of completed
fertility (table 3) also show a pronounced decline over the years.⁶

The rapid decline in fertility in the postwar years coincided with the
liberalization of policies on abortions and contraceptive use. Abortion
was first legalized in September 1948, and in June 1949, grounds for
abortion and sterilization were extended. After May 1952, abortion could
be obtained with only the consent of man and wife. Alarmed at the
increased number of abortions and possible health hazards from abortion,
the government in 1952 initiated programs to disseminate birth control
information through local agencies. Resort to abortion continued to
increase, however, until 1957, when the number of abortions per 100 live
births reached its peak of 71.6.


III. Economic Framework for Fertility Analysis

The following discussion will emphasize the effect of parental education
on fertility. The testing of economic hypotheses about the effect of parental
education on fertility also provides a test of the usefulness of economic
theory in interpreting Japanese fertility behavior.
The basic proposition in the economic theory of fertility is that observed
changes and differentials in fertility reflect to a large extent changes and
differentials in the demand for fertility and that resource constraints on


⁶ The true decline is likely to have been even more rapid than these estimates suggest
because my calculations may underestimate completed fertility to a greater extent in
earlier years. First, the true mean of the open-ended class would be larger than three, the
figure used for my calculations, by a larger amount in earlier years if completed fertility
declined over time. Second, the group with "no idea" as to number of additional children
desired was omitted from the calculations. If people gave the "no idea" answer because
of greater ignorance (lower level of education) than other respondents, then economic
analysis predicts their desired completed fertility to be greater than that of others. This
omission would have mattered less in recent years because of general declines in fertility
and because this group made up a smaller fraction of respondents in more recent years.











households exert appreciable effects on fertility demand. Following
Becker (1960), most economic models of fertility view children as
primarily consumer durables. In this conception, a flow of services
emanates from the stock of children, and these services enter the parents'
utility function. Since parents must make outlays (e.g., food, clothing,
housing, and, most important, their time) to obtain child services, they
are not free goods, but command a price. To be quite general, therefore,
demand for fertility is determined by the preferences of parents, the
resources available to parents, and the cost of producing child services
relative to the cost of other commodities (usually identified collectively
as the standard of living) that also yield utility.⁷

Typically, the fertility rate has declined in countries experiencing
sustained growth in real income. Since growth in real income is usually
brought about by growth in real wages, this phenomenon may be interpreted
as reflecting the dominance of substitution over income effects.
An increase in real income would, other things being equal, increase the
number of children per family unless the number of children was an
“inferior good.”⁸ Assuming that child services are more time-intensive
for mothers than the standard of living, an increase in the wife’s wage
rate would raise the relative price of child services, imparting a substitution
effect away from them.⁹ Relying on income and wage rates alone,
one may, therefore, interpret the decline in fertility accompanying growth
in real income as reflecting the dominance of the substitution effect of a
rise in the wife’s wage rate over the combined income effects of increases
in both spouses’ wage rates. However, both income and wage rates are
correlated with the educational levels of the parents and the level of
contraceptive knowledge, relationships which complicate the empirical
estimation of the magnitudes of the income and substitution effects.

The effect of educational attainment of parents on fertility has been
investigated more thoroughly than that of income or wage rates by
Ben-Porath (1970a, 1970b, 1973) and Michael (1971), among others. Educational
attainment of parents is typically negatively associated with their


⁷ Most models use the time-allocation framework (Becker 1965) and they are usually
cast in a lifetime dimension, thereby avoiding the complexities of timing and spacing of
births (see, for example, Ben-Porath [1973], DeTray [1972], Michael [1971], and Willis
[1973]). Some problems with the use of static models are discussed by T. W. Schultz
(1975).

⁸ Also relevant to the decline in fertility are changes in the quality of children as real
income rises. Research on the demand for consumer durables suggests that the effect of
income on the demand for quality is greater than on the demand for quantity. Even
assuming numbers of children are not inferior, parents would tend to a greater extent to
spend more per child than to have more children as real income rises.

⁹ An increase in the husband's wage rate, however, is expected to produce mainly
income effects because his time intensities are assumed to differ little between child
services and the standard of living and because his wage income constitutes a large
proportion of family income (see, for example, Ben-Porath [1973]).




fertility. Proposed explanations of this relationship fall generally into
three groups: the tastes hypothesis, the cost-of-fertility-control hypothesis,
and the cost-of-time hypothesis.

According to one version of the tastes hypothesis, parents with higher
education may be more “rational” than others in that they tend to regard
the number of children as a decision variable, while less educated parents
regard it as something beyond their control (Ben-Porath 1970a). Therefore,
parents with higher levels of education would control fertility more
and have fewer children than other parents. Another version is that
education affects the perception of child quality (Ben-Porath 1970a;
Gardner 1972). Parents with more education may desire a greater
expenditure per child to obtain a given level of child quality. Other
things being equal, they will have a smaller number of children. Economists
have not succeeded in incorporating such taste factors in their
theory in ways that generate testable hypotheses. Such taste factors are
still regarded as residuals.

The second explanation hypothesizes that parents with more education
have better access to information about birth control devices and are more
efficient at using them. In other words, education lowers the cost of
fertility control. More educated parents would engage in fertility control
to a greater extent than other parents, other things, particularly the
benefit of control, being the same. This hypothesis may partly explain the
negative association between education and fertility.¹⁰

The third explanation emphasizes the effect of education on fertility
through its effects on the cost of time. In the time-allocation framework
of the fertility model, education is assumed to raise the productivity of
time spent at home in the production of commodities (such as child
services and the standard of living) as well as that of time spent in the
labor force. An increase in the productivity of time at work leads to an
increased real wage rate, thereby raising the relative prices of time-intensive
commodities. An increase in the productivity at home, say, at
the same rate for all commodities, lowers the relative prices of time-intensive
commodities. An increased productivity at home or market
also results in increased real income, imparting income effects. The full
effect of education on the demand for children is the net of these forces.
To rely on the cost-of-time hypothesis to explain the negative correlation
between education and fertility, one must argue that the substitution

¹⁰ More educated people may also use birth control more than others because they
desire fewer children, i.e., the benefit of birth control is greater for them. Both Becker
(1960, p. 218) and Ben-Porath (1970a, p. 6) propose a test of the hypothesis that education
lowers the cost of fertility control. If improvement in birth control techniques comes as a
once-and-for-all change, then fertility differentials among educational groups would
widen at first but would tend to narrow over time as information on birth control measures
became more diffused (see Section IV below for some evidence of a differential rate of
diffusion among educational groups of birth control techniques in Japan after 1948-52).




effect imparted by education outweighs the income effect. One plausible
case is that education, especially of the wife, raises the productivity of
time spent in the market more than that of time spent in producing child
services or the standard of living. In this case, the relative price of child
services rises, causing substitution effects away from them.¹¹ The substitution
effect of the husband’s education is likely to be small, assuming
that time intensity varies little between the two commodities for him.
The husband’s education is expected, therefore, to exert mainly income
effects (Ben-Porath 1973).

The cost-of-time hypothesis has played a central role in the economic
analysis of fertility. It emphasizes the role of the relative price of child
services, and its predictions are empirically distinguishable, at least for
Japan, from those of the tastes and the cost-of-fertility-control hypotheses.
The latter two hypotheses predict that the education of either parent will
be negatively associated with fertility, but they leave the relative importance
of the effects of each parent’s education an empirical question. In
Japan, where the husband apparently dominates family decision-making,¹²
one would expect his education to be at least as important as
the wife’s education, if not more so. The cost-of-time hypothesis predicts
that the negative association between education and fertility is more
strongly manifested when the education variable refers to the wife’s
rather than the husband’s education. The husband’s education would be
positively correlated, if anything, with fertility when other things, in
particular the level of contraceptive knowledge, are the same. In other
words, husband’s education serves as a proxy for family wealth, while
wife’s education represents the price of child services.

These three hypotheses, however, are not necessarily mutually exclusive.
They help identify possible mechanisms through which parents’
education affects fertility. For example, the cost-of-time and the
cost-of-fertility-control hypotheses would best be viewed not as alternatives but
as complementary hypotheses. The cost-of-time hypothesis emphasizes
forces affecting the demand for the service of birth control, a demand
which is derived from the desired reduction in births, given the level of
fecundity. Underlying this demand is the demand for fertility. The
cost-of-fertility-control hypothesis emphasizes the supply side, or the cost of


¹¹ Even if the wife does not work in the market in her lifetime, her shadow price of time
could be increased by education, making child services more expensive (see Gronau
[1971]).

¹² A survey of 422 couples in the Saitama prefecture in 1960 showed that 78 percent
were using condoms, a device for the husband's use, and that 62 percent had the husband
obtain the contraceptives. The same study observed that for 70 percent of white-collar
workers, 52 percent of farmers, and 51 percent of laborers, the husband was responsible
for obtaining contraceptive devices (Muramatsu 1962). Matsumoto, Koizumi, and
Nohara (1972, p. 254) also report that husbands are primarily responsible for decisions
concerning family limitation.




achieving the desired reduction in births. The cost-of-time hypothesis
has been the most useful to economists for analyzing information on
fertility behavior.

IV. Empirical Analysis

Study of Japan appears to offer an opportunity to obtain evidence
concerning both the effects on fertility of decreases in the cost of preventing
births and changes in fertility demand. Thus, I want below first
to examine evidence concerning the proposition that both decreased
demand for fertility and increased access to abortions and contraceptive
techniques contributed to the phenomenal postwar decline in Japanese
fertility.

Abortion, Contraceptive Use, and Fertility 

The rapid decline in Japanese fertility in the postwar decade coincided,
as I have stated, with abortion liberalization measures and the subsequent
dissemination of information about other means of fertility control. It is
tempting to explain the rapid decline in fertility as simply the result of a
widespread resort to abortion. According to this view, many births prior
to abortion legalization were unwanted; the liberalization of birth-control
policies simply eliminated this excess fertility. In other words, if
abortion had not been legalized, fertility would not have declined
appreciably. But since fertility in Japan had been declining persistently
since around 1920, there is a presumption that it would have declined
even in the absence of abortion reform.

Since abortions and other fertility-control measures entail costs, both
monetary and nonmonetary, the extent of use of these measures presumably
was influenced by considerations of gains and costs. Abortion
legalization and birth control information programs certainly decreased
the costs of fertility control, leading to increased use of control measures.
On the demand side, as discussed in Section III, the main determinant
of the demand for fertility control is the demand for fertility. A fuller
interpretation of the developments in Japan would be that the price of
child services increased and demand for fertility declined, leading to an
increased demand for fertility control, and that the increased availability
of birth control measures and abortions facilitated the attainment of
lower fertility. Put differently, even without the liberalization of abortion
and other control policies, resort to illegal abortion and the use of contraceptives
would have grown because of the increase in the price of child
services. In fact, the availability of illegal abortions had been growing
before 1948, and the Eugenics Law of 1948 was motivated by hygienics,
not by intent of population control (Muramatsu 1967). In short, legalization
and liberalization of control measures were not so much autonomous





events as responses by authorities to increased demand for fertility control
and the health hazards of illegal practices.
How much did legalization of abortion contribute to the rapid decline
in fertility? To answer this question accurately, one would have to
estimate (1) the increase in the number of abortions caused by abortion
legalization, and (2) the proportion of those abortions that would have
terminated in births had there been no legal abortions. Since such estimates
are beyond the scope of this paper,¹³ I report the results of a crude
calculation. Assuming that all pregnancies aborted legally would have
terminated in live births if abortions had been illegal, I calculated, for
each year from 1949 to 1961, the “maximum” general fertility rate as
births plus abortions per 1,000 women of ages 10-49.¹⁴ The excess of
actual fertility decline over any decline in the “maximum” fertility rate
can be viewed as a measure of fertility decline due to abortion.
Between the 1949-51 and 1959-61 periods, the actual general fertility rate
dropped by 44 percent, from 93 to 52. According to my calculations, the
“maximum” general fertility rate decreased by 22 percent, from 111 to
87. Thus, it would appear that abortions about doubled the rate of
decline in the general fertility rate during the decade after 1950.¹⁵ But
resort to abortion by no means accounted for the entire fertility decline.
Indeed, the rate of decline in the estimated “maximum” rate is still
substantially greater than the rate of decline in the actual fertility rate
during the prewar years.¹⁶ This evidence of a substantial fertility
decline (indeed, a possible acceleration of the prewar decline) after
adjusting for the effect of abortions suggests that the actual fertility decline
partly represents decreases in the demand for fertility.
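
The arithmetic behind this crude calculation can be retraced with a minimal Python sketch. It uses only the four general fertility rates quoted in the text; the function and variable names are illustrative and are not part of the original calculation.

```python
def percent_decline(start, end):
    """Percentage decline from a starting level to an ending level."""
    return 100.0 * (start - end) / start


# General fertility rates (per 1,000 women) quoted in the text,
# averaged over 1949-51 and 1959-61.
actual_start, actual_end = 93.0, 52.0      # registered births only
maximum_start, maximum_end = 111.0, 87.0   # births plus legal abortions

actual_decline = percent_decline(actual_start, actual_end)
maximum_decline = percent_decline(maximum_start, maximum_end)

# The excess of the actual decline over the decline in the "maximum" rate
# is read as the part of the decline attributable to abortion.
excess_decline = actual_decline - maximum_decline
ratio = actual_decline / maximum_decline

print(f"actual decline:    {actual_decline:.1f} percent")
print(f"'maximum' decline: {maximum_decline:.1f} percent")
print(f"excess decline:    {excess_decline:.1f} percentage points")
print(f"ratio of declines: {ratio:.2f}")
```

The ratio of the two declines is close to two, which is the sense in which abortions "about doubled" the rate of decline.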


¹³ In principle, effects of legal abortions on fertility in any year could be assessed from
a simultaneous-equations model in which key endogenous variables would be the fertility
rate and prices of abortions and of contraceptive measures. Once the effect of legalization
on the prices of abortions and contraceptive measures was estimated, the ultimate effect
on fertility could be obtained from the demand-for-fertility function.

¹⁴ These are maximum estimates for each year of the fertility rate in the absence of
legal abortions because, if abortions had been illegal, some pregnancies would have been
aborted illegally or prevented by other means. In other words, the abortions in each year
were not solely due to decreases in costs but also to increases in the demand for fertility
control. On the other hand, these estimates may understate the true maximum values
because of underreporting of abortions. Muramatsu (1960) estimated that in 1955 at
most 77 percent of induced abortions performed were reported. He also estimated that in
1955 the number of live births in the absence of abortion would have been twice (or
more) the number actually registered. I calculated the ratio of the maximum to the actual
general fertility rate to be 1.66 in 1955.

¹⁵ Because of the underreporting of abortions, my calculations may overstate the effect
of abortions on fertility decline. Since the underreporting was probably greater in earlier
years, the actual drop in the maximum fertility rates would have been greater, and the
effect of abortions on fertility decline smaller than I calculate.

¹⁶ Between 1920 and 1940, the rate of decline per decade in the general fertility rate
was about 13 percent.





TABLE 4

Rate and Composition of Pregnancies by Regions,
1955, 1960, and 1965

                                                       Relative Change      Average
                                                                            5-Year-Period
                                                      1955-60    1960-65    Change
                           1955      1960      1965     (%)        (%)       (%)

Rate of pregnancy
  All Japan              223.32    187.01    166.01   -16.26     -11.23    -13.74
  Industrial regions     206.25    191.23    183.00    -7.28      -4.30     -5.80
  Rural-farm regions     251.14    191.30    155.59   -23.83     -18.67    -21.29
Share of births (in %)
  All Japan               57.96     58.14     66.04     0.31      13.59      6.74
  Industrial regions      54.59     56.79     65.91     4.03      16.06      9.88
  Rural-farm regions      59.58     59.85     65.28     0.45       9.07      4.67
Share of abortions (in %)
  All Japan               39.17     38.49     30.53    -1.74     -20.68    -11.71
  Industrial regions      42.24     39.40     30.42    -6.72     -22.79    -15.14
  Rural-farm regions      37.79     37.01     32.22    -2.06     -12.94     -7.66
Percentage of females
(ages 15+) of
childbearing age (20-34)
(in %)
  All Japan               36.88     35.73     34.21    -3.12      -4.25     -3.69
  Industrial regions      39.28     38.95     39.29    -0.84       0.87      0.01
  Rural-farm regions      35.66     34.06     30.17    -4.49     -11.42     -8.02

Sources: Pregnancies, births, and abortions from Shinozaki (1971); female population composition
from Japan Prime Minister's Office (1955, 1960b, 1965).
Note: Rate of pregnancy = 1,000 x (births + fetal deaths + abortions)/married women of childbearing
age with husband present. Share of births = (births/pregnancies) x 100. Share of abortions =
(abortions/pregnancies) x 100. Regions are the same as in table 1.


Dissemination of birth-control measures other than abortions undoubtedly
played some role in the decline in actual fertility rates, but in
the absence of estimates of the number of pregnancies prevented by the
use of modern contraceptive measures, one cannot directly assess its
impact on fertility. Since the efforts to disseminate birth-control information
started only in 1952, the major effects of birth control on
fertility were probably felt during the late 1950s. Assuming that most
couples with accidental pregnancies turned to abortion as a last resort to
control fertility, the share of abortions compared with the share of actual
births in total pregnancies may provide information on the trend of
fertility demand and the use of contraceptives. Since abortions had already
become widespread by the middle of the 1950s, subsequent changes in the
share of abortions would reflect, to a large extent, an increased use of
contraceptives. Table 4 summarizes trends and differentials in the rate of
pregnancy and in the shares of abortions and births in pregnancies after
1955. There was a general increase of the share of births and a decrease
of the share of abortions in total pregnancies. For Japan as a whole, the
pregnancy rate among fertile women declined at an average 5-year rate





of 14 percent from 1955 to 1965. The rate of decline was greater in the
rural regions (21 percent) than in the industrial regions (6 percent).
Indeed, by 1960 the pregnancy rate in rural regions had dropped below
that in industrial regions. During the 10-year period, the rate of increase
in the share of births and the rate of decrease in the share of abortions
were both greater in industrial than in rural regions, suggesting that
“unwanted” pregnancies declined more rapidly in industrial areas.
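
The relative changes in the pregnancy rate cited here can be recomputed from the levels in table 4. The following Python sketch is only an illustration: the names are hypothetical, and treating the "average 5-year-period change" as the simple mean of the two 5-year changes is an assumption that reproduces the All Japan figure but is not stated in the table notes.

```python
def relative_change(earlier, later):
    """Percentage change from the earlier to the later level."""
    return 100.0 * (later - earlier) / earlier


# Pregnancy rates for 1955, 1960, and 1965, taken from table 4.
pregnancy_rate = {
    "All Japan": (223.32, 187.01, 166.01),
    "Industrial regions": (206.25, 191.23, 183.00),
    "Rural-farm regions": (251.14, 191.30, 155.59),
}

changes = {}
for region, (y1955, y1960, y1965) in pregnancy_rate.items():
    first = relative_change(y1955, y1960)    # 1955-60
    second = relative_change(y1960, y1965)   # 1960-65
    # Assumption: the average is the simple mean of the two 5-year changes.
    changes[region] = (first, second, (first + second) / 2)

for region, (first, second, average) in changes.items():
    print(f"{region:20s} {first:8.2f} {second:8.2f} {average:8.2f}")
```

The same function applies unchanged to the shares of births and abortions in the table.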
What can one infer from these regional differences? Suppose that all
these differences are attributable to the differential spread of contraceptive
methods (i.e., differential reduction in costs of control). Then the implicit
assumption would be either that there was no change in demand for
fertility in industrial and rural regions, or that the decline in demand for
fertility did not vary systematically between the two types of regions.
The slower rate of increase in the share of births and the slower rate of
decrease in the share of abortions in rural regions would then presumably
be attributable mostly to a slower spread of contraceptive measures in
these regions. The more rapid decline of the pregnancy rate in rural
regions, however, is the opposite of what one would expect if contraceptive
measures did spread more slowly in these regions. Therefore, a more
plausible interpretation of these differences would rely on a combination
of differential change in fertility demand and differential spread of
contraceptive knowledge. One interpretation would be that fertility
demand declined more rapidly in rural than in industrial regions, resulting
in a more rapid decline of the pregnancy rate, but that the spread of
contraceptive knowledge was not rapid enough in rural regions to
diminish rapidly the use of abortions as a last resort to control fertility.
Although the changing age composition of females precludes a firm
judgment about the validity of the above interpretation,¹⁷ this interpretation
is supported by a comparison of the 1955-60 and 1960-65 periods.
The pregnancy rate dropped more rapidly during 1955-60 than it did
during 1960-65 in both regions. If this was attributable entirely to the
spread of contraceptive measures, one would also expect to observe more
rapid changes in both the share of births and the share of abortions in the
earlier period. Indeed, given the early 1950s’ reforms regarding the use
of contraceptives, this expectation would appear to be quite plausible, but
the course of events was just the opposite, as both shares changed more
rapidly in the later period (table 4). Again, a more plausible interpretation
would be that demand for fertility continued to decline in both
periods; that the spread of contraceptives increasingly facilitated effective

¹⁷ The industrial prefectures experienced a slight increase and the rural prefectures a
rather large decrease in the proportion of women of childbearing age (table 4). These
changes in age composition undoubtedly explain in part the more rapid decline in the
pregnancy rate in the rural than in the industrial prefectures.





TABLE 5

Diffusion of Contraceptive Use by Education Level
(Percentage of Couples Using Contraceptive Devices)

                        1950     1952     1955     1957     1959

All                     19.5     26.3     33.6     39.2     42.5
Husband’s education
  <9                    14.2     18.2     28.2     33.4     37.6
  10-12                 25.4     37.0     37.7     46.5     43.9
  13+                   37.3     47.0     48.8     52.5     54.0
  Range                 23.1     28.8     20.6     19.1     16.4
  (MAD/mean)           30.31    31.03    18.39    16.25    13.06
Wife’s education
  <9                    13.0     20.1     28.2     33.3     35.0
  10-12                 32.4     38.7     46.1     48.1     51.6
  13+                   36.0     59.1     47.8     53.2     51.9
  Range                 23.0     29.0     19.6     19.9     16.9
  (MAD/mean)           34.75    33.59    20.47    17.16    16.09

Source: Japan Ministry of Health and Welfare (1959).
Note: Range is the difference between the largest and the smallest values. The MAD is the mean absolute
deviation. In all years except 1959 the data refer to couples with wife under age 49; for 1959, wife's age is
less than 50. The sample size varies from 3,500 couples in 1952 to 3,835 couples in 1959.


fertility control, thereby reducing unwanted pregnancies; and that the
role of abortion continued to diminish in importance as a control measure.
The above evidence documents an increased use of contraceptive
measures from 1955 to 1965. The relationship between parents’ education
and the use of contraceptives was discussed in Section III in the form of
the cost-of-fertility-control hypothesis. If there was a period in which the
use of birth control techniques was introduced or improved once and for
all, then this hypothesis predicts that differences among education groups
in the extent of use of birth control would widen at first, then narrow
gradually. In Japan, a marked increase in the availability of abortions and
birth control information occurred after 1948-52, so this period would
appear to provide a reasonably proper setting for testing this hypothesis.
Table 5 presents the percentage of families within each education class
(husband’s or wife’s) that reported using some birth-control technique.
The data are taken from surveys conducted in selected years to determine
the attitudes of Japanese families of various socioeconomic classes toward
contraceptive usage. In most education classes, the proportion of families
using birth-control techniques increased over the years, presumably
because costs of control declined and benefits increased. The differences
across education groups in the extent of contraceptive usage, as measured
by either the range or the mean absolute deviation (relative to the mean),
varied most in the early 1950s and persistently declined after 1952. One
detects a slight increase in these measures of dispersion in 1952 with the






TABLE 6

Completed Fertility by Education Level
(Average Number of Children Ever Born per Couple)

                        1940     1952     1962     1967

All                     5.10     4.47     3.91     3.36
Husband’s education
  Low                   5.19     4.62     4.05     3.45
  Middle                4.81     3.62     3.60     3.27
  High                  4.17     3.47     3.21     2.92
Wife’s education
  Low                   5.19     4.57     4.04     3.48
  Middle                4.39     3.58     3.47     3.14
  High                  4.74     3.13     3.09     2.69

Sources: Aoki and Nakano (1967, table 11) and Japan Ministry of Health and Welfare (1968, various
tables).
Note: For 1940, the figure is for couples married more than 21 years; for 1952, couples with wife age 45
or older; and for 1962 and 1967, couples married more than 20 years. Schooling definitions are: low, less
than 10 years; middle, 10-12 years; and high, 13 years or more.


start of the campaign to disseminate birth-control information. These
findings, though based on sparse information, agree with the predictions
of the cost-of-fertility-control hypothesis.
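
The two dispersion measures reported in table 5 can be approximated with a short Python sketch. It assumes that the mean and the mean absolute deviation are unweighted across the three education classes; this assumption is not stated in the table note, but it comes close to the published figures for husband's education. The function and variable names are illustrative.

```python
def dispersion(rates):
    """Return the range and the MAD as a percentage of the mean.

    Assumption: the three education classes are weighted equally.
    """
    mean = sum(rates) / len(rates)
    mad = sum(abs(r - mean) for r in rates) / len(rates)
    return max(rates) - min(rates), 100.0 * mad / mean


# Percentage of couples using contraceptives by husband's education
# (three schooling classes, lowest to highest), from table 5.
husband = {
    1950: [14.2, 25.4, 37.3],
    1952: [18.2, 37.0, 47.0],
    1955: [28.2, 37.7, 48.8],
    1957: [33.4, 46.5, 52.5],
    1959: [37.6, 43.9, 54.0],
}

results = {year: dispersion(rates) for year, rates in husband.items()}
for year, (spread, mad_pct) in results.items():
    print(f"{year}: range = {spread:.1f}, MAD/mean = {mad_pct:.2f} percent")
```

Both measures rise between 1950 and 1952 and fall in each later survey year, which is the widening-then-narrowing pattern the hypothesis predicts.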


Education and Fertility 

The preceding discussion suggests that a decline in fertility demand and
increased access to abortions and birth-control information played a
joint role in producing the observed decrease in fertility in Japan. I now
explore the relationship between fertility and parental education in the
light of the hypotheses discussed in Section III.

According to the tastes and cost-of-fertility-control hypotheses,
education of either parent is expected to be negatively associated with
fertility. For Japan, education of the husband, because of his dominant
family role, is likely to be a more significant explanatory variable than
is the wife’s education. The cost-of-time hypothesis predicts that the
husband’s education will be weakly, but positively, associated with
fertility and that the wife's education will have a strong negative association
with fertility.

Table 6 presents a measure of completed fertility by education groups
for 1940, 1952, 1962, and 1967. The data are from Institute of Population
Problems surveys. Education of either parent generally is negatively
associated with fertility, in agreement with findings for other countries.
These data also document the persistently declining trend in completed
fertility at all levels of parental education, underscoring that education
is by no means the only factor contributing to the decline in completed
fertility over time.









TABLE 7

Trends in Completed Fertility Standardized by Education

                               Survey Years                Mean
                                                           Absolute
                        1940     1952     1962     1967    Deviation

Actual                  5.10     4.47     3.91     3.36      0.85
Standardized:
  Husband’s education   4.99     4.21     3.84     3.36      0.75
  Wife’s education      4.93     4.22     3.84     3.36      0.72

Source: Calculated from table 6.
Note: Standardized rates were calculated by using the 1967 distribution of couples by education of
husband and wife. The mean absolute deviation was calculated as the average of absolute deviations of
annual values from the 1967 value.


The negative association of fertility and the education of either parent
may at first appear to agree with the prediction of the tastes and the
cost-of-fertility-control hypotheses. Unfortunately, these data are not cross-classified
by education of parents, so that one cannot discern the “pure”
relationship between the fertility rate and the wife’s and husband’s
education separately. Since the levels of husband’s and wife’s education
are (positively) correlated, table 6 reflects the joint influence of both levels
of education. Since the correlation is not perfect, one may still attempt
(see table 7) to determine the relative importance of the education of
either parent in “explaining” observed differences in fertility.

Table 7 presents the observed completed fertility in the 4 survey years
and estimates of completed fertility obtained by using the 1967 distribution
of educational attainment of husband and wife. That the general
increase in the level of school attainment played a role in the decline in
observed fertility is demonstrated by standardized rates which are
consistently lower than actual rates in every year. The difference between
standardized fertility rates using husband’s and wife’s education appears
to narrow in later years. One reason for this tendency may be the increased
correlation of husband’s and wife’s educational attainment accompanying
a general increase in the dispersion and level of women’s educational
attainment. According to the mean absolute deviation of yearly values
from the 1967 value (shown in table 7), the year-by-year differences in
the fertility rate are decreased when yearly rates are standardized by
education of either parent, but the reduction is somewhat greater when
the distribution of the wife’s rather than of the husband’s education is
used. Standardizing by the husband’s education reduces the mean absolute
deviation by 12.8 percent; standardizing by the wife’s education
decreases it by 15.3 percent. Thus, the trend in the distribution of the
wife’s education appears to have been more important than that of the
husband’s in the fertility trend.
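
As an illustration only, the standardization described above can be sketched as follows. The education categories, shares, and fertility rates are hypothetical and are not taken from table 7; the function names are likewise assumptions made for this sketch.

```python
# Direct standardization of completed fertility by education.
# All numbers below are hypothetical illustrations, not data from table 7.

def standardized_rate(group_rates, shares):
    """Average the education-specific fertility rates using the given shares."""
    assert abs(sum(shares.values()) - 1.0) < 1e-9, "shares must sum to one"
    return sum(group_rates[group] * shares[group] for group in shares)


def mean_abs_deviation(yearly_values, reference_value):
    """Mean absolute deviation of yearly values from a reference-year value."""
    return sum(abs(v - reference_value) for v in yearly_values) / len(yearly_values)


# Hypothetical wife's-education shares in the reference year (standing in for 1967).
reference_shares = {"elementary": 0.5, "middle": 0.3, "high": 0.2}

# Hypothetical education-specific fertility and education shares in an earlier year.
earlier_rates = {"elementary": 4.0, "middle": 3.0, "high": 2.0}
earlier_shares = {"elementary": 0.7, "middle": 0.2, "high": 0.1}

# The actual rate uses the earlier year's own education mix; the standardized
# rate applies the reference-year mix to the same education-specific rates.
actual = standardized_rate(earlier_rates, earlier_shares)
standardized = standardized_rate(earlier_rates, reference_shares)
```

With a reference-year distribution that is more educated than the earlier year's, the standardized rate falls below the actual rate, which is the pattern the text reports for every survey year.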

Since the data for table 7 were not cross-classified by parents’ education,
one could not directly observe the “pure” effect of the education of either
parent. To determine the relative magnitudes of the pure effects of
parents’ education on fertility, multiple regressions of the following form
were estimated using cross-section data for 46 prefectures:

$$C = a + b_1(Ed_{MN}) + b_2(Ed_{MH}) + c_1(Ed_{FN}) + c_2(Ed_{FH}) + f(\text{other variables}) + u,$$

where $C$ = fertility rate; $Ed_{MN}$ = proportion of males who never
attended school; $Ed_{MH}$ = proportion of males who completed
approximately 10 or more years of schooling; $Ed_{FN}$ = the same as
$Ed_{MN}$, for females; $Ed_{FH}$ = the same as $Ed_{MH}$, for females;
other variables = other standardizing variables discussed in the text;
and $u$ = disturbance term.
Since time-series data on education and other variables are lacking for
Japan, as for many other countries, the regression analysis was restricted
to cross-section data. Regressions reported in table 8 use the average
number of children ever born for selected age groups of women (completed
fertility) as the dependent variable; in regressions in table 9 the
live birth rate is the dependent variable. The Japanese census presents the
distribution of educational attainment in four categories: none, elementary,
middle, and high. Instead of estimating a single measure of
educational attainment, such as the mean or median, I used two categories
of the education variable for each parent. One advantage in using these
measures is that the coefficients, that is, $b_1$ and $b_2$ for the husband and
$c_1$ and $c_2$ for the wife, may reveal differential effects on fertility of a shift
in the education distribution between the lower and the upper segments
of the distribution. The expected signs of the regression coefficients
according to the hypotheses discussed earlier are shown below.


Coefficient     Tastes and Cost of          Cost of Time†
                Fertility Control*

$b_1$           > 0                         ≤ 0
$b_2$           < 0                         > 0
$c_1$           > 0                         > 0
$c_2$           < 0                         < 0

* A weaker version of the above: $-b_1 + b_2 < 0$ and $-c_1 + c_2 < 0$.
† A weaker version of the above: $-b_1 + b_2 \geq 0$ and $-c_1 + c_2 < 0$.


Regressions of Completed Fertility 

For the regressions of completed fertility reported in table 8, each
observation was weighted by the square root of the number of
age-specific ever-married women in each prefecture. In addition to the
education variables, standardizing variables included in these regressions
are the percentage of persons, age 15 and older, employed in agriculture,
forestry, and fisheries, and the percentage of the population living in
urban areas. These variables refer to 1960 and therefore do not reflect
accurately the background in which the couples made fertility decisions.
For these variables to have substantive meaning, one must assume that
the majority of couples have lived in the prefecture in which they resided
in 1960 for most of their married lives and that the rankings of prefectures by
these variables have not changed significantly over time. Unfortunately,
neither income nor earnings is held constant in these regressions for lack of
data pertaining to these age groups.

[Table 8. Regressions of completed fertility. The table body is not recoverable.]
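
The weighted regression described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the author's procedure: "weighting by the square root" is read here as multiplying each prefecture's observation by the square root of its number of women, the solver is a plain normal-equations routine, and every data value (education shares in percent, counts of women, coefficients) is hypothetical.

```python
import math


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x


def weighted_ols(rows, y, counts):
    """Least squares with an intercept, each observation scaled by sqrt(count)."""
    w = [math.sqrt(c) for c in counts]
    xs = [[wi] + [wi * v for v in row] for wi, row in zip(w, rows)]
    ys = [wi * yi for wi, yi in zip(w, y)]
    k = len(xs[0])
    xtx = [[sum(r[i] * r[j] for r in xs) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(xs, ys)) for i in range(k)]
    return solve(xtx, xty)


def weaker_cost_of_time(b1, b2, c1, c2):
    """Weaker cost-of-time prediction: -b1 + b2 >= 0 and -c1 + c2 < 0."""
    return (-b1 + b2 >= 0) and (-c1 + c2 < 0)


# Hypothetical prefectures: columns are Ed_MN, Ed_MH, Ed_FN, Ed_FH (percent).
education = [
    [10, 20, 15, 10], [5, 30, 10, 20], [20, 10, 30, 5], [15, 25, 20, 15],
    [8, 35, 12, 25], [25, 15, 28, 8], [12, 22, 25, 12],
]
women = [1200, 800, 1500, 950, 700, 1100, 1300]  # hypothetical counts of women

# Hypothetical coefficients (a, b1, b2, c1, c2) used to generate noise-free fertility.
TRUE = [3.0, -0.01, 0.02, 0.03, -0.04]
fertility = [TRUE[0] + sum(t * v for t, v in zip(TRUE[1:], row)) for row in education]

a, b1, b2, c1, c2 = weighted_ols(education, fertility, women)
```

Because the hypothetical fertility values are generated without noise, the fit returns the generating coefficients whatever the weights; with real data the weights would matter, giving more influence to prefectures with more women in the age group.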

Regressions 1 and 2 were run for married women, ages 40-49 and 50-59,
respectively. No coefficient in regression 1 is significant. The ambiguous
result obtained for this cohort may be partly because the majority of
these women were married during World War II. Uncertainty about
the future and participation by males in war activities may have distorted
family-size decision making. The sizes of the regression coefficients for
the education variables satisfy the weaker prediction of the cost-of-time
hypothesis, however. For regression 2, the results are less ambiguous.
Judging from the t-values, both Edg^ and Ed^f^ are significant at the
1 percent level, and the F-values for the education variables indicate that
both the husband’s and the wife’s education are statistically significant
at better than 1 percent. Both husband’s and wife’s education obtain
regression coefficients that are largely consistent with the cost-of-time
hypothesis.

To determine if the fertility relationships for the two groups of married
women were different, I tested three null hypotheses using Chow’s (1960)
test for equality of two regressions. The three null hypotheses were that
(1) the two relationships are completely the same, (2) the two relationships
are the same except for the intercept, and (3) the two relationships
are the same only with respect to the education variables. The first null
hypothesis was rejected (F = 10.93), but the latter two were accepted.
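
For readers unfamiliar with the test, the first null hypothesis (complete equality of the two relationships) can be sketched as below. This is a simplified illustration, assuming a single regressor plus an intercept rather than the full set of variables in table 8; the data points and function names are hypothetical.

```python
# Chow test for equality of two regressions, simplified to one regressor
# plus an intercept (k = 2 parameters). All data are hypothetical.

def rss_simple(x, y):
    """Residual sum of squares from an OLS fit of y on an intercept and x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    syy = sum((yi - mean_y) ** 2 for yi in y)
    return syy - sxy ** 2 / sxx


def chow_f(x1, y1, x2, y2, k=2):
    """F-statistic for the null that both groups share all k coefficients."""
    rss_pooled = rss_simple(x1 + x2, y1 + y2)            # restricted: one common line
    rss_separate = rss_simple(x1, y1) + rss_simple(x2, y2)  # unrestricted: two lines
    dof = len(x1) + len(x2) - 2 * k
    return ((rss_pooled - rss_separate) / k) / (rss_separate / dof)


# Hypothetical groups: the first slopes upward, the second is nearly flat.
x_a, y_a = [1, 2, 3, 4, 5], [2.1, 2.9, 4.2, 4.8, 6.0]
x_b, y_b = [1, 2, 3, 4, 5], [5.0, 5.2, 5.1, 4.9, 5.3]

f_different = chow_f(x_a, y_a, x_b, y_b)   # large: the two lines differ
f_identical = chow_f(x_a, y_a, x_a, y_a)   # essentially zero: same line twice
```

The statistic is compared with the F distribution with k and n1 + n2 - 2k degrees of freedom; a large value, as with the F = 10.93 reported in the text, leads to rejection of complete equality.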

Based on these findings, I pooled the observations for the two groups and
obtained regressions 3 and 4. Regression 3 assumes that only the intercept
differs, while regression 4 assumes that everything except the education
coefficients differs between the two groups. The results for pooled
regressions 3 and 4 are quite similar. Both husband's and wife's education
are statistically significant at better than 1 percent, with regression
coefficients as expected from the cost-of-time hypothesis.

The results of these regressions clearly support the economic framework
based on the cost-of-time hypothesis. According to the coefficients of the
pooled regressions, male education variables higher by one standard
deviation unit, holding female education constant, are associated with
higher fertility by about 0.13 children. Holding male education constant,
female education variables higher by one standard deviation are associated
with about 0.30 fewer children. However, the relationship between
education and fertility is apparently nonlinear. The distribution of
either parent’s educational attainment has a greater effect on average
completed fertility when it shifts from no education to some education
than when it shifts from some to high education.¹⁷ While influences of
education on fertility through tastes and the cost of fertility control are
undoubtedly present, the dominant influences appear to be through
income and prices.

¹⁷ Ben-Porath (1973) reported a similar finding for Israel. In particular, the sharpest
decline in fertility was observed between women with no education and women with some
education.

These regressions refer to families which made fertility decisions before
and during World War II. One would expect the decision-making process
to have been more male-dominated in these than in postwar years. Yet
female education is one of the most important variables in these regressions,
a finding which strongly supports the relevance of the economic analysis
of fertility. For this paper, regressions of completed fertility could not be
run for families making fertility decisions during the postwar years, since
the 1970 census was not available for all prefectures. As an alternative, I
now turn to regressions of live birth rates in 1960.


Regressions of Live-Birth Rates

Multiple regressions of the number of live births in 1960 per thousand
married women, age 15 or older, are reported in table 9. In addition to
the percentage employed in agriculture and the percentage living in
urban areas, these regressions hold constant the age composition of
women and, in some cases, wage rates and labor-force participation rates.
The age-composition variable is a weighted average of age-specific birth
rates for Japan, the weights being the percentage of women in each age
class in each prefecture. The percentage distributions of all women rather
than of only married women were used because the latter may reflect
endogenous decisions about the timing and duration of marriage. Both
the female wage rate and the female labor-force participation rate are
expected to affect fertility negatively. The male wage rate and the male
labor-force participation rate would capture income effects and are
expected, therefore, to obtain positive regression coefficients.
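
The age-composition variable lends itself to a short sketch. The age classes, national birth rates, and prefectural shares below are hypothetical placeholders, and the function name is an assumption made for illustration.

```python
# Age-composition variable: national age-specific birth rates averaged with
# each prefecture's own age distribution of women. All numbers are hypothetical.

def age_composition_index(national_rates, prefecture_shares):
    """Weighted average of national age-specific birth rates for one prefecture."""
    assert abs(sum(prefecture_shares.values()) - 1.0) < 1e-9, "shares must sum to one"
    return sum(national_rates[age] * prefecture_shares[age] for age in national_rates)


# Hypothetical national births per thousand women, by age class.
national_rates = {"15-24": 60.0, "25-34": 180.0, "35-49": 30.0}

# Two hypothetical prefectures with younger and older female populations.
younger = {"15-24": 0.3, "25-34": 0.4, "35-49": 0.3}
older = {"15-24": 0.2, "25-34": 0.3, "35-49": 0.5}

index_younger = age_composition_index(national_rates, younger)
index_older = age_composition_index(national_rates, older)
```

Holding such an index constant in the regression separates differences in birth rates that arise only from the age structure of women from differences associated with education and wages.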

The results for the birth-rate regressions are largely consistent with the
predictions of the cost-of-time hypothesis. An increase in male education,
holding constant female education, increases, and an increase in female
education, holding constant male education, decreases the birth rate.
Female education is always more significant than male education.
Female education tends to remain statistically significant with negative
effects on fertility even after the female wage rate and labor-force
participation rate, either separately or together, are introduced in the
regressions. As expected, the female wage rate obtains negative and
significant regression coefficients. The male wage rate obtains coefficients
that are insignificant, though with expected signs. Neither the agricultural
employment nor the urbanization variable has significant coefficients,
perhaps because urban-rural differences in birth rates had largely
disappeared by 1960 (see table 1).



Adding wage-rate variables to the regressions greatly reduces the effect
of female education on the birth rate. Without the wage-rate variables in
regressions 1 and 3, a uniform percentage shift upward in the female
education distribution reduces the birth rate by about 10 (for example,
-3.279 - 6.322 = -9.6 for regression 1), but the magnitude drops to
about five when the wage-rate variables are introduced in regressions 2
and 4. This finding is not surprising since the cost-of-time effects are
likely to be captured jointly by female education and wage rates. The
female wage rate also reduces the magnitude and significance of the female
labor-force participation rate, as expected if decisions about fertility and
labor supply are made simultaneously on the basis of the wage rate.¹⁸
The nonlinearity usually observed in the effects of female education is
observed only when wage rates are held constant. Otherwise, the results
are ambiguous: the deterrent effect of female education appears greater
when education increases from some to high than from none to some
schooling, though the regression coefficients for “high” are never
statistically significant.

¹⁸ A rise in the female wage rate, other things being equal, increases the attractiveness
of labor-force participation relative to home production, including child rearing.
Therefore, increased female wage rates not only directly reduce fertility demand but also
increase labor-force participation, which in turn indirectly reduces fertility still more.
Moreover, the direction of causation may also run from higher birth rates in an area to
lower female labor-force participation there. To allow for such simultaneity, regressions
3 and 4 were also estimated by two-stage least squares, treating female labor-force
participation as endogenous. However, these estimates did not substantially differ from the
ordinary least-squares estimates. The labor-force participation regressions seem to have
been plagued by multicollinearity. Although the regressions were statistically significant,
judging by the F-values, with coefficients of determination above 0.70, they did not
successfully disentangle effects of the exogenous variables.

These regressions suggest that area differences in the birth rate reflect
differences in the female variables more than in the male variables. For
example, according to regression 2, increasing all the male variables by
one standard deviation unit, holding constant other variables, increases
births per thousand married women by about three, but similar increases
in female variables decrease the birth rate by 10. Calculations for
regression 4 show the effect of male variables to be +1.2 and the effect of
female variables to be -10.9.


V. Summary and Agenda

Increased access to abortions in Japan undoubtedly contributed to the
post-World War II acceleration of that country’s decline in fertility. But
I conclude that fertility would still have declined at a more rapid rate
than in the prewar period, which had been characterized by a mild and
rather monotonic decline, if abortion had not been legalized. Therefore,
dissemination of contraceptive information and/or an accelerated decline
in the demand for fertility must also have contributed to the trend.

Cross-section analyses of 1960 data reveal that the economic theory of
fertility, with its emphasis on the effects of female education and earning
power on fertility demand, provides a useful framework for analyzing
fertility differentials in Japan. In particular, regressions of completed
fertility and live birth rates in 1960 show that female education and
earnings affect fertility negatively and are the most significant variables
in the regressions. Male education and earnings tend to have positive but
less significant effects. On the basis of only sociological, demographic, or
taste considerations, one would not necessarily expect to find female
education more important than male education in explaining fertility
differentials. Indeed, in Japan, because of the male’s generally dominant
role in household decisions, one would have expected to find male education
to be at least as relevant as female education, if not more so, in
explaining fertility differences.

How useful are these results in interpreting the fertility trends in Japan?
A tentative answer can be obtained from the regressions in table 9.
Assuming that the parameter estimates remain stable over time and using
the observed magnitudes of change between 1950 and 1960 for all the
explanatory variables except for the age-composition variable, my results
predict the decline of births per thousand married women to be only 4
percent at the mean according to regression 2, and 3 percent according
to regression 4. The actual decline between 1950 and 1960 was about 42
percent, of which one would hope to explain about half if legalization of
abortion explains another half. For the period 1960-70, both regressions
predict a decline, while in fact births per thousand married women
increased somewhat. On the basis of these results, one may be inclined
to doubt the usefulness of cross-section regressions in interpreting the
trend.

However, since the regression models and the economic hypotheses
underlying them pertain to long-run adjustments, one would expect from
these regressions better predictions of longer-term changes in fertility. It
is possible that the sharp decline of fertility during the 1950s and the
subsequent leveling off in the 1960s represent swings around a longer-term
trend that has persisted since 1920. If so, a more appropriate comparison
of actual and predicted declines might be made between 1950
and 1970. The predicted decline for this period was 19 percent according
to regression 2 and 15 percent according to regression 4. The actual
decline was about 30 percent. These regressions appear, therefore, to be
more helpful in providing an interpretation of the longer-term fertility
decline in terms of price and income variables, but they are not likely to
facilitate understanding of the shorter-term movements by which the
decline may actually occur.

¹⁹ T. P. Schultz (1973) discusses problems in the interpretation of time series from
cross-section estimates of parameters. Here I ignore the kinds of problems he raised.

²⁰ A similar observation is made by Okazaki (1970, p. 26).


TABLE 10

A Comparison of Trends in Fertility and Economic Variables, 1950-69
(Japan and United States)

                                                           Rates of Change (%)
                                   1950     1960     1969    1950-60  1960-69  1950-69

Crude birth rate:
  Japan                            28.1     17.2     17.9*    -38.8      4.1    -36.3
  U.S. (white)                     23.5†    23.2‡    16.6      -1.3    -28.5    -29.4
Total fertility rate:
  Japan                            3.63     1.99     2.01§    -45.2     1.05    -44.6
  U.S. (white)                     3.05†    3.56‡    2.37‖     16.7    -33.4    -22.3
Real GNP per capita (1950 = 100):
  Japan                           100      247.4    529.3     147.4    114.0    429.3
  U.S.                            100      115.2    152.7      15.2     32.5     52.7
Real wage (1950 = 100):
  Japan                           100      170.2    253.9      70.2     49.2    153.9
  U.S.                            100      136.2    177.3      36.2     30.2     77.3
Percentage employed in agriculture:
  Japan                            48.7     30.0     17.8     -38.4    -40.7    -63.4
  U.S.                             12.1      8.3      4.6     -31.4    -44.6    -62.0
Percentage female in non-agricultural employment:
  Japan                            29.0     35.1     36.6      21.0      4.3     26.2
  U.S.                             31.3     34.6     38.3      10.5     10.7     22.4

Sources. U.S. data: U.S. Bureau of the Census, Statistical Abstract of the United States
(various issues) and U.S. Department of Health, Education, and Welfare (196Sa). Japanese
data: Japan Prime Minister’s Office, Statistical Yearbook of Japan (various annual issues)
and Mainichi Newspaper Company (1970).
* Average between 1963 and 1970.
† Average between 1949 and 1951.
‡ Average between 1957 and 1960.
§ Average between 1965 and 1968.
‖ For 19^

In table 10 some of the salient features of the United States and
Japanese experiences during the postwar years are compared. If the
economic framework for fertility analysis has general validity, it would
predict more rapid fertility changes for countries experiencing more rapid
changes in their economies. Clearly, both the economic variables examined
and fertility rates changed more rapidly in Japan than in the United
States during 1950-69. A similar point is observed if one confines the
comparison to the periods in which fertility declined: 1950-60 for Japan
and 1960-69 for the United States.
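
The comparison restricted to the periods of declining fertility can be checked with a few lines of arithmetic. The sketch below uses levels reported in table 10 (total fertility rates and real-wage indexes); the function and variable names are assumptions made for illustration.

```python
# Percentage rates of change computed from levels reported in table 10.

def pct_change(start, end):
    """Percentage change from start to end."""
    return (end - start) / start * 100.0


# Total fertility rate: Japan 1950 -> 1960, United States (white) 1960 -> 1969.
japan_tfr_decline = pct_change(3.63, 1.99)
us_tfr_decline = pct_change(3.56, 2.37)

# Real wage index (1950 = 100) over the same two periods.
japan_wage_growth = pct_change(100.0, 170.2)
us_wage_growth = pct_change(136.2, 177.3)

# Both fertility and the real wage moved faster in Japan during its period of
# decline than in the United States during its own period of decline.
japan_changed_faster = (
    abs(japan_tfr_decline) > abs(us_tfr_decline)
    and japan_wage_growth > us_wage_growth
)
```

Some entries in table 10 may differ from such recomputed figures in the last digit, presumably because the published rates were calculated from unrounded levels.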

Much more work is needed before we fully understand the reasons for
the possible acceleration in the decline of fertility demand in Japan in
the 1950s and the leveling off in the 1960s. A promising area for
investigation is the interaction of the demands for quality and quantity of
children in Japan. There may have been an accelerated substitution of
quality for quantity during the postwar years because of income effects.
Also, the prospect of future economic growth may have raised the
expected return to education, and rising family income increasingly
facilitated the financing of children’s education beyond the level required
by law. It is noteworthy that the enrollment rate in kindergarten (which is
not compulsory) increased more sharply between 1950 and 1960 (by 95
percent per 5 years) than between 1960 and 1965 (by 58 percent). Also,
the enrollment rate in senior high school (not compulsory) rose more
rapidly between 1950 and 1960 (by 26 percent per 5 years) than between
1960 and 1965 (by 20 percent). The model may also be extended to
include such interrelated aspects of behavior as marriage and female
labor-force participation. Such a model may be estimated using more
disaggregated data, say by age, than I use here. The rapid decline in
Japanese fertility from 1950 to 1960 coincided with an increase in the
labor-force participation of females and a decrease in the proportion of
married women among women of childbearing age. For example, among
females, ages 20 to 39, living in urban areas, the labor-force participation
rate increased from 38 percent in 1950 to 43 percent in 1960, and the
proportion ever married decreased from 75 to 70 percent. The most
dramatic changes occurred for females, ages 20 to 24, among whom the
proportion ever married dropped from 43 percent in 1950 to 31 percent
in 1960 but showed no appreciable change in the 1960s. The pace of all
these developments appears synchronous with the fertility history of
postwar Japan, and possible relationships among them deserve rigorous
analysis.

Finally, it would greatly enhance the scope and accuracy of analysis
to have micro, or household, data for Japan on such related characteristics
as fertility, wage rates and family wealth, age, and work experience,
parents’ and children’s education, and contraceptive use. Ideally, such
data would be collected longitudinally for a given set of families and made
available to interested researchers in raw rather than summarized form.



Comment 


Gary R. Saxonhouse 

University of Michigan


At a conference of distinguished specialists on human capital theory my
comparative (absolute?) advantage is in the study of the Japanese
economy. Much the same can be said for Hashimoto’s paper. No attempts
at any significant new departures in human capital theory or in the
testing of such theory are made in this paper. Rather, one finds here an
extension of a now-familiar approach to Japanese data with now-familiar
results.¹ This is a study that can stand alongside the Israel and Taiwan
studies by Ben-Porath (1973) and T. Paul Schultz (1973), respectively.
Hashimoto’s paper is another line, and an admirably executed line at that,
in T. Paul Schultz’s table A1 (1973).

Hashimoto’s ultimate concern is in explaining the acceleration in the
1950s of the long-term decline in Japanese fertility and the subsequent
bottoming out of this decline in the 1960s. The achievements of this
particular paper are more modest. An attempt is made to challenge what
Hashimoto finds is a simplistic overemphasis on the diffusion of abortion
and contraceptive practice after 1948 (i.e., a downward shift in the supply
of birth-control schedules). As an alternative, Hashimoto stresses outward
shifts of the demand schedule for birth control resulting from the
increasing cost of female time in postwar Japan.²

¹ Hashimoto states that, previous to this paper, there had been but one other economic
study of fertility in Japan. This is a bit unfair. Only if the literature is restricted to that in
English and one assumes the economic study of fertility starts with Becker is this statement
true. Unlike the United States, in Japan throughout the twentieth century demographic
study remained within the province of economists. For a sample of Japanese work, see
Tachi and Arao (1935) and Tachi (1946).

² My reading of the Japanese literature suggests that most Japanese demographers
would applaud Hashimoto’s stress on changes in both the demand and supply schedules.
Some Japanese demographers do appear to part company with Hashimoto when it
comes to a discussion of the determinants of the demand-schedule shifts. There is a
Japanese emphasis on the role of wealth. Relative to the period prior to World War II,
Japan of the early 1950s is seen as a poor country.




I am much in sympathy with Hashimoto’s hypotheses. There is
considerable evidence to suggest that control of fertility within marriage
was widespread long before 1948, and I do find stress on the cost of female
time provocative.³ At the same time, I do not think Hashimoto’s various
analyses are altogether persuasive. For example, if instead of using
maximum crude birth rates and maximum general-fertility rates, maximum
total-fertility rates for 1950 and 1960 are calculated, it appears that the
practice of abortion might very well have tripled the rate of fertility
decline.⁴ The decline in the estimated maximum total-fertility rate
between 1950 and 1960 is 16.8 percent. This is only a modest acceleration
of the 12.6 percent rate of decline exhibited in the actual fertility rates
between 1930 and 1940. An acceleration of this magnitude might easily
be explained by the diffusion of modern contraceptive techniques during
the 1950s. This mild acceleration might also be viewed as a reaction to the
much higher-than-trend total-fertility rates between 1940 and 1950.
Thus, it would appear that if Hashimoto wishes to stress the role of
demand factors in the acceleration of Japanese fertility decline, he more
than likely cannot avoid an inquiry into the extent to which legal
abortions were substitutes for older forms of birth control.

Regardless of the magnitude of the acceleration remaining to be
explained by some factor other than the legalization of abortion, I am not
satisfied with Hashimoto’s analysis of the spread of contraception. That
the share of abortions in total births is high and rising in the rural regions
of Japan in the presence of a sharply declining pregnancy rate can be
taken as evidence that the spread of contraception is not sufficient to
explain fertility changes unexplained by the legalization of abortion only
if (1) abortion as a technique of birth control is already completely
diffused, and (2) abortion is viewed as a technique of last resort.⁵

I doubt either of the above conditions held for Japan during the 1950s.
Abortion diffused throughout Japan in much the same manner and only
slightly in advance of modern contraception. At any point during that
decade, for any age cohort, the better educated a couple and the larger
the city in which a couple lived, the higher the probability they had
experienced an abortion. Among couples who had never practiced
contraception, the probability of an abortion experience was relatively
low. Significantly, high rates of abortion were found among couples who
had practiced contraception and for one reason or another discontinued
this practice.⁶ I would conclude that abortion was not necessarily a
birth-control technique of last resort and that the rising share of abortions
in total pregnancies in rural areas during the 1950s might very well reflect
a positive reaction to the diffusion of a new technique.⁷

³ For a discussion of the history of birth control in Japan, see Honjo (1941).

⁴ The maximum total fertility rates in 1950 and 1960 are 4.16 and 3.46. I have
calculated these rates with data taken from Japan Ministry of Health and Welfare (1967)
and Aoki (1967).

⁵ The industrial-rural dichotomy presented in Hashimoto's table 4 is based on a
comparison of six urban prefectures with five rural ones. As an alternative, the share of
induced abortions among pregnancies of Japanese women not living in cities of 50,000
or more (the rate typically regarded as the rural rate) can be examined. This rate rose
throughout the 1950s, reaching a peak in 1961 (see Aoki [1967, p. 44]).

While Hashimoto does not make manifest the need for a new demand
variable to explain the movement of postwar Japanese fertility, the final
section of his paper does stand on its own as a test of competing
hypotheses in explaining regional differentials. The results reported in tables
8 and 9 tend to support Hashimoto’s de-emphasis of the role of the cost of
fertility control. In both sets of regressions, I have a particular problem
with the education variables Hashimoto chooses to use as explanatory
variables. The male education variables in the completed-fertility
regressions are based on the assumption that Japanese women are
marrying Japanese men of the same age. Actually, Japanese women of
the age cohorts in question were marrying men who were, on the average,
4 to 5 years older than themselves.⁸ The peculiarities of the Japanese
enforcement of their compulsory-education legislation make it a matter
of some importance whether a male reached school age before 1900 (as a
substantial proportion of the husbands of the women in these cohorts did)
or after 1905. There was considerable variability among prefectures in
the degree of male compliance with the compulsory education laws before
1900. After 1905, compliance was almost complete among the prefectures.⁹
The degree of compliance across prefectures is only weakly correlated
between the two periods. Interestingly enough, noncompliance for males
in the earlier period is rather strongly correlated with noncompliance
for females in the later period.¹⁰ It follows that if one is a believer in the
cost-of-fertility hypothesis (and only if one is a believer in the
cost-of-fertility hypothesis), it is possible to argue that the errors in the male
education variable are imparting a positive bias to the estimate of the
coefficient of the no-education variable for females and a negative bias
to the estimate of the coefficient of the no-education variable for males.
These biases could be quite substantial and might account for Hashimoto’s
interesting results.

⁶ The surveys on which the above analysis is based were conducted and periodically
published by the Mainichi Newspaper Company (1950); see also Taeuber (1958, p. 282).

⁷ There is every reason to believe that the spread of contraceptive methods in rural
Japan was rapid in the late 1950s. The aforementioned Mainichi surveys indicate a
better-than-25-percent increase in contraceptive practice between 1955 and 1959. These
same years saw family-planning expenditures by Koseisho (Ministry of Health and
Welfare) at their real height. Also, survey evidence analyzed by Aoki (1967, p. 57)
suggests that increases in contraceptive efficiency were also greatest during this period.

⁸ Summary statistics on the average age at first marriage, by sexes, are available in the
annual reports, Descriptive Materials on Japanese Vital Statistics, published by the Japan
Prime Minister’s Office (1917/).

⁹ In 1900, the percentage of the male school-age population attending primary school
or otherwise complying with the compulsory-education laws ranged across prefectures
from 79.1 to 97.5 percent. By 1905 the degree of compliance varied from 94.7 to 99.5
percent. These statistics are from annual reports published by Japan Ministry of Education
(1904/).

¹⁰ Using the 1960 census data, the simple correlation coefficient between the
prefectural pattern of males with no education, age cohorts 50-59 and 60-69, is .39. The
simple correlation coefficient between the prefectural pattern of males with no education
in the age cohort 60-69 and females with no education in the age cohort 50-59 is .83.

Unlike the completed-fertility regressions, Hashimoto’s regressions
reported in table 9 deal entirely with postwar reproductive activities and
specifically with 1960 live births. The education variables in these
regressions refer to the entire (over age 15) prefectural population.
Unfortunately, virtually the entire interprefectural variation in the proportion
in the no-education variables relates to the experience of age cohorts that
in 1960 were well beyond their reproductive years! I do not doubt that,
inspired by Leibowitz’s paper in this volume, work can be done on how
a mother’s education affects a daughter’s fertility, but this is not the
hypothesis in question.¹¹

Also with respect to these same regressions, I am a bit puzzled by the
use of the logarithm of earnings rather than arithmetic earnings. I suspect
one gets a higher t-statistic on female wages when this form is used, but
when using what should be thought of as grouped data, I am unsure of the
theoretical justification.

In concluding my remarks, I would like to offer a suggestion for future
research. Notwithstanding Hashimoto’s very useful analysis, I do not
find the postwar period to be the most interesting stretch of Japanese
demographic history. While there are many instances of fertility
decline in the context of rapid economic growth, the experience of Japan
between 1875 and 1920 appears to be unique. There was rapid economic
growth starting from a relatively high income base, a declining agricultural
labor force, a revolution in female education coexisting with a
high and, as recent research indicates, very likely a rising level of
fertility.¹² Further, in view of the Japanese experience after 1920, I


¹¹ Nationwide, 82 percent of men with no education and 96 percent of women with
no education came from the age groups 50 and older and 40 and older, respectively. The
simple correlation by prefecture for never-educated men above age 50 and younger than
age 50 is .25. The simple correlation by prefecture for never-educated women above and
below age 40 is .21.

¹² The official statistics collected for Japan suggest rising fertility throughout most of
this period. These statistics have been traditionally treated with considerable skepticism
on account of underreporting in the earliest periods. Taeuber, in analyzing these statistics,
suggests that fertility probably rose substantially during the first part of this period and
subsequently declined slowly (see Taeuber [1958, pp. 52-54]). Estimates which allow
for the traditional criticisms of the official statistics have been prepared by Okazaki
(1965). Okazaki estimates that the Japanese population in 1875 was almost 2 million
above the official statistics. More recently, Akasaka Keiko has provided another series of
alternative estimates. Somewhat surprisingly, in view of the earlier criticisms, she finds
that the official statistics overestimate population by 600,000 in 1875 and understate the
rate of population growth at least through 1910. For Akasaka’s estimates, see Hitotsubashi
University Institute of Economic Research (1968). For an informative discussion of the
alternative estimates, see Umemura (1969).




would submit that this episode constitutes a fascinating and important
puzzle for those who would stress the role of economic factors in general,
and the cost of time, in particular, in the explanation of fertility history.



Household and Economy: Toward a New 
Theory of Population and 
Economic Growth 


Marc Nerlove

University of Chicago


It is somewhat unusual to begin the treatment of a subject
with a warning against attaching too much importance to
it, but in the case of economics, such an injunction is quite
as much needed as explanation and emphasis of the importance
it really has. It is characteristic of the age in which we
live to think too much in terms of economics, to see things
too predominantly in their economic aspect. There is
no more important prerequisite to clear thinking in regard
to economics itself than is recognition of its limited place
among human interests at large. [Frank H. Knight, 1933
(1965)]


1. Introduction: Malthus Revised 

Malthus, in essays published in 1798 and 1830, and the classical economists
combined a very simple model of family decision making—
procreation without bound except possibly by “a foresight of the difficulties
attending the rearing of a family and the actual distresses of
some of the lower classes, by which they are disabled from giving the
proper food and attention to their children” (Malthus 1970, p. 89)—
with an equally simple model of the operation of the economy. According
to the latter, a high level of capital accumulation induced by a high level
of profits—representing the difference between output and the rent of
land (natural resources) and wages—permitted a continual increase in
output and population, albeit at the cost of resort to land of increasingly

I am indebted to Gary S. Becker, Glen Cain, Richard B. Freeman, Margaret G. Reid,
and T. W. Schultz for discussions concerning the subject of this paper and for comments
on a related paper; Virginia Thurner contributed valuable editorial advice. This research
was supported by a grant from the Rockefeller Foundation to the University of Chicago
for the study of the economics of population and family decision making.


poorer quality; it did not, as the result of the model of family decision
making, lead to a rising standard of living for most people. Thus, the
classical economists achieved a very simple model of economic growth
and development (Baumol 1970, pp. 13-21). Modern growth theorists in
the tradition of Solow (1956) and Swan (1956) have developed theories
of economic growth based on far more elaborate theories of the economy,
but few theories of population growth and family decision making have
gone much beyond the Malthusian model (Pitchford 1973, pp. 1-10).
Although natural-resource constraints may be readily incorporated
through the device of diminishing returns to scale in the variable factors
(Swan 1956, pp. 340-42), it is a constant proportional rate of population
growth, perhaps aided and abetted by exogenous technological progress,
that essentially drives the mechanism. Discussions of the optimal rates of
population growth or level of population do often attempt to integrate
an endogenously determined population in the model. But none, to my
knowledge, have examined the feedback from changes in the economy
and changes in the relative prices and costs which families face when they
decide how many children they will have and what they will invest in
those children’s health, nutrition, or education, although Pitchford (1973)
does discuss the costs of and returns to population control at the macro
level.

In recent years, the recognition, crucial to the understanding of long-term
growth, that much investment which occurs in the economy is
made in human beings rather than in physical capital and that fertility
itself is shaped in important ways by economic considerations has led to
renewed interest in the economics of household decisions. In that type
of unit, not only decisions about fertility, but also those related to investments
in human capital, consumption and savings, migration, labor-force
participation, and, in a sense, marriage itself, are made.

With this in mind, in Section 2 I briefly and critically describe the
fundamental elements of the largely static theory of household production
and choice. This theory was developed in its modern form by Gary Becker
(1965) and others, but most of its essentials originated in the much earlier
work of Margaret Reid (1934), and it owes a good deal to Wesley
Mitchell’s seminal observations in his essay (1912) on “The Backward
Art of Spending Money.” Section 3 concludes this paper with some
speculations on how the “new home economics” may be integrated in a
theory of economic growth and development through an understanding
of the way in which investment in human capital increases the value of
human time and thus changes over time the resource constraints and the
relative costs and prices which “households” face in their decisions on
the number and quality of children they attempt to produce. The
question as to whether this constitutes a true economic explanation of the
so-called demographic transition, and thus a revision of the Malthusian
tradition, is basically an empirical one and is left open.




2. The “New Home Economics”: Summary and Critique

In its most unadorned form, economics is the theory of allocation of
limited resources among competing ends in order to maximize satisfactions
(or utility), subject to the constraints imposed by limitations in the
availability of the resources required to achieve those ends. Various
elaborations and accretions are necessary to accommodate this central
theoretical core to the dynamics of choices made sequentially over time
and in the presence of uncertainty regarding future constraints and future
preferences. At several points I shall have more to say about our present
failure to reach successful accommodations in our underlying theory in
these directions, as well as about our even more important limitations as
economists to cope with the whole complex of issues raised by intergenerational
transfers, within society as an ongoing concern, and particularly
within the family.

The first element in the new home economics is the utility function to
be maximized. Its form and its arguments (i.e., what variables determine
its level) are obviously crucial in determining the choices which result
from its maximization. But whose utility function is it that is maximized
in connection with choices pertinent to marriage, children, consumption
of commodities, work and leisure, and investment in all forms of capital?
Considering the household as already formed, much of the theoretical
underpinning rests on what I have called elsewhere the “Chicago model”
(Nerlove 1972b).¹

The “Chicago utility function,” if I may call it that, has several key
characteristics. First, it does not involve nonmarket goods or physical commodities
and purchasable services as we usually think of them in economics,
but its arguments are abstract goods composed of a number of
“attributes” which must themselves be produced within the household
(Becker 1965; Lancaster 1966; Muth 1966). The importance of this
characteristic of the utility function is that it leads directly to the key
questions of household technology and the composition of different types
of market goods and services and physical commodities, in terms of
attributes contributing to satisfactions. This, indeed, is the point made
forcefully by Muth (1966) and recently emphasized by Michael and
Becker (1972). Earlier (1947), Leontief pointed out that the theory of
consumer behavior as then developed, although of great generality,
lacked content to the extent that it gave no clue to the types of relations
to be expected among different categories of goods:

The assumption of the existence of general categories of needs,
different from demands for particular individual commodities,

¹ At this conference, Jacob Mincer objected to this and said that the model should
really be called the “Morningside Heights” model; however, force of habit leads me to
persist in this terminology.




but still specific enough to be clearly distinguishable from each
other, is basic to the man-in-the-street idea of consumers’
demand. One speaks of the desire for food as existing behind
and separately from the particular demand for bread, apples,
or Lobster à la Newburg. This need for food is at the same time
spoken of as something clearly distinguishable from the similarly
general needs for clothing or, say, for shelter, each of the latter
also thought of as existing separately although manifested
through the particular demands for one-family houses, apartment
flats, or woolen suits and raincoats. [Leontief 1947, p. 371]

Strotz (1957), in his introduction of the notion of a utility tree and in
his later (1959) discussion with Gorman, attempted to give more empirical
content to the theory of consumer demand in precisely this way,
although he interpreted his results in terms of a “budgeting” process. An
important aspect of the household production model, including time and
market-purchasable goods, as introduced by Becker (1965), Lancaster
(1966), and Muth (1966), is precisely, as Muth points out, that it does
yield “a utility function which is weakly separable when viewed as a
function of commodities purchased on the market” (Muth 1966, p. 700).
While it is true, in a sense, that many conclusions of the “new home
economics” could be derived directly from Strotzian utility trees and other
specializing assumptions, the home-production aspect of the Chicago
function lends an intuitive insight and empirical content which are
lacking in the more abstract formulation. It suggests a more direct look
at the technology of processes within the household and particularly at
how such processes use household time and nonpurchased inputs in
addition to market-purchasable commodities. Indeed, the supposed
differences in the time intensity of the production of household goods give
much of the content to recent applications of the “new home economics”
to the problems of fertility and human capital formation.² I return to this
point in Section 3 where I speculate on how the increasing value of human
time works through a Chicago utility function/household-production
model to alter the behavior of generations through time.

A second key characteristic of the Chicago utility function is that it is
just that, one utility function—the welfare of the children and other
members of the family is assumed to enter the utility function of a single
decision maker (not always the husband and father!), thus obviating the
assumption of a “family utility function” with all the concomitant

² Unfortunately, to date relatively little attention has been paid to the implications of
the household production model for the more general composition of consumption. Time
and market-purchasable-commodity intensities do differ greatly for different types of
consumption (e.g., drinking beer or going to a concert), and the new home economics
has implications going beyond fertility and human capital formation (see, however,
Michael [1972]).




problems of social utility functions in general. It is perhaps not entirely
accurate to identify this position too closely with Chicago; it should
perhaps be called the “Samuelsonian finesse.” Samuelson (1956) writes:

Where the family is concerned the phenomenon of altruism
inevitably raises its head; if we can speak at all of the indifference
curves of any one member, we must admit that his tastes and
marginal rates of contribution are contaminated by the goods
that other members consume. These ... external consumption
effects are the essence of family life. ... Such problems of home
economics are, abstractly conceived, exactly of the same logical
character as the general problem of government and social
welfare. [P. 9]

... if within the family there can be assumed to take place an
optimal reallocation of income so as to keep each member’s
dollar expenditure of equal ethical worth, then there can be
derived for the whole family a set of well-behaved indifference
contours relating the totals of what it consumes ... the family
can be said to act as if it maximizes such a group preference
function. [P. 21]

The problem with the Samuelsonian finesse, however, is that it assumes 
a fixed family membership, and a great deal of what the Chicago utility 
function is designed to explain is how that family composition gets 
determined This requires much more than Samuelson allows for in his 
formulation When, for example, are children members of the family, 
and thus codeterminers of the utility function, and when are they just 
arguments in the utility function determined for the family not including
them? The full internalization argued by T W. Schultz (19724) seems a 
necessary addition to the argument. Yet, for this to be true, what might be
called the “John Donne effect” must be extremely powerful. Casual
observation suggests that each individual’s concern for others diminishes
with distance in both time and space. Yet it may be true under certain
restrictive assumptions, as pointed out to me by Assaf Razin, that what
might be called pairwise intergenerational internalization (by which I
mean full internalization of the utilities of the next succeeding generation
by the immediately preceding generation) would lead to essentially the
same type of problem as that encountered in the discussions of optimal
growth with an infinite horizon.
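
A minimal worked sketch shows why pairwise internalization leads to an infinite-horizon problem. The additive form, the constant weight $\delta$, and the symbols $V_t$, $u$, and $c_t$ are assumptions made purely for illustration; they are one possible instance of the "restrictive assumptions" mentioned above, not the formulation attributed to Razin.

```latex
% Illustrative assumptions (not from the original paper):
%   V_t   : utility of generation t
%   c_t   : consumption of generation t
%   u     : a bounded one-generation utility function
%   delta : weight each generation puts on its successor, 0 < delta < 1
\[
  V_t = u(c_t) + \delta\, V_{t+1}, \qquad 0 < \delta < 1 .
\]
% Substituting repeatedly for V_{t+1}, V_{t+2}, ... gives, for any T >= 1,
\[
  V_0 = \sum_{t=0}^{T-1} \delta^{t} u(c_t) + \delta^{T} V_T .
\]
% If delta^T V_T -> 0 as T -> infinity (for example, because u is bounded),
\[
  V_0 = \sum_{t=0}^{\infty} \delta^{t} u(c_t),
\]
% which is the objective of a standard infinite-horizon optimal growth problem.
```

Although each generation cares directly only about the next, in this sketch the first generation behaves as if it weighed the consumption of all later generations.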

Morishima (1970, pp. 213-25) presents an extended discussion of some
of the more technical issues involved in formulating dynamic utility
functions and the conditions under which such functions can be reduced
to the sum of discounted utilities of each future generation or at each
future point in time, irrespective of the generation involved. In general,

these conditions are highly restrictive and closely related to the conditions




of Strotz (1959) and Gorman (1959) for strong separability of the utility
function. But as Koopmans (1967, p. 96) points out, the problem is really
in some sense an ethical one:

What is at issue is clearly an intertemporal distribution problem:
that of balancing the consumption levels of successive generations,
and of successive stages in the life-cycle of a given cohort
of contemporaries. The most pertinent decisions—individual,
corporate, or governmental—are those that determine investment
in physical capital, in human capital, and in research
and development. Investments in physical capital, if well made,
augment future consumption through an increase in future
capital-labor ratios. Investment in human capital raises the
quality of labor and, one hopes, of life. Successful research and
development augment future capital and labor inputs through
the development of better techniques of production.

Even regarded as a strictly behavioral model, pairwise internalization,
which seems central to the Chicago utility function, has most important
implications for the intergenerational transfers of material wealth and
human capital which Knight (1921, pp. 374-75) has so eloquently described
as central to the continuity of the social order.³ Some of these
implications bear on the issues discussed in the next section. Here, however,
it seems essential to point out the profound problem in, on the one
hand, internalizing all the family members’ satisfactions in one utility
function and, at the same time, using this same utility function to
determine the number and “quality” of the family members themselves.
Essentially the problem results from the condensation of a sequential,
dynamic set of decisions into a theory of choice based on the maximization
of a single, static, timeless utility function. In its most extreme form, the
issue of the conceptual adequacy of the approach arises in connection
with the application of the new home economics to household formation—
a formation which in an earlier, less aberrant, and nonconformist era
might have been described as “marriage.” Yet marriage, in some sense,
remains very much associated with procreation, and the act of marriage,
or at least of household formation, is the normal first step in the central
process of choice with which the new home economics deals. Where then
is the utility function? Can the entire process really be separated into two
distinct parts, what the econometrician would call a recursive system?
If a fundamental purpose of marriage is the procreation of a couple’s
own legitimate children, given a society’s definition of both marriage and

³ Indeed, there is good reason to suppose that the reason for many institutions of
society is precisely to ensure that the interests of future generations will be adequately
guarded by the present. A meaningful theory of intergenerational welfare comparisons
has not yet been developed, but such a theory is surely central to our understanding of
household decision making and its consequences.





legitimacy, clearly the process cannot be regarded as recursive.⁴ Again,
the static character of the analysis, while not necessarily limiting its
usefulness in an empirical sense, introduces a conceptual difficulty of a
high order.

The second element in the new home economics is the technology of
household production described by a production function or functions
and a list of the resources utilized in the processes involved. Typically,
following Becker (1965), the inputs are time, perhaps distinguishable by
household member (e.g., husband and wife), and market-purchasable
commodities. These inputs are used to produce within the household
the goods and services that in turn lead to satisfaction. In the simplest
form in the economic theory of fertility, two time inputs (the husband’s
and wife’s) and one general market-purchasable commodity are assumed
in the household technology to produce three household goods: child
numbers, child quality, and a general commodity called “other satisfactions”
(Willis 1973; DeTray 1973). In all analyses to date an important
further simplification of this basic technological structure has been made:
each good entering the utility function is assumed to be produced by a
separate independent production process. Jointness in production arises
not because of common overhead factors within the household, but
because the factor inputs available to the household are subject to overall
constraints.⁵ Willis (1973) and DeTray (1973), for example, both assume
that child quality per child and child numbers are produced by independent
production processes. Each factor separately enters the household
utility function in the Willis formulation. The one is simply multiplied
with the other to arrive at the final good, child services, which is assumed
to enter the household utility function in DeTray’s formulation. Both
child numbers and child quality are generally assumed to be mother’s
time-intensive.
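
The structure just described can be written out as a rough sketch. All symbols below are introduced here for illustration and are not the notation of Willis or DeTray; the time and income constraints are written in the familiar form associated with Becker (1965), which is an assumption about how the "overall constraints" would be specified.

```latex
% Illustrative notation (assumptions, not from the papers cited):
%   N, Q, S        : child numbers, quality per child, other satisfactions
%   t_{h,i}, t_{w,i}: husband's and wife's time used in activity i = N, Q, S
%   x_i            : market commodity used in activity i, bought at price p
%   L_h, L_w       : market work;  w_h, w_w : wage rates
%   T_h, T_w       : total time available;  V : nonwage income
% Separate, independent production processes:
\[
  N = f_N(t_{h,N}, t_{w,N}, x_N), \quad
  Q = f_Q(t_{h,Q}, t_{w,Q}, x_Q), \quad
  S = f_S(t_{h,S}, t_{w,S}, x_S).
\]
% Overall constraints linking the three processes:
\[
  \sum_{i} t_{h,i} + L_h = T_h, \qquad
  \sum_{i} t_{w,i} + L_w = T_w, \qquad
  p \sum_{i} x_i = w_h L_h + w_w L_w + V .
\]
% The two formulations as described in the text:
\[
  \text{Willis: } U = U(N, Q, S), \qquad
  \text{DeTray: } U = U(C, S), \quad C = N\,Q .
\]
```

In this sketch the only link among the three activities is that they draw on the same totals of time and income, which is the sense in which jointness arises from the constraints rather than from the technology.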

This formulation also neglects the sequential and essentially dynamic
character of investments in child quality. Casual observation suggests


⁴ Still, it may be helpful analytically to tackle the problem in a stepwise fashion (see
Becker 1973). In part of his work, Becker does consider the division of the gains from
marriage, the most important gains being the children, and this is, of course, a matter
closely related to the utility function maximized within the marriage.

⁵ A partial exception is the work of Michael Grossman (1971; 1972, pp. 74-79). In
his analysis Grossman does assume separate production processes but stipulates that the
entire amounts of certain factors available to the household must enter each production
function as one of its arguments. That is to say, in the standard analysis, if one could
separately identify the amount of one factor—say, wife’s time—used in a particular
activity, one could measure the effect of a unit increase in that input on the output of
that activity independently of the overall resource constraint on the wife’s time to the
household, whereas in Grossman’s formulation the overhead factor, say, the family house,
cannot in principle be allocated among the activities but must enter fully into each one.
Grossman’s work, however, still does lack generality in the respect discussed above,
since complementarity among final goods cannot be encompassed in his formulation
except through the rather Ptolemaic device of introducing some of the final outputs as
inputs in the production processes of some of the other outputs.




that in many societies and families, the eldest son is often the beneficiary
of the bulk of the investment in the human capital that takes place in the
second generation. Indeed, Sloan’s recent work (1971, pp. 31-32) suggests
that in poor countries the sex, as well as birth order, of a child may be an
important determinant of the infant’s survival probabilities, presumably
because of the differential investments in health and nutrition that take
place within the family rather than for purely genetic reasons.

The restrictiveness of the rather special assumption of separability of
productive activities has not been apparent in the basically two-good
(child services/other satisfactions) static models usually considered, for
what is ruled out is complementarity among different outputs in a multiproduct
context. As is well known, under conditions of variable proportions,
complementarity (in the sense that the output of one good could be
increased without decreasing the output of the other good and without
using additional resources) cannot occur if the production unit, in this
case the household, is at an optimum and using resources fully. In a
three-or-more-product case, however, it is possible for several of the
outputs to be complementary with each other, although substitutable
jointly against the rest (Hicks 1939, p. 92).⁶ In the latter case, holding
available resources constant, increasing the level of one of the outputs
optimally might well involve also increasing another, although clearly a
third output level would have to be decreased. One can easily see how
restrictive the elimination of complementarity could be in a context
involving several dimensions of child quality, for example, health and
physical development and intellectual achievement. In a dynamic context
such potential complementarities are of even greater significance
since there is, for example, some evidence that early underinvestment in
nutritional capital may substantially affect the productivity of later
investments in intellectual human capital (Berg 1973, pp. 9-12).⁷

The third element in the new home economics is a set of assumptions
about the way in which household resources, principally time, can be
transformed into market-purchasable commodities to be used in the
household production process. Strictly speaking, I suppose one could
consider this set of assumptions as part of the general technology of household
production and subsumed under the second element of the theory.
It is, however, better to treat the matter separately, since most of what is
involved concerns the terms upon which household members can enter
the labor market, the wages they can earn, and, somewhat secondarily,
the prices at which market commodities can be purchased. It is here that
the lack of dynamic character of the new home economics cuts most

⁶ In Hicks’s argument, substitute the word “outputs” for “factors” and vice versa, and
hold the factor of production (his “output”) fixed.

⁷ It would seem more straightforward and perhaps more desirable in future work to
deal with simple forms of multiproduct production functions. For an empirical example,
see Eads, Nerlove, and Raduchel (1969).




deeply into its potential implications for the central problems of fertility
and female labor-force participation. The timing and spacing of children,
the opportunities for part-time work and accumulation of lifetime labor
market experience, and choices as to the amount of education to be
invested in early in the life cycle all revolve heavily on the terms under
which women can participate in the labor market and thus share in the
transformation of the household’s time resources into market commodities.
The human capital literature (Ben-Porath 1967; Mincer 1970, 1973) is,
of course, replete with dynamic analyses of investment in human capital
over time and the life-cycle effects of these investments on earnings. But
little of this work has entered the more general framework of the “new
home economics,” particularly as this theory bears on decisions concerning
the numbers of children and their timing and spacing within a marriage
and the relation of these decisions to the accumulation of other forms of
assets. The work of Heckman (1971), Ghez and Becker (1972), and unpublished
work of Frank Stafford do, however, represent a notable
beginning of the extension of this part of theory into more dynamically
relevant realms.⁸ Yet we need to understand far more than we presently
do about why the labor market functions so differently for men than for
women, the role of institutional constraints, discrimination, and the
relation of these to women’s choices of occupation and timing of labor-force
participation. Once again, the simultaneities of the system severely
limit our ability to break out a single segment for proper analysis.

The fourth and final element in the new home economics is the resource
constraints facing the household in its production and optimization
decisions. Traditionally these constraints are divided into time (husband’s
and wife’s, although often the husband is assumed to devote full time to
the market) and “other” nonwage income. While it is universally recognized
that some elements of household production and consumption—
sleep and food, for example—are in fact inputs into the production of
time resources, little attention has thus far been paid to the quality of the
time resources and of other family resources—both genetic and material—
passed from one generation to the next. Arleen Leibowitz’s study of
Terman’s 1921 sample of California school children, reported in this

⁸ Much recent work, however, is dynamic only in the relatively trivial sense of involving
maximization over a number of periods of time without uncertainty concerning
the future values of exogenous constraints to future decisions. This is the sense
in which a dynamic programming problem can be turned into an ordinary programming
problem of much larger dimensions; apart from the computational difficulties thus
introduced, the chief defect of this approach is that it fails to lay bare the sequential
nature of the decision-making process in the way, for example, in which the recursive
solution to the general dynamic programming problem of Bellman does. In the absence
of uncertainty, however, nothing essential is lost by the straightforward multiperiod
extension of the basically static framework. When values of future constraints are uncertain,
then it does become essential to understand and incorporate the sequential
character of the decision process (see Nerlove 1972a). So far this has not been done in the
literature of the new home economics.




volume, bears importantly on the manner in which much human capital
is passed from one generation to another, especially to a child of preschool
age. The sample is very unrepresentative, but it is instructive, for the
investments in her child of a mother’s time and the quality of those
investments, as measured by her education, are found to affect appreciably
later measures of the child’s ability and future earning capacity.

The resource constraints facing the household, once it is formed, are,
of course, a product of the household formation itself and thus connected
with my earlier remarks on the inseparability of this complex of issues
from those of family choice and decision making. But, more important,
it is in this area that the complex issue of intergenerational transfers
figures most prominently. We live, after all,

in a world where individuals are born naked, destitute, helpless,
ignorant, and untrained, and must spend a third of their lives
in acquiring the prerequisites of a free contractual existence.
The fundamental fact about society as a going concern is that
it is made up of individuals who are born and die and give place
to others, and the fundamental fact about modern civilization
is that it is dependent upon the utilization of three great accumulating
funds of inheritance from the past, material goods and
appliances, knowledge and skill, and morale. Besides the torch
of life itself, the material wealth of the world, a technological
system of vast and increasing intricacy and the habituations
which fit men for social life must in some manner be carried
forward to new individuals born devoid of all these things as
older individuals pass out. The existing order, with the institutions
of the private family and private property (in self as well
as goods), inheritance and bequest and parental responsibility,
affords one way for securing more or less tolerable results in
grappling with this problem. [Knight 1921, pp. 374-75]

So the apparently simple theoretical construct of a time budget plus
other income constraint to the household conceals beneath its serene
and mathematically differentiable exterior the central problem of the
continuity of society itself.⁹

⁹ Closely connected to the matter of what is inherited from the past and transferred
from generation to generation is the problem of how individual tastes and preferences
are formed and how they may change over time. I have not mentioned this issue lest I
be excommunicated from the economics profession! It is virtually part of the definition
of what an economist is that he takes tastes as given, and I sometimes suspect that many
of us require all tastes to be identical and assume that all differences among individuals
arise from differences in the resource constraints those individuals face. Indeed, one of
the consequences of the conjectures and speculations presented in Section 3 is that the
tastes and preferences of at least different generations, if not of individuals within each
generation, could remain constant while the number of children per family declined
over time due to changes in the value of human time induced endogenously.





The theoretical structure of the new home economics has four main
elements: (1) a utility function with arguments which are not physical
commodities but home-produced bundles of attributes; (2) a household
production technology; (3) an external labor-market environment
providing the means for transforming household resources into market
commodities; and (4) a set of household resource constraints. By
themselves these elements are incapable of yielding a series of
well-defined implications about the main problems of household behavior
with which we are concerned. They provide only a framework within which
to think about these problems. Many special additional assumptions, some
of which have been mentioned, must be added to the framework to arrive
at empirically refutable propositions. Moreover, the nature of the
required additional specifications is intimately related to the
peculiarities of the particular bodies of data to which the new home
economics has been applied. These data range all the way from aggregate
time-series data covering long periods of time, to cross-sectional
census data for both large and small geographic regions at a point in
time and over time, to household and family data based on individual
interviews, with and without collection of retrospective information.
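
The four elements listed above can be written compactly. What follows is only a sketch in one common notation, which is an assumption of this illustration and not taken from the text: Z_i are the home-produced commodities, x_i and t_i the market goods and household time used to produce Z_i, p_i the goods prices, w the market wage, t_w the time sold in the labor market, V nonlabor income, and T total available time.

```latex
\begin{align*}
\max_{x_i,\,t_i,\,t_w}\quad & U(Z_1,\dots,Z_m)
    && \text{(1) utility over home-produced commodities} \\
\text{s.t.}\quad & Z_i = f_i(x_i, t_i), \quad i = 1,\dots,m,
    && \text{(2) household production technology} \\
& \sum_{i=1}^{m} p_i x_i = w\,t_w + V,
    && \text{(3) labor market converts time into goods} \\
& \sum_{i=1}^{m} t_i + t_w = T
    && \text{(4) time constraint.}
\end{align*}
% Substituting t_w = T - \sum_i t_i from (4) into (3) gives a single
% "full income" constraint:
\[
\sum_{i=1}^{m} \left( p_i x_i + w\,t_i \right) = w\,T + V .
\]
```

Written this way, the wage w appears as the price of every unit of household time, which is why further assumptions about the relative time intensity of different commodities are needed before the framework yields refutable propositions.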

If to the key simplification involved in the assumption of separable
independent productive processes within the household, one adds the
assumptions that young children are highly intensive of the mother’s
time in comparison with other activities within the home and older
children are less intensive, and that for institutional or biological reasons
the comparative advantage of the male partner in the acquisition of
market-purchasable commodities significantly exceeds that of the female
partner, a number of interesting implications of the theory emerge which
are tolerably well supported by the empirical evidence so far analyzed.
Setting aside for the moment the inadequacy of the observed market wage
to measure the cost of a nonworking woman’s time, the immediate
implication of the theory is that a rise in cost of mother’s time for the
family will cause a substitution away from time-intensive goods such as
children and toward those requiring more inputs of market-purchasable
commodities. Indeed, if we further assume momentarily that males are
completely specialized in market activities, changes in their wages
represent pure income effects for the family, and we do then observe for
families with working mothers a positive association of family size with
income and a negative association with female wage rates. To the extent
that education serves as a proxy for the relative costs of time which may
measure some of these costs less imperfectly, or at least in a fashion different
from market wages, similar differences are observed between the effects
of male and female educational attainments (Mincer 1963; Nerlove and
Schultz 1970; Willis 1973; DeTray 1973; Ben-Porath 1973).

The interpretation of many of the results is somewhat complicated
by the effects of marriage on the association of the educational
attainments of husbands and wives, and by the fact that wives and mothers for
the most part specialize entirely in home activities during only varying
fractions of their lifetimes. Moreover, the effects of female education seem
unexpectedly and puzzlingly nonlinear (Ben-Porath 1973). To illustrate
the nature of the difficulty involved, consider the following simple
analysis of the effects of assortative mating. We know that men of
higher-than-average levels of educational attainment tend to marry women of
higher-than-average levels of educational attainment. Suppose, however,
that tastes for children differ among the population; women with a high
taste for children and a low taste for market-related activities will tend in
general to seek and to receive less formal schooling than women with the
opposite preference. Presumably a man’s taste for children in a society
where nearly all men specialize in market-related activities will not be
reflected very greatly in his formal schooling. Some effect of the amount
of his formal schooling may occur through early marriage, but he will
certainly seek a mate with like preferences. Thus, men with a given level
of schooling with a high taste for children will tend to marry women with
less schooling than the average associated with the level these men have
achieved. If, as is common, a husband’s educational attainment is
associated primarily with permanent income effects within the household,
and his wife’s is associated primarily with the opportunity cost of time,
one can see that the negative effect of the opportunity cost of time as
measured this way on fertility will tend to be exaggerated, holding male
educational attainment constant, since the difference between the two
across couples partly reflects differences in tastes which are unobservable
and not included in the statistical analysis (see also T. W. Schultz [1973b,
p. S8] for a related comment).
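
The direction of this bias can be checked with a small simulation. The sketch below is illustrative only: the linear functional forms, the coefficient values, and the names (TRUE_B_WIFE, simulate_couples, ols_two_regressors) are all assumptions made for the example, not estimates from any of the studies cited. Each couple shares an unobserved taste for children; the wife's schooling rises with her husband's (assortative mating) and falls with that taste, and fertility is then regressed on the two schooling variables with taste omitted.

```python
import random

# Hypothetical "true" effects on number of children (illustrative values only).
TRUE_B_HUSBAND = 0.3   # husband's schooling: permanent-income effect
TRUE_B_WIFE = -0.4     # wife's schooling: opportunity cost of time
TRUE_B_TASTE = 0.5     # couple's unobserved taste for children


def simulate_couples(n_couples=20000, seed=0):
    """Draw (husband_schooling, wife_schooling, children) for each couple."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n_couples):
        taste = rng.gauss(0.0, 1.0)      # unobserved by the analyst
        husband = rng.gauss(0.0, 1.0)    # assumed unrelated to taste
        # Assortative mating (+0.5 * husband) and less schooling for women
        # with a high taste for children (-0.6 * taste).
        wife = 0.5 * husband - 0.6 * taste + rng.gauss(0.0, 1.0)
        children = (TRUE_B_HUSBAND * husband + TRUE_B_WIFE * wife
                    + TRUE_B_TASTE * taste + rng.gauss(0.0, 1.0))
        rows.append((husband, wife, children))
    return rows


def ols_two_regressors(rows):
    """OLS slopes of children on husband's and wife's schooling (taste omitted)."""
    n = len(rows)
    mean_h = sum(r[0] for r in rows) / n
    mean_w = sum(r[1] for r in rows) / n
    mean_y = sum(r[2] for r in rows) / n
    s_hh = sum((r[0] - mean_h) ** 2 for r in rows)
    s_ww = sum((r[1] - mean_w) ** 2 for r in rows)
    s_hw = sum((r[0] - mean_h) * (r[1] - mean_w) for r in rows)
    s_hy = sum((r[0] - mean_h) * (r[2] - mean_y) for r in rows)
    s_wy = sum((r[1] - mean_w) * (r[2] - mean_y) for r in rows)
    # Solve the 2x2 normal equations by Cramer's rule.
    det = s_hh * s_ww - s_hw ** 2
    b_husband = (s_ww * s_hy - s_hw * s_wy) / det
    b_wife = (s_hh * s_wy - s_hw * s_hy) / det
    return b_husband, b_wife


if __name__ == "__main__":
    b_h, b_w = ols_two_regressors(simulate_couples())
    print(f"husband: true {TRUE_B_HUSBAND:+.2f}, estimated {b_h:+.2f}")
    print(f"wife:    true {TRUE_B_WIFE:+.2f}, estimated {b_w:+.2f}")
```

Under these assumed parameters the estimated coefficient on the wife's schooling is more negative than the assumed true value, by about 0.5 x 0.6 / (0.36 + 1), or roughly 0.22 in large samples. That is the exaggeration of the time-cost effect described above; its size depends entirely on the illustrative values chosen.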

Turning the analysis around, we can ask what implications the new
home economics has for female labor-force participation. The new home
economics predicts what is perhaps the obvious: the composition of a
woman’s family is strongly associated with her labor-force participation.
Typically, the number of a family’s children under the age of 18 and the
age of the youngest child are both strong predictors of a woman’s
labor-force participation. These facts about family composition have been
interpreted by Cain (1966), for example, as measures of the opportunity
value of a mother’s time in the home. Gronau (1973) attempted a detailed
and sophisticated analysis of just this proposition, using a subsample of
the 1/1000 sample from the 1960 United States Census; moreover, he
examined the interaction of educational attainment with family
composition. Others (Smith 1972; Leibowitz 1972) have documented that
highly educated married women participate to a greater extent in the
labor force and work more hours when they do work than married women
with less schooling. Married women also tend to withdraw
from the labor force when they have children; this is an implication of
the new home economics, on the assumption that children are more
female time-intensive than other commodities produced within the
home, but the rates at which women with different educational
attainments withdraw are not the same. During the child-rearing years, more
highly educated women reallocate more hours to household production
than do women with less education. Ben-Porath’s (1973) finding of a
U-shaped relation between education and labor-force participation for
Israeli women with young children strongly suggests differences in the
effects of education at different levels on the relative efficiencies of home
and market production and also interactions between female education
and child quality. But all of this has been insufficiently explored within
the presently existing framework of the new home economics. As Sweet
(1968), Nerlove and Schultz (1970), and Hall (1973) have emphasized,
educational investments, labor-force participation, and fertility must be
viewed, at least partly, as simultaneously determined choices. This is
surely one of the most important implications of the new home economics
and only partly negated by the latter’s currently static character.

Finally, if we regard, as I think we must, the grand problem of the new
home economics as the explanation of the demographic transition, that is,
the “economic and social processes and family behavior that accounts for
the marked decline from very high birth and death rates to modern very
low birth and death rates” (T. W. Schultz 1973a, p. S4), the new home
economics does have some insights, albeit limited, to offer (O’Hara 1972).
Clearly, a high probability of child mortality affects the costs of achieving
a given family size, that is, the number of children surviving to a given
age. If it is assumed that on the whole parents achieve a greater
(discounted) sum of satisfactions the longer a child survives and if declines in
mortality result in greater relative increases in the conditional
probabilities of survival from earlier ages to successively older ages, declines in
mortality should tend, according to the new home economics, to generate
a greater demand for children. This need not lead to an increase in
births, however, since such declines in mortality lower the cost of child
quality relative to the cost of numbers of children. Of course, the net effect
must depend on the technology of the production of child numbers and
child quality as well as on the relative importance of these in the utility
function. These factors, of course, may vary substantially from time to
time, culture to culture, and place to place. The elucidation of such effects,
however, must surely constitute one of the central challenges to the
empirical application of the new home economics. To explore such effects
fully, however, requires that the household decision-making process be
accommodated in a model of economic growth and development, a
subject to which we now turn.
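
The first step of this argument, that child mortality affects the cost of achieving a given family size, can be illustrated with a deliberately crude calculation. The symbols are assumptions introduced only for this sketch: s is the probability that a birth survives to the given age, c is the cost incurred per birth, and N is the desired number of surviving children. Costs incurred after birth and the timing of deaths are ignored.

```latex
\[
E[\text{births needed}] = \frac{N}{s},
\qquad
\text{expected cost per surviving child} = \frac{c}{s},
\qquad
\frac{d}{ds}\!\left(\frac{c}{s}\right) = -\frac{c}{s^{2}} < 0 .
\]
% Example with assumed numbers: if s rises from 0.5 to 0.8, the expected
% cost per surviving child falls from 2c to 1.25c, and the expected number
% of births needed for N = 3 survivors falls from 6 to 3.75.
```

Even in this crude form the calculation shows why a fall in mortality can raise the demand for surviving children while lowering the number of births. The relative cost of quality, which the argument above also turns on, is not captured here.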




3. Household Decision Making and Economic Growth: Speculations and Conjectures

“Would you tell me, please, which way I ought to go from 
here?” 

“That depends a good deal on where you want to get to,”
said the Cat. [Alice's Adventures in Wonderland]

In his classic paper on “Diminishing Returns from Investment,”
Knight (1944) pointed out that “if new investment can be freely directed
to all uses, i.e., embodied in all types of productive agents indifferently, it
will not be subject to diminishing returns” (p. 33). Moreover, he stressed,
as did Marshall before him, the concept of capital in human beings, and
that “in the production of laborers the matter of ‘quality’ is far more
important than that of quantity in the crude sense of numbers” (p. 35).
Although investment “freely directed” might not be subject to diminishing
returns, certainly, under static circumstances, continued investment in
any one particular direction ought eventually to result in a declining rate
of return. Yet, as T. W. Schultz (1973b) has emphasized, the rate of
return appears to have diminished little, if at all, in response to a high
level and even accelerated pace of investment in human capital, and,
indeed, it may have actually risen (T. W. Schultz 1971, p. 173).

We do not, of course, have any really accurate measure of the extent of
investment in human capital as compared with investment in nonhuman
capital and in the stock of knowledge through investment in research
and development. Yet, there are a number of clues which suggest that the
capital stock invested in human beings, even on a per capita basis, has
been a steadily growing portion of the total capital stock. T. W. Schultz
(1961, p. 73) suggests a rise of the value of the stock of educational human
capital embodied in the stock of labor of persons age 14 and older from
18 percent in 1900 of the total educational and physical capital to 30
percent in 1957. These estimates do not include on-the-job training or
investments in better health and nutrition. In terms of gross capital
formation, Kuznets (1966, p. 243) calculates, on the basis of Schultz’s
earlier work, a rise in the share of investments in formal education alone
from “about 9% in 1900 to over 38% in the 1950s.” Moreover, for
Western countries as a whole, Kuznets (1971) calculates the share of
labor has risen from 55 to 75 percent of national income over the same
period. These facts, meager as they may be, suggest two significant
questions which are germane to the issue with which this paper began, namely,
how can the new home economics be integrated into a general theory of
economic growth and development in a manner which has some hope,
at least, of bearing on the grand question of the demographic transition?
These two questions are the following.




First, what accounts for the failure of the rate of return to investments
in human capital, even counting educational investments, to fall, despite
a high and accelerating rate of investment in this form of capital relative
to other forms? In other words, why does there appear to be a persistent
disequilibrium among these rates of return?

Second, quite apart from the possibility of disequilibrium rates of
return (even if they are in equilibrium), what effect will increasing human
capital investment per capita have on the allocation of resources within
the household and what, if any, repercussions will it have for the rate of
growth of population and labor force? A number of speculations and
conjectures on the answers to these questions follow; they emphasize
the role of the increasing value of human time over time and its relation,
which is reciprocal, to the increasing level of investment in human
beings.

Razin (1969, 1972) showed how, under certain circumstances, the ratio
of human to total capital per capita would increase along the optimal
growth path of an economy experiencing technical progress. As indicated
earlier, T. W. Schultz has emphasized this aspect of the persistent failure
of rates of return to investments in human capital to decline. The demand
for skills and knowledge embodied in human capital does not decline
because of additional investments in the stock of useful knowledge and
technique (technological change) which require the continual adaptation
and adjustment of the human agent to utilize efficiently this augmented
stock and seek out the new sources of investment opportunities which
maintain the growth process. But even in the absence of a persistent
disequilibrium created by the demand for human capital, it is possible
that the rates of return to such investments would fail to fall over time, or
fall only slowly, in relation to the rates of return to other forms of
investment because of endogenously changing relative cost of investment,
that is, changes on the supply side.

One of the most important consequences of the growing “quality” of
human beings as reflected in the increased stock of human capital per
capita, as pointed out by T. W. Schultz (1973a), is the increasing value
of human time per unit of such time. Many of the consequences of the
increasing value of time over time are amusingly explored by Linder
(1970) in his penetrating study of The Harried Leisure Class. Yet Linder
and Schultz fail to note the important link which may exist between the
increasing value of human time, due presumably to investment in human
capital as well as to investment in other forms of capital and in
technological change, and the terms on which investment in human capital takes
place. If one assumes, as I think plausible, that children (as regards both
quality and quantity) are time-intensive as compared with other goods
produced within the home, it follows that unless the increasing investment
of human capital increases the marginal productivity of a unit of time in
the care and rearing of children within the home in an offsetting fashion,
increases in the value of time will lead to a shift away from children to
less time-intensive activities. To be sure, such a substitution effect may be
offset by a strong income effect, but there are still further grounds to
suppose that both substitution and income effects will tend to lead to an
increase in child quality rather than child numbers.

In a recent note (1973), Becker and Lewis explored the consequences
of a simple model of the relation between the quality and quantity of a
good (in this case, children) entering the utility function, under the
assumption that quality per unit is the same for all units. Increases in
either quantity or quality cause the shadow price of the other to rise,
and if, as is plausible, the income elasticity for quality is greater than that
for quantity, the resulting increase in the shadow price of numbers would
reduce the apparent income elasticity for quantity, perhaps even to a
negative level. Thus, the income effect of the increasing value of human
time should, under these circumstances, lead to a substitution of quality
for quantity of children, in addition to leading to a substitution away
from children altogether to less time-intensive goods. This rests, of course,
on the premise that the productivity of time in rearing children is
unchanged or, at least, not greatly increased. Moreover, the pure price
effects are likely to work in the same direction, since it is plausible that
quantity is more time-intensive than quality of children per unit of equal
quality time and that, indeed, the shadow price of quality is likely to be
reduced by increases in the quality of the mother’s time input (I refer
again to Arleen Leibowitz’s paper in this volume).
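
The interaction between the two shadow prices can be seen in a stripped-down version of such a model. The notation here is an assumption of this sketch rather than a quotation from the note: n is the number of children, q the quality of each child (the same for all), z all other goods, I income, pi the price of a unit of child quality, and pi_z the price of other goods.

```latex
\begin{align*}
\max_{n,\,q,\,z}\quad & U(n, q, z) \\
\text{s.t.}\quad & I = \pi\, n q + \pi_z z .
\end{align*}
% First-order conditions, with \lambda the marginal utility of income:
\begin{align*}
U_n &= \lambda\, \pi q \equiv \lambda\, p_n
    && \text{shadow price of numbers rises with } q, \\
U_q &= \lambda\, \pi n \equiv \lambda\, p_q
    && \text{shadow price of quality rises with } n, \\
U_z &= \lambda\, \pi_z .
\end{align*}
```

Because quantity and quality enter the budget constraint as a product, each acts as the price of the other. A rise in income that raises q strongly therefore raises the shadow price of numbers, and this is what can pull the observed income elasticity of quantity down, possibly below zero.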

The investments in child quality referred to earlier take two major
forms: (1) sound nutrition and health care, and (2) education, skills, or
attitudes conducive to acquisition of further education and skills.

Good nutrition and health care increase youngsters’ chances of survival
and may also affect their ability to absorb future investments in intellectual
capital. To the extent that such investments increase the life span,
particularly the span of years over which a person can be economically
active, such an increase in quality will raise the return to investments in
human capital which sons and daughters may later wish to make in
themselves. To the extent that better health and nutrition result in a
reduction in child mortality, they increase the satisfactions accruing to
parents from other forms of investment which also raise child quality,
for the returns to these investments may then be expected to be enjoyed
over a longer period of time on average. Increases in longevity, particularly
of an individual’s economically productive years, increase the amount of
human time available without increasing population; such an increase
would tend by itself to lower the value of time per unit, but, as we know,
most of the effects of better health care and nutrition occur in childhood
and enhance the quality of a unit of time in later years more than
increasing the number of children. On net balance, therefore, I would
conjecture that better health and nutrition lower the costs of further
investments in human capital relative to those in other forms of capital and
increase the returns therefrom.

The second main form which an increase in child quality may take is,
as I have stated, through investment in the form of education, skills, or
attitudes conducive to later acquisition of further education or skills.
Much investment in human capital of this type tends to be time-intensive
in the preschool years, although the productivity of a unit of a mother’s
time, as remarked, may be especially enhanced by a greater stock of
human capital embodied in her, so it is not necessarily true that over time,
as the result of the increasing value of human time, substitution will tend
to occur away from this form of investment. Nonetheless, it is in this area
that we might expect some induced “technological innovations” which
could economize on a mother’s scarce time. Nursery schools, day-care
centers, and the proliferation of “educational” toys are perhaps examples.
On the whole, then, I think we may conclude that increases in quality of
children relative to their numbers take the form of investments in human
capital which ultimately have the effect of raising the value of time per
unit in the economically active years of adulthood.

For reasons which I feel certain we do not fully understand, but which
are due in part to the presence of children’s utilities in the utility function
of the family to which they belong, parents do desire to bequeath a stock
of capital to their children. Since the stock of capital, material and
intangible, human and nonhuman, is growing per capita in Western
economies, one must assume that parents desire to pass along more than
that which they received from their parents, or that institutions in the
economy function in such a way as to induce this outcome. Irrespective
of the motivation, however, the increasing value of human time must
have an effect on the form in which this capital is passed on. As long as
rates of return to investments in human capital remain above, or fall
more slowly than, the rates of return to investments in other forms of
capital, parents will be induced to bequeath a greater part in the form of
human capital. Thus, the tendency toward increasing quality of children
will be intensified by the bequest motive, despite the opposite tendency,
resulting from the increasing cost of time, to invest in bequests which are
less time-intensive. But as rates of return tend to equality over time (if
they ever do), parents should tend to bequeath less in the form of human
capital and more in the form of financial and physical capital.
Nonetheless, as long as investment in human capital occurs, the value of a unit of
human time will continue to rise with increases in the stock of capital
per capita, reinforcing the tendency to fewer children of ever-higher
quality. Substitution will occur in favor of fewer children of higher quality
and perhaps eventually against both quality and quantity of children in
favor of commodities and knowledge. The “facts” cited at the beginning
of this section, suggesting that an increasing portion of total capital formation
in this century has occurred in the form of human capital, however,
indicate that we may be far from the point at which such substitution
begins to take place against children, quality, and quantity combined.

The outlines of a revised Malthusian model begin to emerge, albeit
dimly, from the foregoing conjectures and speculations. In this model,
the value of human time and changes in that value over time are pivotal,
and the limitations imposed by natural resources are mitigated, if not
eliminated, by technological progress and increases in the stock of
knowledge and of capital, both human and nonhuman. The main link between
household and economy is the value of human time; the increased value
of human time results in fewer children per household, with each child
embodying greater investments in human capital which in turn result
in lower mortality and greater productivity in the economically active
years. Such greater productivity in turn further raises both the value of a
unit of time and income in the subsequent generation and enables
persons of that generation to make efficient use of new knowledge and new
physical capital. Eventually, rates of return to investments in physical
capital, new knowledge, and human capital may begin to equalize, but
as long as investment occurs which increases the amount of human capital
per individual, the value of a unit of human time must continue to
increase. It is not possible to say whether the diminishing ability of a human
being to absorb such investment would eventually stabilize the number of
children per household and at what level, given the satisfactions parents
obtain from numbers of children as well as their quality. Nonetheless,
over time the model does predict in rough qualitative fashion declining
rates of population growth (perhaps eventually zero rates or even negative
rates for a time) and declining rates of infant mortality. These are the
main features of the demographic transition.

Much remains to be done if this rough and speculative outline is to be
translated into a true integration of the new home economics with the
modern theory of economic growth, particularly as the former evolves
along more dynamic and empirically relevant lines. First, the model must
be mathematized (and it is clear that there are many forms in which this
may be accomplished) so that the crucial parameters and behavioral
relations may be isolated. These must then be studied empirically, for
without quantitative knowledge of the parameters and technologies
involved and the key behavioral relations the course of the important
variables over time cannot be predicted or compared with past behavior.
The role of the growth theorist is apparent in the first of these tasks and
those of the econometrician and economic historian in the second. Finally,
the conceptual foundations of the new home economics, particularly with
respect to intergenerational transfers, must be clarified if we are to
understand the extent to which actual growth departs from the optimal
path, the reasons for such departure, and whether collective action is
either desirable or necessary to correct such departures.

I hope this paper represents at least a modest beginning in showing 
where it is we want to get to, if not a set of directions on which way we 
ought to go from here. 



Comment 


Zvi Griliches 

Harvard University


I shall divide my comments into two parts, corresponding to Nerlove’s
discussion of the “new home economics” and the stylized “facts” that
appear to be building blocks for a future growth model. I am very much
in agreement with Nerlove’s exposition of the “new home economics,”
but I would like to amplify a bit two of his implied criticisms of the state
of this theory.

One of the major working assumptions of the theory is the existence of a
common family utility function. In his paper in this volume, Gary Becker
shows that by introducing the notion of “caring” (or the interdependence
of utilities) one can show that the family will behave as if it has a common
utility function. This will not do, I think, for the analysis of the empirical
phenomena that we are really interested in. What parents care for is not
the utility that their children receive, but the utility function that the
children have and the resources that they control. Parents care about the
consumption basket of their children; they have preferences over actual
actions, not just their subjective outcomes. Much of the within-family
conflict comes from different evaluations of the same consumption
opportunities. For example, since many families appear to subsidize their
children’s higher education, and since the marginal valuation of family
funds may not be the same for different family members, it may pay for
the young both to pursue higher education further than appears warranted
on straight rate-of-return calculations (somebody else is paying the cost),
and to work less hard at it than the donors of the money would have liked
them to do. In any case, a common utility function cannot explain either
the growth of households or their dissolution, or indicate the point at
which it pays for the young to opt out of it. It is probably also not necessary
for the analysis of such questions, as is indicated by Becker’s model of the
marriage market.





The main shortcoming of the “new home economics” for the analysis
of fertility decisions is that it assumes too little. The basic postulates are
that children are goods, that all goods are subject to two constraints,
time and money, and that children are relatively time-intensive goods.
But this does not distinguish children from hi-fi sets! Adding the
observation that children tend to use mother’s time more than father’s does not
get us much further. Moreover, given the aims of the theory, this should
probably come out as a conclusion, rather than be assumed from the
beginning. In any case, the theory focuses on children as a good whose
costs have a relatively high time component (particularly mother’s time).
Thus, any changes that have occurred have to be explained in terms of
changes in total marginal cost (money and time) of this good (children)
relative to the cost of all other consumption goods. I do not think that this
is enough. The theory currently provides one explanation (among several
competing ones) of the secular decline in fertility, attributing it to the
rising cost of time, but it fails to provide any convincing explanation for
the major fertility cycles that we have experienced. As the discussion at
this conference made clear, we have no economic explanation yet within
the framework of this theory either for the baby boom of the 1950s or the
current rather sharp decline in fertility.

My belief is that if we want to study the demand for children, we have
to put more content into the theory and start asking why do people want
to have children, what are the returns and not just the costs of this activity?
In terms of the theoretical framework used in this volume, I am looking
for shifters of the utility function, or, alternatively, for factors that change
the implicit household production function. If we are studying the demand
for children rather than for hi-fi sets, we have to ask ourselves what it is
about children that distinguishes them from other time-intensive durable
goods. Perhaps we should go back to some low-level discussions about the
“motives” for having children, along the lines of the discussion of the
demand for money in older textbooks. I would distinguish at least three
interdependent motives: (1) economic security (current labor and old-age
provisions), (2) the production of reciprocal caring, and (3) an attempt at
immortality via one’s offspring.

The last motive explains the attempts to impose an image on the
children (an improved version of our own) and the interest in the actual
basket of consumption rather than just the summary state (utility) of their
well-being. The returns in terms of the first two motives have been
declining secularly, perhaps even faster than the rise in costs implied by the
cost-of-time hypothesis. It is my guess that part of the recent sharp decline
in fertility stems from the clear recognition on the part of the current
childbearing cohort of the relatively low rate of return experienced by
their own parents.




Turning to the last part of Nerlove’s paper, I want to take issue with
the two stylized facts that appear to be empirical building blocks for his
growth model: the constancy of the rate of return to schooling and a
persistent disequilibrium reflected in higher rates of return to the
investment in human as compared with physical capital. Both Finis Welch
and I have to take some blame for popularizing the first “fact” at a time
when it was beginning to cease being a “fact.” The persistence of relatively
high rates of return to schooling through the 1950s and 1960s in the face
of rising schooling levels throughout the economy was probably the result
of an accidental constellation of forces rather than the expression of an
underlying constancy in the economic mechanism. The higher education
boom was sustained by three forces: (1) the first round of cohorts to be
educated after World War II was a relatively small and declining fraction
of the population, (2) the post-World War II baby boom increased
greatly the demand for teachers at all levels, and (3) at about the same
time the government superimposed on all of this a
space-defense-research-and-development boom, heavily human capital-intensive, resulting in a
scramble for young, educated talent. Unfortunately, these fortuitous
influences have run their course. The educational system has probably
reached its longer-run equilibrium level, if it has not overshot it. The
space research and development boom is over now, at a time when there
is, and will be for the next 5 years or so, an annual wave of an additional
one million highly educated workers arriving at the doors of the full-time
labor force. The rates of return to schooling have already started falling
and will probably fall quite a bit further before supply response catches
up with them.

Also, I am not sure that I understand the disequilibrium discussion in
Nerlove’s paper. First, I know of no study that shows that relevant rates
of return, computed in comparable terms, are significantly higher for
schooling than for physical investment. The fact that human capital may
have been growing faster than physical is no evidence for this proposition,
and it does not require it.



Combined References 


Aigner, D. “An Appropriate Econometric Framework for Estimating a
Labor Supply Function from the SEO File.” Madison: Soc. Systems
Res. Inst., Univ. Wisconsin, 1971.

Alchian, Armen A., and Demsetz, Harold. “Production, Information
Costs, and Economic Organization.” A.E.R. 62 (December 1972):
777-95.

Andersen, Ronald, and Anderson, Odin W. A Decade of Health Services:
Social Survey Trends in Use and Expenditures. Chicago: Univ. Chicago
Press, 1967.

Aoki, Hisao. Selected Statistics concerning Fertility Regulation in Japan. (In
Japanese.) Tokyo: Ministry Health and Welfare, 1967.

-. “A General View of Fertility and Its Regulation in Japan.” (In
Japanese.) Jinko Mondai Kenkyu 114 (April 1970): 5-20.

Aoki, Hisao, and Nakano, Eiko. Summary of the Fertility Surveys in 1940,
1952, 1957, and 1962. (In Japanese.) Tokyo: Ministry Health and
Welfare, 1967.

Arrington, Leonard J. Great Basin Kingdom. Lincoln: Univ. Nebraska
Press, 1958.

Ashenfelter, Orley A., and Mooney, Joseph D. “Graduate Education,
Ability and Earnings.” Rev. Econ. and Statis. 50 (February 1968): 78-86.

Baumol, William J. Economic Dynamics: An Introduction. 3d ed. New York:
Macmillan, 1970.

Becker, Gary S. “An Economic Analysis of Fertility.” In Demographic and
Economic Change in Developed Countries. Universities-National Bureau
Conference Series, no. 11. Princeton, N.J.: Princeton Univ. Press, 1960.

-. Human Capital. New York: Columbia Univ. Press (for Nat. Bur.
Econ. Res.), 1964.

-. “A Theory of the Allocation of Time.” Econ. J. 75 (September
1965): 493-517.

-. Human Capital and the Personal Distribution of Income: An Analytical
Approach. Woytinsky Lecture no. 1. Ann Arbor: Inst. Public Admin.,
Univ. Michigan, 1967.

-. “A Theory of Social Interactions.” Unpublished paper. Chicago:
Univ. Chicago, September 1969.

-. Economic Theory. New York: Knopf, 1971.

-. “A Theory of Marriage: Part I.” J.P.E. 81 (July/August 1973):
813-46.

Becker, Gary S., and Chiswick, Barry R. “Education and the Distribution
of Earnings.” A.E.R. 56 (May 1966): 358-69.

Becker, Gary S., and Lewis, H. Gregg. “On the Interaction between the
Quantity and Quality of Children.” J.P.E. 81, no. 2, suppl.
(March/April 1973): S279-88.

Ben-Porath, Yoram. “The Production of Human Capital and the Life
Cycle of Earnings.” J.P.E. 75 (August 1967): 352-65.

-. Fertility in Israel, an Economist’s Interpretation: Differentials and
Trends, 1950-1970. RM-5981-FF. Santa Monica, Calif.: RAND Corp.,
August 1970. (a)

-. “On the Association between Fertility and Education.”
Mimeographed. Research Report no. 20. Jerusalem: Hebrew Univ.
Jerusalem, 1970. (b)

-. “Economic Analysis of Fertility in Israel: Point and
Counterpoint.” J.P.E. 81, no. 2, suppl. (March/April 1973): S202-33.

Berg, Alan. The Nutrition Factor: Its Role in National Development.
Washington: Brookings Inst., 1973.

Blau, Peter M., and Duncan, Otis Dudley. The American Occupational
Structure. New York: Wiley, 1967.

Bowen, W., and Finegan, T. The Economics of Labor Force Participation.
Princeton, N.J.: Princeton Univ. Press, 1969.

Bowles, Samuel. “Schooling and Inequality from Generation to
Generation.” J.P.E. 80, no. 3, suppl. (May/June 1972): S219-51.

Box, G. P., and Cox, D. “An Analysis of Transformations.” J. Royal
Statis. Soc. 26 (1964): 211-43.

Box, G. P., and Tidwell, P. “Transformations of Independent Variables.”
Technometrics 4 (1962): 531-42.

Broel-Plateris, Alexander. “Marriage Disruption and Divorce Law.”
Ph.D. dissertation, Univ. Chicago, 1961.

Cain, Glen G. Married Women in the Labor Force: An Economic Analysis.
Chicago: Univ. Chicago Press, 1966.

Cavalli-Sforza, L. L., and Bodmer, W. The Genetics of Human Populations.
San Francisco: Freeman, 1971.

Cheung, Steven N. S. “The Enforcement of Property Rights in Children,
and the Marriage Contract.” Econ. J. 82 (June 1972): 641-57.




Chiswick, Barry R. Income Inequality: Regional Analysis within a Human
Capital Framework. New York: Nat. Bur. Econ. Res., 1973.

Chiswick, Barry R., and Mincer, Jacob. “Time-Series Changes in Personal
Income Inequality in the United States from 1939, with Projections
to 1985.” J.P.E. 80, no. 3, suppl. (May/June 1972): S34-66.

Chow, Gregory C. “Tests of Equality between Sets of Coefficients in
Two Linear Regressions.” Econometrica 28 (July 1960): 591-605.

Coale, A. J., and McNeil, D. R. “The Distribution by Age of the
Frequency of First Marriage in a Female Cohort.” J. American Statis.
Assoc. 67 (December 1972): 743.

De Tray, Dennis N. The Interaction between Parent Investment in Children and
Family Size: An Economic Analysis. R-1003-RF. Santa Monica, Calif.:
RAND Corp., May 1972.

-. “Child Quality and the Demand for Children.” J.P.E. 81, no. 2,
suppl. (March/April 1973): S70-95.

Dorjahn, V. R. “The Factor of Polygyny in African Demography.”
In Continuity and Change in African Cultures, edited by W. R. Bascom and
M. J. Herskovits. Chicago: Univ. Chicago Press, 1959.

Dugan, D. J. “The Impact of Parental and Educational Investment
upon School Achievement.” In Proceedings of the Social Statistics
Section, American Statistical Association, 1969. Washington: American
Statis. Assoc., 1969.

Duncan, Otis Dudley; Featherman, David L.; and Duncan, Beverly.
Socioeconomic Background and Achievement. New York: Seminar, 1972.

Eads, George; Nerlove, Marc; and Raduchel, William. “A Long-Run
Cost Function for the Local Service Airline Industry: An Experiment
in Nonlinear Estimation.” Rev. Econ. and Statis. 51 (August 1969):
258-70.

Ehrlich, Isaac. “Participation in Illegitimate Activities: An Economic
Analysis.” Ph.D. dissertation, Columbia Univ., 1970.

Fane, George. “The Productive Value of Education in Agriculture in the
U.S.” Ph.D. dissertation, Harvard Univ., 1972.

Fisher, R. A. The Genetical Theory of Natural Selection. 2d ed. New York:
Dover, 1958.

Freiden, Alan. “A Model of Marriage and Fertility.” Ph.D. dissertation,
Univ. Chicago, 1972.

Fuchs, Victor. Differentials in Hourly Earnings by Region and City Size, 1959.
Occasional Paper no. 101. New York: Nat. Bur. Econ. Res., 1967.

Gardner, Bruce. “Economic Aspects of the Fertility of Rural-Farm and
Urban Women.” Southern Econ. J. 38 (April 1972): 518-24.

Ghez, Gilbert R., and Becker, Gary S. “The Allocation of Time and
Goods over the Life Cycle.” Report no. 7,217. Chicago: Center Math.
Studies Bus. and Econ., Univ. Chicago, April 1972.




Goldfeld, S., and Quandt, R. Nonlinear Methods in Econometrics.
Amsterdam: North-Holland, 1972.

Goode, William J. After Divorce. Glencoe, Ill.: Free Press, 1956.

-. “Marital Satisfaction and Instability: A Cross-cultural Class
Analysis of Divorce Rates.” Internat. Soc. Sci. J. 14, no. 3 (1962):
507-26.

-. World Revolution and Family Patterns. New York: Free Press, 1963.

-. Explorations in Social Theory. New York: Oxford, 1973.

-. The Celebration of Heroes: Prestige as a Control System (forthcoming).

Gorman, W. M. “Separability and Aggregation.” Econometrica 27 (July
1959): 469-81.

Griliches, Zvi, and Mason, William M. “Education, Income, and
Ability.” J.P.E. 80, no. 3, suppl. (May/June 1972): S74-103.

Gronau, Reuben. “The Intra-Family Allocation of Time: The Value of
the Housewives’ Time.” Research Report no. 28. Jerusalem: Hebrew
Univ. Jerusalem, June 1971.

-. “The Wage Rates of Women—Selectivity Bias.” Mimeographed.
New York: Nat. Bur. Econ. Res., 1972.

-. “The Effect of Children on the Housewife’s Value of Time.”
J.P.E. 81, no. 2, suppl. (March/April 1973): S168-99.

Grossman, Michael. “The Economics of Joint Production in the
Household.” Report no. 7,145. Chicago: Center Math. Studies Bus. and
Econ., Univ. Chicago, September 1971.

-. The Demand for Health: A Theoretical and Empirical Investigation.
Occasional Paper no. 119. New York: Nat. Bur. Econ. Res., 1972.

Hall, Robert E. “Comment.” J.P.E. 81, no. 2, suppl. (March/April
1973): S200-201.

Hansen, W. Lee; Weisbrod, Burton A.; and Scanlon, William J.
“Schooling and Earnings of Low Achievers.” A.E.R. 60 (June 1970):
409-18.

Harberger, Arnold. “Taxation, Resource Allocation, and Welfare.”
In The Role of Direct and Indirect Taxes in the Federal Revenue System.
Princeton, N.J.: Princeton Univ. Press (for Brookings Inst. and Nat.
Bur. Econ. Res.), 1964.

Hause, John. “Earnings Profile: Ability and Schooling.” J.P.E. 80,
no. 3, suppl. (May/June 1972): S108-38.

Heckman, James J. “Three Essays on the Supply of Labor and the
Demand for Market Goods.” Ph.D. dissertation, Princeton Univ.,
1971.

-. “Shadow Prices, Market Wages, and Labor Supply.” Paper
presented at the Econometric Society Meetings, Toronto, December
1972. (Forthcoming in Econometrica.)

Hicks, J. R. Value and Capital. Oxford: Clarendon, 1939. (2d ed. 1946.)





Hill, C. Russell, and Stafford, Frank P. “The Allocation of Time to
Preschool Children and Educational Opportunity.” Manuscript.
Ann Arbor: Dept. Econ., Univ. Michigan, May 1973. (Forthcoming
in J. Human Resources.)

Hitotsubashi University Institute of Economic Research. “National
Income Study Group Materials Cl, CIS.” Mimeographed. (In
Japanese.) Tokyo: Hitotsubashi Univ. Inst. Econ. Res., 1968.

Honda, Tatsuo. “Summary of the Third Fertility Survey in 1957.” (In
Japanese.) Jinko Mondai Kenkyu 77 (August 1959): 1-24.

Honjo, Eijiro. Nihon jinko shi [Japanese population history]. Tokyo:
Nihon Hyoron-sha, 1941.

Hotelling, Harold. “Edgeworth’s Taxation Paradox and the Nature of
Demand and Supply Functions.” J.P.E. 40 (October 1932): 577-616.

Huffman, Wallace E. “The Contribution of Education and Extension to
Different Rates of Change.” Ph.D. dissertation, Univ. Chicago, 1972.

Japan Ministry of Education. Yearbook of the Ministry of Education. (In
Japanese.) Tokyo: Ministry Educ., 1904 and succeeding annual issues.

Japan Ministry of Health and Welfare. Vital Statistics of Japan. Tokyo:
Ministry Health and Welfare, 1970.

Japan Ministry of Health and Welfare, Institute of Population Problems.
“Summary of the Fifth Mainichi Survey of Family Planning Opinion.”
(In Japanese.) Jinko Mondai Kenkyu 77 (August 1959): 60-88.

-. Reproduction Indices for Japan, 1960-1965. (In Japanese.) Tokyo:
Ministry Health and Welfare, 1967.

-. Report on the Fifth Fertility Survey. (In Japanese.) Tokyo: Ministry
Health and Welfare, 1968.

Japan Prime Minister’s Office. Descriptive Materials on Japanese Vital
Statistics. (In Japanese.) Tokyo: Prime Minister’s Office, 1917 and
succeeding issues.

-. Annual Statistics of Japan 1960. Tokyo: Prime Minister’s Office,
1960. (a)

-. Censuses of Population of Japan. Tokyo: Prime Minister’s Office,
1955, 1960 (b), 1965, 1970.

-. Statistical Yearbook of Japan. Tokyo: Prime Minister’s Office,
various annual issues.

Johnson, George E., and Stafford, Frank P. “Social Returns to Quantity
and Quality of Schooling.” J. Human Resources 8 (Spring 1973): 139-55.

Kagan, Jerome S. “Inadequate Evidence and Illogical Conclusions.”
Harvard Educ. Rev. 39 (Spring 1969): 274-77.

Kagan, Jerome S., and Moss, H. A. “Parental Correlates of Child’s
IQ and Height.” Child Development 30 (September 1959): 325-32.

Keeley, Michael C. “A Model of Marital Formation: The Determinants
of the Optimal Age at First Marriage and Differences in Age at
Marriage.” Ph.D. dissertation, Univ. Chicago, 1973.




Kendall, M. G. “Proof of Relations Connected with the Tetrachoric
Series and Its Generalizations.” Biometrika 32 (1941): 196-99.

Khaldi, Nabil. “The Productive Value of Education in the U.S.
Agriculture, 1964.” Ph.D. dissertation, Southern Methodist Univ., 1972.

King, Willford Isbell. Employment Hours and Earnings in Prosperity and
Depression—United States, 1920-1922. New York: Nat. Bur. Econ. Res.,
1923.

Knight, Frank H. Risk, Uncertainty and Profit. Boston: Houghton-Mifflin,
1921.

-. “Diminishing Returns from Investment.” J.P.E. 52 (March
1944): 26-47.

-. The Economic Organization. New York: Harper & Row, 1965.
(Originally published by the College of the University of Chicago,
1933.)

Kogut, Edy L. “The Economic Analysis of Demographic Phenomena:
A Case Study of Brazil.” Ph.D. dissertation, Univ. Chicago, 1972.

Koopmans, Tjalling C. “Intertemporal Distribution and ‘Optimal’
Aggregate Economic Growth.” In W. Fellner et al., Ten Economic
Studies in the Tradition of Irving Fisher. New York: Wiley, 1967.

Kosters, M. “Income and Substitution Effects in a Family Labor Supply
Model.” P-3339. Santa Monica, Calif.: RAND Corp., 1966.

Kuznets, Simon. Modern Economic Growth: Rate, Structure, and Spread. New
Haven, Conn.: Yale Univ. Press, 1966.

-. Economic Growth of Nations. Cambridge, Mass.: Harvard Univ.
Press, 1971.

-. “Modern Economic Growth: Findings and Reflections.”
(Nobel Prize lecture, December 1971.) A.E.R. 63 (June 1973): 247-58.

Lancaster, Kelvin J. “A New Approach to Consumer Theory.” J.P.E.
74 (April 1966): 132-57.

Leibowitz, Arleen. “Women’s Allocation of Time to Market and
Nonmarket Activities: Differences by Education.” Ph.D. dissertation,
Columbia Univ., 1972.

Leontief, W. “Introduction to a Theory of the Internal Structure of
Functional Relationships.” Econometrica 15 (October 1947): 361-73.

Lewis, H. Gregg. “Income and Substitution Effects in Labor-Force
Participation and Hours of Work.” Mimeographed. Chicago: Dept.
Econ., Univ. Chicago, 1972.

Linder, Staffan B. The Harried Leisure Class. New York: Columbia Univ.
Press, 1970.

Low, S., and Spindler, A. The Child Care Arrangements of Working Mothers.
Children’s Bureau publication no. 461. Washington: Government
Printing Office, 1968.

McCall, J. J. “Economics of Information and Job Search.” Q.J.E. 84
(February 1970): 113-26.





Mainichi Newspaper Company. Family Planning Survey. (In Japanese.)
Tokyo: Mainichi Newspaper Company, 1950 and succeeding years.

-. Nihon no Jinko Kakumei [Japan’s population revolution]. (In
Japanese.) Tokyo: Mainichi Newspaper Co., 1970.

Malthus, T. R. An Essay on the Principle of Population and a Summary View
of the Principle of Population. Baltimore: Penguin, 1970. (Originally
published in 1798 and 1830.)

Marshall, Alfred. Principles of Economics. 8th ed. (1st ed. 1890.) London:
Macmillan, 1930.

Matsumoto, Y. Scott; Koizumi, Akira; and Nohara, Tadahiro. “Condom
Use in Japan.” Studies in Family Planning 3 (October 1972): 251-55.

Michael, Robert T. “Education and Fertility.” Mimeographed. New
York: Nat. Bur. Econ. Res., 1971.

-. The Effect of Education on Efficiency in Consumption. Occasional
Paper no. 116. New York: Nat. Bur. Econ. Res., 1972.

-. “Education and the Derived Demand for Children.” J.P.E. 81,
no. 2, suppl. (March/April 1973): S128-64.

Michael, Robert T., and Becker, Gary S. “On the New Theory of
Consumer Behavior.” Mimeographed. New York: Nat. Bur. Econ. Res.,
December 1972.

Michael, Robert T., with Edward P. Lazear. “On the Shadow Price of
Children.” Mimeographed. New York: Nat. Bur. Econ. Res., December
1971. (Presented at the 1971 meeting of the Econometric Society, New
Orleans, December 1971.)

Mincer, Jacob. “Labor Force Participation of Married Women.” In
Aspects of Labor Economics, edited by H. Gregg Lewis.
Universities-National Bureau Conference Series, no. 14. Princeton, N.J.:
Princeton Univ. Press, 1962.

-. “Market Prices, Opportunity Costs, and Income Effects.” In
Measurement in Economics: Studies in Mathematical Economics and
Econometrics in Memory of Yehuda Grunfeld, edited by Carl Christ et al.
Stanford, Calif.: Stanford Univ. Press, 1963.

-. “The Distribution of Labor Incomes: A Survey with Special
Reference to the Human Capital Approach.” J. Econ. Literature 8
(March 1970): 1-26.

-. Schooling, Experience, and Earnings. New York: Nat. Bur. Econ.
Res., 1974.

Mitchell, Wesley C. “The Backward Art of Spending Money.” A.E.R.
2 (June 1912): 269-81.

Morishima, Michio. Theory of Economic Growth. 2d rev. impression. Oxford:
Clarendon, 1970.

Mortensen, D. T. “Job Search, the Duration of Unemployment and the
Phillips Curve.” A.E.R. 60 (December 1970): 847-62.




Mosteller, Frederick. On Equality of Educational Opportunity. New York:
Vintage, 1972.

Muramatsu, M. “Effect of Induced Abortions on the Reduction of Birth
in Japan.” Milbank Memorial Fund Q. 38 (April 1960): 153-66.

-. “Problems in Procuring Contraceptive Materials in Rural Areas
in Japan.” In Research in Family Planning, edited by Clyde V. Kiser.
Princeton, N.J.: Princeton Univ. Press, 1962.

-. “Policy Measures and Social Changes for Fertility Decline in
Japan.” In World Population Conference, 1965. Vol. 2. New York:
United Nations, 1967.

Muth, Richard F. “Household Production and Consumer Demand
Functions.” Econometrica 34 (July 1966): 699-708.

National Council of Churches, Bureau of Research and Survey. Churches
and Church Membership in the U.S. Ser. A, no. 3. New York: Nat. Council
Churches, 1956.

Nelson, P. J. “Information and Consumer Behavior.” J.P.E. 78
(March/April 1970): 311-29.

Nelson, R., and Krashinsky, S. “The Organization and Planning of Day
Care for Children.” Washington: Urban Inst., August 1972.

Nerlove, Marc. “Lags in Economic Behavior.” Econometrica 40 (March
1972): 221-52. (a)

-. “Tuition and the Costs of Higher Education: Prolegomena to a
Conceptual Framework.” J.P.E. 80, no. 3, suppl. (May/June 1972):
S178-218. (b)

Nerlove, Marc, and Schultz, T. Paul. Love and Life between the Censuses: A
Model of Family Decision Making in Puerto Rico, 1950-1960.
RM-6322-AID. Santa Monica, Calif.: RAND Corp., September 1970.

O’Hara, Donald J. Changes in Mortality Levels and Family Decisions regarding
Children. R-914-RF. Santa Monica, Calif.: RAND Corp., February
1972.

Okazaki, Yoichi. “Meiji Period Population.” (In Japanese.) Keizai
Kenkyu 16 (July 1965): 207-13.

-. “An Analysis of the Socio-economic Factors Affecting Fertility in
Japan.” (In Japanese.) Jinko Mondai Kenkyu 114 (April 1970): 21-34.

Parnes, Herbert S.; Shea, John R.; Spitz, Ruth S.; and Zeller, Frederick
A. Dual Careers. Vol. 1. Manpower Research Monograph no. 21.
Washington: Dept. Labor, 1970.

Pitchford, J. D. Population in Economic Growth. Mimeographed monograph.
Canberra: Australian Nat. Univ., 1973.

Polachek, Solomon. “Work Experience and the Difference between Male
and Female Wages.” Ph.D. dissertation, Columbia Univ., 1973.

Rahm, M. “The Occupational Wage Structure.” Ph.D. dissertation,
Columbia Univ., 1971.




Razin, Assaf. “Investment in Human Capital and Economic Growth: A
Theoretical Study.” Ph.D. dissertation, Univ. Chicago, 1969.

-. “Investment in Human Capital and Economic Growth.”
Metroeconomica 24 (May/August 1972): 101-16.

Reid, Margaret G. Economics of Household Production. New York: Wiley,
1934.

-. Housing and Income. Chicago: Univ. Chicago Press, 1962.

Rheinstein, Max. Marriage Stability, Divorce, and the Law. Chicago:
Univ. Chicago Press, 1972.

Rivers, W. H. The Todas. London: Macmillan, 1906.

Rogers, Daniel C. “Private Rates of Return to Education in the U.S.: A
Case Study.” Yale Econ. Essays 9 (Spring 1969): 89-134.

Rosen, Sherwin. “Short-Run Employment Variation on Class-I Railroads
in the U.S., 1947-63.” Econometrica 36 (July/October 1968): 511-29.

Ross, Sue. “The Effect of Economic Variables on the Timing and Spacing
of Births.” Ph.D. research in progress, Columbia Univ., 1973.

Ruderman, F. Child Care and Working Mothers. New York: Child Welfare
League America, 1968.

Salzano, F. M.; Neel, J. V.; and Maybury-Lewis, D. “Further Studies on
the Xavante Indians.” American J. Human Genetics 19 (July 1967):
463-89.

Samuelson, Paul A. “Social Indifference Curves.” Q.J.E. 70 (February
1956): 1-22.

Schoggen, P. H., and Schoggen, Maxine F. “Behavior Units in
Observational Research.” Paper presented at the symposium on
Methodological Issues in Observational Research, American Psychological
Association, San Francisco, 1968.

Schultz, T. Paul. “Explanation of Birth Rate Changes over Space and
Time: A Study of Taiwan.” J.P.E. 81, no. 2, suppl. (March/April
1973): S238-74.

Schultz, Theodore W. “Education and Economic Growth.” In Social
Forces Influencing American Education, 1961, edited by Nelson B. Henry.
Sixtieth yearbook of the National Society for the Study of Education,
pt. 2. Chicago: Univ. Chicago Press, 1961.

-. Transforming Traditional Agriculture. New Haven, Conn.: Yale
Univ. Press, 1964.

-. Investment in Human Capital. New York: Free Press, 1971.

-. Human Resources. Fiftieth Anniversary Colloquium 6. New York:
Nat. Bur. Econ. Res., 1972. (a)

-. “The Increasing Economic Value of Human Time.” American J.
Agricultural Econ. 54 (December 1972): 843-50. (b)

-. “Explanation and Interpretations of the Increasing Value of
Human Time.” Mimeographed. Woody Thompson lecture, Midwest
Economics Association, Chicago, April 5, 1973. (a)




-. “The Value of Children: An Economic Perspective.” J.P.E. 81,
pt. 2, suppl. (March/April 1973): S2-13. (b)

-, ed. “New Economic Approaches to Fertility.” J.P.E. 81, pt. 2,
suppl. (March/April 1973). (c)

Schwartz, Aba. “Migration and Lifespan Earnings in the U.S.” Ph.D.
dissertation, Univ. Chicago, 1968.

Shaw, George Bernard. Man and Superman. In The Collected Works of
Bernard Shaw. Vol. 10, Ayot St. Lawrence ed. New York: Wise, 1930.

Shell, K. Essays on Theory of Optimal Economic Growth. Cambridge, Mass.:
M.I.T. Press, 1967.

Shinozaki, Nobuo. “Regional Problem of Population in Japan from the
Viewpoint of Fertility.” (In Japanese.) Jinko Mondai Kenkyu 118
(April 1971): 5-20.

Silver, Morris. “Births, Marriages, and Business Cycles in the United
States.” J.P.E. 73 (June 1965): 237-55.

-. “Births, Marriages and Income Fluctuations in the United
Kingdom and Japan.” Econ. Development and Cultural Change 14 (April
1966): 302-15.

Sloan, F. “Survival of Progeny in Developing Countries: An Analysis of
Evidence from Costa Rica, Mexico, East Pakistan, and Puerto Rico.”
R-773-AID. Santa Monica, Calif.: RAND Corp., July 1971.

Smith, James P. “The Life Cycle Allocation of Time in a Family Context.”
Ph.D. dissertation, Univ. Chicago, 1972.

Solow, Robert M. “A Contribution to the Theory of Economic Growth.”
Q.J.E. 70 (February 1956): 65-94.

Sontag, Lester W.; Baker, Charles T.; and Nelson, Virginia L. Mental
Growth and Personality Development: A Longitudinal Study. Monograph of
the Society for Research in Child Development 23, no. 68. Lafayette,
Ind.: Child Development Publications, 1958.

Stigler, George J. “The Economics of Information.” J.P.E. 69 (June
1961): 213-25.

Strotz, R. “The Empirical Implications of a Utility Tree.” Econometrica 25
(April 1957): 269-80.

-. “The Utility Tree—Correction and Further Appraisal.”
Econometrica 27 (July 1959): 482-88.

Suter, Larry E., and Miller, Herman P. “Components of Income
Differences between Men and Women in the United States.” Paper
presented at the American Sociological Association meetings, Denver,
1971.

Swan, T. W. “Economic Growth and Capital Accumulation.” Econ.
Record 32 (November 1956): 334-61.

Sweet, James A. “Family Composition and the Labor Force Activity of
Married Women in the United States.” Ph.D. dissertation, Univ.
Michigan, 1968.




Tachi, Minoru. “The Relationship between Production Increase and
Population Growth among Japanese Prefectures, 1925-1935.” (In
Japanese.) Jinko Mondai Kenkyu 5 (February 1946): 15-44.

Tachi, Minoru, and Arao, M. “Marriage Rate and Business Cycle in
Japan.” Jinko Mondai 2 (October 1935): 205-20.

Taeuber, Irene. The Population of Japan. Princeton, N.J.: Princeton Univ.
Press, 1958.

Taubman, Paul, and Wales, Terence. “Education as an Investment and
Screening Device.” Mimeographed. New York: Nat. Bur. Econ. Res.,
1972.

Terman, Lewis M. Mental and Physical Traits of a Thousand Gifted Children.
Genetic Studies of Genius, vol. 1. Stanford, Calif.: Stanford Univ. Press,
1925.

Terman, Lewis M.; Burks, Barbara S.; and Jensen, Dortha W. The
Promise of Youth. Genetic Studies of Genius, vol. 3. Stanford, Calif.:
Stanford Univ. Press, 1930.

Terman, Lewis M., and Oden, Melita H. The Gifted Group at Mid-Life.
Genetic Studies of Genius, vol. 5. Stanford, Calif.: Stanford Univ.
Press, 1959.

Umemura, Mataji. “Meiji Period Population Growth.” In Keizaishi ni
okeru Jinko [Population in economic history], edited by H. Komatsu.
Tokyo: Keio tsushin, 1969.

U.S., Bureau of the Census. County and City Data Books, 1962. Statistical
Abstract, suppl. Washington: Government Printing Office, 1962.

-. Statistical Abstract of the United States. Washington: Government
Printing Office, various issues.

-. U.S. Census of Housing, 1960. Vol. 1. States and Small Areas. Pt. 1.
United States Summary. Washington: Government Printing Office,
1963. (a)

-. U.S. Census of Population, 1960. Vol. 1. Characteristics of the
Population, pts. 1-52. Washington: Government Printing Office, 1963.
(b)

-. U.S. Censuses of Population and Housing, 1960. 1/1,000, 1/10,000:
Two National Samples of the Population of the United States, Description and
Technical Documentation. Washington: Government Printing Office,
1964.

U.S., Department of Health, Education, and Welfare. Vital Statistics of
the United States. Vol. 1. Washington: Government Printing Office,
1968. (a)

-. Social Security Administration, Office of Research and Statistics.
Workers under Social Security, 1960. Washington: Government Printing
Office, 1968. (b)




U.S., Office of Economic Opportunity, Day Care Policy Studies Group,
Institute for Interdisciplinary Studies. “Child Care Programs:
Estimation of Impacts and Evaluation of Alternative Federal Strategies.”
Report BOO-5121. Washington: Office of Economic Opportunity,
December 1971.

Wachtel, Paul. “The Returns to Investments in Higher Education
Revisited.” Mimeographed. New York: Nat. Bur. Econ. Res., 1973.

Wald, A. “The Approximate Determination of Indifference Surfaces by
Means of Engel Curves.” Econometrica 8 (April 1940): 144-75.

Wales, T. “Estimation of a Labor Supply Curve for Self-employed
Business Proprietors.” Internat. Econ. Rev. 14 (February 1973): 69-80.

Weisbrod, Burton A., and Karpoff, Peter. “Monetary Returns to College
Education, Student Ability, and College Quality.” Rev. Econ. and
Statis. 50 (November 1968): 491-97.

Welch, Finis. “Education in Production.” J.P.E. 78 (February 1970):
35-59.

Willis, Robert J. “A New Approach to the Economic Theory of Fertility
Behavior.” J.P.E. 81, pt. 2, suppl. (March/April 1973): S14-64.

Wilson, Francis. Labour in the South African Gold Mines, 1911-1969.
Cambridge: Cambridge Univ. Press, 1972.

Young, Kimball. Isn’t One Wife Enough? New York: Holt, 1954.

Zellner, Arnold. “An Efficient Method of Estimating Seemingly
Unrelated Regressions and Tests for Aggregation Bias.” J. American
Statis. Assoc. 57 (June 1962): 348-68.











